pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Peter Murray-Rust <pm...@cam.ac.uk>
Subject Re: Fwd: Apache in 2018 - By The Digits
Date Wed, 09 Jan 2019 09:16:16 GMT
My congratulations to Tilman and the wider PDFBox team. I shall be tweeting
expansively about this as I have started a series about our thanks to Open
Source projects.
I think I first came across PDFBox about 8 years ago and have been on the
mailing list since then. (I'm a user, advocate, and build and deploy Open
tools based on PDFBox and I'll mail about this separately.). I'm not a
committer.

When I started with PDFBox it was a sad necessity because people author
documents in PDF  rather than semantic systems such as XML (which I helped
develop). PDFBox was fairly young and perhaps 1 mail a week. Much of the
code had been developed by Ben Litchfield (I don't know the history) and
Ben deserves many thanks for creating a core that could be sustained and
developed. (Please correct inaccuracies). Reading PDF in the wild is an
awful business as there are so many different ways of creating documents
which display the same photons to sighted humans but which have bizarre
bits. Much of the correspondence and issues are about a wide range of
arcane details. I don't think reading PDF will be a completely solved
problem any time soon.

There is a strong informal team at the centre of PDFBox which has kept it
going strongly for a decade. The team are helpful and respond rapidly,
courteously and constructively. When necessary (PDFBox2) they are not
afraid to massively refactor and are patient about timescales.

P.

I shall write separately about my current project AMI which aims to extract
all the contents of scientific publications in PDF and make them fully
semantic.

On Wed, Jan 9, 2019 at 5:20 AM Sally Khudairi <sk@apache.org> wrote:

> Thanks, Tilman --I just finished reading the article a few minutes ago.
> I'm crazy about it :-)
>
> Great job, everyone!
>
> - - -
> Vice President Marketing & Publicity
> Vice President Sponsor Relations
> The Apache Software Foundation
>
> Tel +1 617 921 8656 | sk@apache.org
>
>
> On Tue, Jan 8, 2019, at 23:58, Tilman Hausherr wrote:
> > https://www.cbronline.com/feature/apache-top-5
> > Meet the Apache Software Foundation’s Top 5 Code Committers
> >  (They’re powered by chocolate and Oolong tea…)
> >
> >
> > Tilman**
> >
> > Am 01.01.2019 um 12:40 schrieb Andreas Lehmkühler:
> >> Hi,  Sally prepared some digits for 2018 and I was surprised to see
> >> one of our fellow PDFBox committers among the Top 5 committers as we
> >> are a small community compared to other ASF projects.  Thanks Tilman
> >> for your ongoing efforts to improve PDFBox in the last year, the time
> >> before that and hopefully in the future!!!  A happy new year to
> >> everyone  Cheers, Andreas   -------- Ursprüngliche Nachricht --------
> >> Von: Sally Khudairi <sk@apache.org> Gesendet: 1. Januar 2019 08:22:25
> >> MEZ An: Apache Announce List <announce@apache.org> Betreff: Apache in
> >> 2018 - By The Digits  [this announcement is available online at
> >> https://s.apache.org/Apache2018Digits ]  It's been a great year for
> >> the Apache community at-large. With nearly 200M lines of code under
> >> the ASF's stewardship, our ongoing success is the result of community-
> >> led development "The Apache Way", executed through the collaborative
> >> efforts of more than 300 Apache projects and their communities.
> >> Highlights include:  Apache Projects —https://projects.apache.org/ -
> >> Total number of projects + sub-projects - 328 (not including Apache
> >> Labs initiatives) - Top-Level Projects - 198 - Podlings in the Apache
> >> Incubator - 51 - Other groups, including operations/support - 62
> >> Community/People —http://home.apache.org/ - Apache Committers - 7,032
> >> (6,693 active) - ASF Members (individuals) - 730 - New Members
> >> elected - 44   Apache Projects/Code —
> >> https://projects.apache.org/statistics.html  3,208 Apache Committers
> >> changed 78,493,228 lines of code over 201,220 commits. We also
> >> welcomed 4,638 new code contributors and 15,861 new issue/pull
> >> request contributors.  Top 5 Apache Code Committers - Andrea
> >> Cosentino (2,508 commits; 237,224 lines changed) - Jean-Baptiste
> >> Onofré (2,098 commits; 1,208,851 lines changed) - Duo Zhang (1,956
> >> commits; 809,085 lines changed) - Mark Thomas (1,823 commits; 179,883
> >> lines changed) - Tilman Hausherr (1,736 commits; 81,940 lines
> >> changed)  Top 5 Apache Project Repositories by Commits - Hadoop -
> >> HBase - Beam - Camel - Flink  Top 5 Apache Project Repositories by
> >> Size (Lines of Code) - OpenOffice (7,822,699) - NetBeans (7,741,506)
> >> - Flex (whiteboard: 5,233,722; SDK 3,933,522) - Mynewt
> >> (documentation: 4,381.072) - Hadoop (3,881,797)  "If it didn't happen
> >> on-list, it didn't happen." —https://lists.apache.org/  - Total
> >> number of mailing lists 1,131 - 19,435 authors sent 1,497,005 emails
> >> on 505,793 topics  Top 5 most active Apache user@ mailing lists -
> >> Flink - Lucene - Ignite - Cassandra - Kafka  Top 5 most active Apache
> >> dev@ mailing lists - Beam - Ignite - Kafka - Tomcat - James
> >> Contributor License Agreements and Software Grants —
> >> https://www.apache.org/licenses/  We welcomed an average of 387 new
> >> code contributors and 1,250 new people filing issues each month.
> >> Individuals who are granted write access to the Apache repositories
> >> must submit an Individual Contributor License Agreement (ICLA).
> >> Corporations that have assigned employees to work on Apache projects
> >> as part of an employment agreement may sign a Corporate CLA (CCLA)
> >> for contributing intellectual property via the corporation.
> >> Individuals or corporations donating a body of existing software or
> >> documentation to one of the Apache projects need to execute a formal
> >> Software Grant Agreement (SGA) with the ASF.  - ICLAs signed - 831 -
> >> CCLAs signed - 35 - Software Grants submitted - 25  Sponsorship and
> >> Individual Support —http://apache.org/foundation/contributing.html
> >> Thank you to our hundreds of individual donors and Sponsors whose
> >> generous support helps offset the ASF's day-to-day operating expenses
> >> that include Infrastructure, Accounting, Fundraising, Marketing &
> >> Publicity, and more.  - Platinum: Cloudera, Comcast, Facebook,
> >> Google, LeaseWeb, Microsoft, Oath, Pineapple Fund, and Tencent Cloud.
> >> - Gold: Anonymous, ARM, Bloomberg, Handshake, Hortonworks, Huawei,
> >> IBM, Indeed, Pivotal, and Union Investment.  - Silver: Aetna, Alibaba
> >> Cloud Computing, Baidu, Budget Direct, Capital One, Cerner, Inspur,
> >> ODPi, Private Internet Access, Red Hat, and Target.  - Bronze:
> >> Airport Rentals, Best VPN, The Blog Starter, Bookmakers, Cash Store,
> >> Casino Bonus, Casino2k, Cloudsoft, Emerio, Footprints Recruiting,
> >> HostChecka.com, HostingAdvice.com, HostPapa Web Hosting, The Linux
> >> Foundation, Mobile Slots, Mutuo Kredit AG, Online Holland Casino, RX-
> >> M, SCAMS.info, Site Builder Report, Talend, The Best VPN, Twitter,
> >> and Web Hosting Secret Revealed.  ASF Targeted Sponsors provide the
> >> Foundation with contributions for specific activities or programs.  -
> >> Targeted Platinum: DLA Piper, Microsoft, Oath, OSU Open Source Labs,
> >> and Sonatype.  - Targeted Gold: Atlassian, The CrytpoFund, Datadog,
> >> PhoenixNAP, and Quenda.  - Targeted Silver: Amazon Web Services,
> >> HotWax Systems, and Rackspace.  - Targeted Bronze: Bintray, Education
> >> Networks of America, Google, Hopsie, No-IP, PagerDuty, Peregrine
> >> Computer Consultants Corporation, Sonic.net, SURFnet, and Virtru.
> >> Together, our Members, Committers, contributors, users, supporters,
> >> and sponsors continue to build on our mission of providing Open
> >> Source software for the public good and are helping keep Apache
> >> software accessible to everyone.  Wishing you the best in 2019!  # #
> >> #  NOTE: you are receiving this message because you are subscribed to
> >> the announce@apache.org distribution list. To unsubscribe, send email
> >> from the recipient account to announce-unsubscribe@apache.org with
> >> the word "Unsubscribe" in the subject line.
>
> >>>
>
>
>
>

-- 
Peter Murray-Rust
Reader Emeritus in Molecular Informatics
Unilever Centre, Dept. Of Chemistry
University of Cambridge
CB2 1EW, UK
+44-1223-763069

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message