From vmehta94 at gmail.com Mon Oct 1 05:22:10 2018 From: vmehta94 at gmail.com (Vinayak Mehta) Date: Mon, 1 Oct 2018 14:52:10 +0530 Subject: [PSF-Community] Python library to extract data tables from PDF files In-Reply-To: References: Message-ID: Thanks Vasudev! [1] xtopdf looks great! will check it out. [2] I've faced similar issues w.r.t.junk characters, which may happen when the PDF contains an incorrect ToUnicode map, though I still have to dig deeper and I'm not 100% sure. I've also faced an issue where duplicate strings are assigned to the same cell. You can check it out on Github . I suspect that since PDF is a canvas-based model and not a text-based one, like you said, text is just transposed a bit further to make it look like bold text. I'll probably write a detailed blog post about the issues I faced while development :) Thanks for checking it out! On Sat, Sep 29, 2018 at 1:26 AM Vasudev Ram wrote: > Very interesting, and congrats, Vinayak. > > As a person interested in both PDF generation [1] and PDF text > extraction [2], I'm interested to know what issues you faced w.r.t. > accuracy of text extraction and also formatting. > > [1] I'm the creator of xtopdf, a Python toolkit for PDF generation > from other file formats; > > http://slides.com/vasudevram/xtopdf > > http://bitbucket.org/vasudevram/xtopdf > > [2] I worked on a project to extract text from PDF files. It was done > using a C library (xpdf), though, not a Python one. However, the text > extraction accuracy issues (some of which are technical issues > inherent in the PDF format, according to the vendor of xpdf, Glyph and > Cog) are language-independent. There were things like characters > getting transposed, missing characters, junk characters sometimes, > etc. (I also wrote a heuristics program to detect some such issues, > but that too could only reject the bad extracts, not make them > correct.) > > So the extraction was not 100% accurate, at least in my project. Also, > like I said, that vendor said the issues are inherent in PDF, partly > related to it being a canvas-based model, not a text-based one. > > I'll try to check out your project some time later. > > Cheers, > Vasudev > -- > vi quickstart: https://gumroad.com/l/vi_quick > Web site: https://vasudevram.github.io > Blog: https://jugad2.blogspot.com > Products: https://gumroad.com/vasudevram > > > While Tabula either gives either good output or fails miserably, Camelot > > gives you complete control over the extraction process with various > > configuration parameters! You can check out this section of the README > > for more > > information. Camelot also lets you plot various geometries like detected > > lines, intersections, tables in the PDF to debug and improve table > > extraction! You can check out this part of the documentation > > < > https://camelot-py.readthedocs.io/en/latest/user/advanced.html#plot-geometry > > > > for more information on that. > > > > >>>> Hello everyone! > >>>> > >>>> I recently released a Python library which lets users extract data > >>>> tables out of PDF files, my first open source library! Here's the > link: > >>>> https://github.com/socialcopsdev/camelot > >>>> > >>>> I've created a wiki page > >>>> < > https://github.com/socialcopsdev/camelot/wiki/Comparison-with-other-PDF-Table-Extraction-libraries-and-tools > > > >>>> comparing it to other open source PDF table extraction tools. I'm > >>>> currently > >>>> working on porting it to Python3! > >>>> > >>>> I would be really grateful if you could check it out and see if its > >>>> useful to you and give me any feedback that may help me improve it, by > >>>> replying here, opening an issue or a pull request! > >>>> > >>>> Looking forward to hearing from you all! > >>>> > >>>> Thanks for your time! > >>>> > >>>> Vinayak > >>>> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From vmehta94 at gmail.com Mon Oct 1 05:24:50 2018 From: vmehta94 at gmail.com (Vinayak Mehta) Date: Mon, 1 Oct 2018 14:54:50 +0530 Subject: [PSF-Community] Python library to extract data tables from PDF files In-Reply-To: References: Message-ID: Thanks Alvaro! rows looks top-notch, I'll check it out! I too have support for extracting tables from images on my roadmap, will drop by the rows gitter channel to discuss this further! :) On Sat, Sep 29, 2018 at 1:40 AM ?lvaro Justen [Turicas] < alvarojusten at gmail.com> wrote: > Hi, Vinayak! Good work, thanks for sharing. :) > > I'm the creator of the rows library[http://turicas.info/rows] and > implemented PDF support early this year (with 3 different strategies) > -- it's not released on PyPI yet since I'm fixing some bugs before > releasing the next version, but you can try it out by installing: > > pip install > git+https://github.com/turicas/rows.git at feature/plugin-pdf#egg=rows > pdfminer.six cached-property > > It's 100% written in Python and also has a command-line interface (so > you can run `rows convert http://example.com/file.pdf > newfile.(csv|xls|xlsx|html|sqlite)` or even `rows query "SELECT * FROM > table1 WHERE some_condition" http://example.com/file.pdf > --output=result.xls`). > > The idea behind the extraction algorithms is to be flexible, so you > can plug your own if you want (depending on how the PDF is created, > the objects will be very different and you cannot use the same > ordering/grouping strategy). > > I'm now implementing support to extract tables from images (and also > from PDFs with images), but it's probably not going to the next > version since I need a better OCR tool. What do you think in joining > efforts so we can have better libraries? I'm going to test the PDFs > you've cited with my code so we can compare better. Feel free to > contact me directly or join the chat at https://gitter.im/turicas/rows > > Cheers, > ?lvaro Justen "Turicas" > turicas.info / @turicas (twitter, github, youtube) > +55 41 999 311 221 > On Fri, Sep 28, 2018 at 11:43 AM Vinayak Mehta wrote: > > > > Hello everyone! > > > > I recently released a Python library which lets users extract data > tables out of PDF files, my first open source library! Here's the link: > https://github.com/socialcopsdev/camelot > > > > I've created a wiki page comparing it to other open source PDF table > extraction tools. I'm currently working on porting it to Python3! > > > > I would be really grateful if you could check it out and see if its > useful to you and give me any feedback that may help me improve it, by > replying here, opening an issue or a pull request! > > > > Looking forward to hearing from you all! > > > > Thanks for your time! > > > > Vinayak > > _______________________________________________ > > PSF-Community mailing list > > PSF-Community at python.org > > https://mail.python.org/mailman/listinfo/psf-community > -------------- next part -------------- An HTML attachment was scrubbed... URL: From vasudevram at gmail.com Mon Oct 1 11:26:24 2018 From: vasudevram at gmail.com (Vasudev Ram) Date: Mon, 1 Oct 2018 20:56:24 +0530 Subject: [PSF-Community] Python library to extract data tables from PDF files In-Reply-To: References: Message-ID: >Thanks Vasudev! NP. >[1] xtopdf looks great! will check it out. Cool! Thanks. >[2] I've faced similar issues w.r.t.junk characters, which may happen when the PDF contains an incorrect ToUnicode map, though I still have to dig deeper and I'm not 100% sure. I've also faced an issue where duplicate strings are assigned to the same cell. You can check it out on Github. I suspect that since PDF is a canvas-based model and not a text-based one, like you said, text is just transposed a bit further to make it look like bold text. I'll probably write a detailed blog post about the issues I faced while development :) Good idea :) >Thanks for checking it out! NP. -- vi quickstart: https://gumroad.com/l/vi_quick Web site: https://vasudevram.github.io Blog: https://jugad2.blogspot.com Products: https://gumroad.com/vasudevram From marybeth at sixfeetup.com Mon Oct 8 15:55:33 2018 From: marybeth at sixfeetup.com (MaryBeth Okerson) Date: Mon, 8 Oct 2018 19:55:33 +0000 Subject: [PSF-Community] PyData Indy Conference This Friday Message-ID: <6FEECFE8-5D33-4AD3-9BA5-47810BFD0A03@sixfeetup.com> For the first time, PyData comes to Indy! An educational program of NumFOCUS, PyData brings together users and developers of data analysis tools. On Oct. 12, IndyPy is partnering with NumFOCUS to bring you 9 experts in Big Data management. Don?t miss your chance to hear from industry experts on hot topics like: Data Visualization IoT Data Pipelines Natural Language Processing Big Data Authentication Machine Learning Ad-Hoc Analyses Tensorflow Models Speakers come from a wide array of small and large organizations: LifeOmic, State of Indiana, Leaf Software Solutions, DMC Insurance, Bastian Solutions, IUPUI, and Six Feet Up. This is the first PyData event in Indy: don't miss it! PyData Indy Fri. Oct. 12 | 8:30a - 5p LaunchFishers Spots are filling up. Register at http://indypy.org/pythology Thank you, MaryBeth Okerson Marketing & Event Coordinator -- Six Feet Up, Inc. Python & Cloud Expert Consulting Direct Line: +1 (317) 861-5948 Email: marybeth at sixfeetup.com https://sixfeetup.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From dinaldo at gmail.com Sat Oct 13 18:22:22 2018 From: dinaldo at gmail.com (Don Sheu) Date: Sat, 13 Oct 2018 15:22:22 -0700 Subject: [PSF-Community] Crisis Incidents In-Reply-To: References: Message-ID: Hi all, to give the short update, the member in crisis is alive. They are participating in a channel dedicated to emotional support on our Slack. They did repeat the promise to die by suicide subsequent to the first message. Maelle on of our organizers with prior experience as a nurse working with psychiatry interceded with the member in crisis. Part of our subsequent discussions amongst the organizers of PuPPy included tactics to limit mental health loading on our organizers. Unfortunately like many topics that deserve attention and a plan, we haven't completed any concrete guidelines or a policy on handling incidents in the future. I do want to keep at this one. Will share if we produce anything that's helpful. ? On Mon, Sep 24, 2018 at 1:01 PM Ian Stapleton Cordasco < graffatcolmingov at gmail.com> wrote: > I would always try to refer them to trained crisis counselors, e.g., > CrisisTextline. > > If they won't reach out, you can text in for coaching on how best to help > them > > Sent from my phone with my typo-happy thumbs. Please excuse my brevity > > On Mon, Sep 24, 2018, 14:46 Don Sheu wrote: > >> Recently, we as organizers dealt with somebody on Slack in distress. This >> created a lot of tough decisions on responses. Alan Vezina my fellow >> founder of PuPPy took a lead. Following some coaching from social workers, >> he made a call to 911. >> >> Wondering if other organizers have anything to share. >> >> We didn't build the Ark in this case. But now after the rain trying to >> backfill with some process so we're better prepared. >> >> -- >> Don Sheu >> 312.880.9389 >> - - - - - - - - - - - - - - - - - - >> >> >> My Python user group convenes every month 2nd Wednesdays >> http://www.meetup.com/PSPPython/events/232708762/ >> >> >> *CONFIDENTIALITY NOTICE*: *The information contained in this message may >> be protected trade secrets or protected by applicable intellectual property >> laws of the United States and International agreements. If you believe that >> it has been sent to you in error, do not read it. Please immediately reply >> to the sender that you have received the message in error. Then delete it. >> Thank you.* >> ? >> _______________________________________________ >> PSF-Community mailing list >> PSF-Community at python.org >> https://mail.python.org/mailman/listinfo/psf-community >> > -- Don Sheu 312.880.9389 - - - - - - - - - - - - - - - - - - My Python user group convenes every month 2nd Wednesdays http://www.meetup.com/PSPPython/events/232708762/ *CONFIDENTIALITY NOTICE*: *The information contained in this message may be protected trade secrets or protected by applicable intellectual property laws of the United States and International agreements. If you believe that it has been sent to you in error, do not read it. Please immediately reply to the sender that you have received the message in error. Then delete it. Thank you.* -------------- next part -------------- An HTML attachment was scrubbed... URL: From brianhray at gmail.com Mon Oct 15 11:43:32 2018 From: brianhray at gmail.com (Brian Ray) Date: Mon, 15 Oct 2018 10:43:32 -0500 Subject: [PSF-Community] Crisis Incidents In-Reply-To: References: Message-ID: Scary situation. Glad they made it through. I am sure it helped having the right person intervene. Some larger for profit organizations have volunteer nurses and hotlines. This article has some good resources too: https://www.businessinsider.com/suicide-hotline-saved-my-life-2018-8 -------------- next part -------------- An HTML attachment was scrubbed... URL: From pjf at maepub.com Mon Oct 15 11:57:44 2018 From: pjf at maepub.com (Peter Farrell) Date: Mon, 15 Oct 2018 08:57:44 -0700 Subject: [PSF-Community] Crisis Incidents In-Reply-To: References: Message-ID: Thanks for the update Don. If any documentation or guidelines get written, I'd be interested in reading them. My partner is a Doctor of Psychology so we've already chatted about this and what are the first steps for mental health first aid. On Oct 15, 2018 7:29 AM, "Don Sheu" wrote: Hi all, to give the short update, the member in crisis is alive. They are participating in a channel dedicated to emotional support on our Slack. They did repeat the promise to die by suicide subsequent to the first message. Maelle on of our organizers with prior experience as a nurse working with psychiatry interceded with the member in crisis. Part of our subsequent discussions amongst the organizers of PuPPy included tactics to limit mental health loading on our organizers. Unfortunately like many topics that deserve attention and a plan, we haven't completed any concrete guidelines or a policy on handling incidents in the future. I do want to keep at this one. Will share if we produce anything that's helpful. ? On Mon, Sep 24, 2018 at 1:01 PM Ian Stapleton Cordasco < graffatcolmingov at gmail.com> wrote: > I would always try to refer them to trained crisis counselors, e.g., > CrisisTextline. > > If they won't reach out, you can text in for coaching on how best to help > them > > Sent from my phone with my typo-happy thumbs. Please excuse my brevity > > On Mon, Sep 24, 2018, 14:46 Don Sheu wrote: > >> Recently, we as organizers dealt with somebody on Slack in distress. This >> created a lot of tough decisions on responses. Alan Vezina my fellow >> founder of PuPPy took a lead. Following some coaching from social workers, >> he made a call to 911. >> >> Wondering if other organizers have anything to share. >> >> We didn't build the Ark in this case. But now after the rain trying to >> backfill with some process so we're better prepared. >> >> -- >> Don Sheu >> 312.880.9389 >> - - - - - - - - - - - - - - - - - - >> >> >> My Python user group convenes every month 2nd Wednesdays >> http://www.meetup.com/PSPPython/events/232708762/ >> >> >> *CONFIDENTIALITY NOTICE*: *The information contained in this message may >> be protected trade secrets or protected by applicable intellectual property >> laws of the United States and International agreements. If you believe that >> it has been sent to you in error, do not read it. Please immediately reply >> to the sender that you have received the message in error. Then delete it. >> Thank you.* >> ? >> _______________________________________________ >> PSF-Community mailing list >> PSF-Community at python.org >> https://mail.python.org/mailman/listinfo/psf-community >> > -- Don Sheu 312.880.9389 - - - - - - - - - - - - - - - - - - My Python user group convenes every month 2nd Wednesdays http://www.meetup.com/PSPPython/events/232708762/ *CONFIDENTIALITY NOTICE*: *The information contained in this message may be protected trade secrets or protected by applicable intellectual property laws of the United States and International agreements. If you believe that it has been sent to you in error, do not read it. Please immediately reply to the sender that you have received the message in error. Then delete it. Thank you.* _______________________________________________ PSF-Community mailing list PSF-Community at python.org https://mail.python.org/mailman/listinfo/psf-community -------------- next part -------------- An HTML attachment was scrubbed... URL: From ewa at python.org Tue Oct 16 15:35:49 2018 From: ewa at python.org (Ewa Jodlowska) Date: Tue, 16 Oct 2018 14:35:49 -0500 Subject: [PSF-Community] 2018 Python Developer Survey Message-ID: Hi PSF members - As some of you may have seen, the 2018 Python Developer Survey is available. If you haven't taken the survey yet, please do so soon! Additionally, we'd appreciate any assistance you all can provide with sharing the survey with your local Python groups, schools, work colleagues, etc. We will keep the survey open through October 26th, 2018. https://surveys.jetbrains.com/s3/c4-python-developers-survey-2018 We?re counting on your help to better understand how different Python developers use Python and related frameworks, tools, and technologies. We also hope you'll enjoy going through the questions. The survey is organized in partnership between the Python Software Foundation and JetBrains. Together we will publish the aggregated results. We will randomly choose and announce 100 winners to receive a Python Surprise Gift Pack (must complete the full survey to qualify). Thank you, Ewa Jodlowska -------------- next part -------------- An HTML attachment was scrubbed... URL: From tjreedy at udel.edu Wed Oct 17 13:22:08 2018 From: tjreedy at udel.edu (Terry Reedy) Date: Wed, 17 Oct 2018 13:22:08 -0400 Subject: [PSF-Community] 2018 Python Developer Survey In-Reply-To: References: Message-ID: <2d86f26e-cfe6-f210-7800-db8ce6369e8d@udel.edu> The survey ends with Check the findings of our Python Developers Survey 2017. https://www.jetbrains.com/research/python-developers-survey-2017/ which links to Raw Data https://www.jetbrains.com/research/python-developers-survey-2017/#raw-data which contains Get Python Developer Survey 2017 Raw Data linked https://drive.google.com/drive/folders/1SxFwVpsRFig-h1WXfGcBlhBVdEVQElw7 which I know nothing about and cannot access. Please put the data somewhere accessible. From ntoll at ntoll.org Wed Oct 17 17:37:37 2018 From: ntoll at ntoll.org (Nicholas H.Tollervey) Date: Wed, 17 Oct 2018 22:37:37 +0100 Subject: [PSF-Community] 2018 Python Developer Survey In-Reply-To: <2d86f26e-cfe6-f210-7800-db8ce6369e8d@udel.edu> References: <2d86f26e-cfe6-f210-7800-db8ce6369e8d@udel.edu> Message-ID: I see a CSV with the raw data at the final link you quoted. Could the problem be at your end..? N. On 17/10/18 18:22, Terry Reedy wrote: > The survey ends with > Check the findings of our Python Developers Survey 2017. > https://www.jetbrains.com/research/python-developers-survey-2017/ > which links to Raw Data > https://www.jetbrains.com/research/python-developers-survey-2017/#raw-data > which contains Get Python Developer Survey 2017 Raw Data linked > https://drive.google.com/drive/folders/1SxFwVpsRFig-h1WXfGcBlhBVdEVQElw7 > which I know nothing about and cannot access. > > Please put the data somewhere accessible. > _______________________________________________ > PSF-Community mailing list > PSF-Community at python.org > https://mail.python.org/mailman/listinfo/psf-community -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 488 bytes Desc: OpenPGP digital signature URL: From betsy at python.org Fri Oct 19 11:19:48 2018 From: betsy at python.org (Betsy Waliszewski) Date: Fri, 19 Oct 2018 08:19:48 -0700 Subject: [PSF-Community] 2018 PSF Recurring Giving Campaign Message-ID: Hi all, Our last fundraiser of the year is launching on Monday! I would love your help promoting it via social media, to your friends and co-workers :-). Some companies have a matching gift program or you could offer your own "match". If anyone has a company that would be willing to offer a "match", please let me know and we can promote it. Here is the link to the campaign that will go live at 5:00 am Eastern time on Monday: https://www.python.org/psf/donations/2018-q4-drive/ Here are two tweets to share or feel free to write your own: ========================== Our Recurring Giving Campaign starts TODAY, October 22. Help us raise $30K! https://www.python.org/psf/donations/2018-q4-drive/ #idonatedtothepsf #ijoinedthepsf Contribute to our Recurring Giving Campaign & help us reach our goal of $30K. The PSF is a non-profit organization entirely supported by its sponsors, members & the public. https://www.python.org/psf/donations/2018-q4-drive/ #idonatedtothepsf #ijoinedthepsf ========================== Thanks in advance for your help! Cheers, Betsy -------------- next part -------------- An HTML attachment was scrubbed... URL: From anthony.flury at btinternet.com Sat Oct 20 07:58:52 2018 From: anthony.flury at btinternet.com (Anthony Flury) Date: Sat, 20 Oct 2018 12:58:52 +0100 Subject: [PSF-Community] Python library to extract data tables from PDF files In-Reply-To: References: Message-ID: Have you published it on pypi ? Make it really easy for people to install it if they need it. On 28/09/2018 07:31, Vinayak Mehta wrote: > Hello everyone! > > I recently released a Python library which lets users extract data > tables out of PDF files, my first open source library! Here's the > link: https://github.com/socialcopsdev/camelot > > I've created a wiki page > > comparing it to other open source PDF table extraction tools. I'm > currently working on porting it to Python3! > > I would be really grateful if you could check it out and see if its > useful to you and give me any feedback that may help me improve it, by > replying here, opening an issue or a pull request! > > Looking forward to hearing from you all! > > Thanks for your time! > > Vinayak > > _______________________________________________ > PSF-Community mailing list > PSF-Community at python.org > https://mail.python.org/mailman/listinfo/psf-community -- Anthony Flury *Email* : anthony.flury at btinternet.com *Twitter* : @TonyFlury -------------- next part -------------- An HTML attachment was scrubbed... URL: From vmehta94 at gmail.com Mon Oct 22 13:21:26 2018 From: vmehta94 at gmail.com (Vinayak Mehta) Date: Mon, 22 Oct 2018 22:51:26 +0530 Subject: [PSF-Community] Python library to extract data tables from PDF files In-Reply-To: References: Message-ID: Hi Anthony! Yes it's on PyPI! You can install it using "pip install camelot-py" Also, I just released a web interface for the library! You can check it out here: https://github.com/camelot-dev/excalibur You can install it using "pip install excalibur-py" or download the Windows/Linux executable from the releases page. Keep looking up! Vinayak On Sat, Oct 20, 2018 at 5:36 PM Anthony Flury via PSF-Community < psf-community at python.org> wrote: > Have you published it on pypi ? Make it really easy for people to install > it if they need it. > On 28/09/2018 07:31, Vinayak Mehta wrote: > > Hello everyone! > > I recently released a Python library which lets users extract data tables > out of PDF files, my first open source library! Here's the link: > https://github.com/socialcopsdev/camelot > > I've created a wiki page > > comparing it to other open source PDF table extraction tools. I'm currently > working on porting it to Python3! > > I would be really grateful if you could check it out and see if its useful > to you and give me any feedback that may help me improve it, by replying > here, opening an issue or a pull request! > > Looking forward to hearing from you all! > > Thanks for your time! > > Vinayak > > _______________________________________________ > PSF-Community mailing listPSF-Community at python.orghttps://mail.python.org/mailman/listinfo/psf-community > > -- > Anthony Flury > *Email* : anthony.flury at btinternet.com > *Twitter* : @TonyFlury > _______________________________________________ > PSF-Community mailing list > PSF-Community at python.org > https://mail.python.org/mailman/listinfo/psf-community > -------------- next part -------------- An HTML attachment was scrubbed... URL: From vasudevram at gmail.com Mon Oct 22 13:32:02 2018 From: vasudevram at gmail.com (Vasudev Ram) Date: Mon, 22 Oct 2018 23:02:02 +0530 Subject: [PSF-Community] Python library to extract data tables from PDF files In-Reply-To: References: Message-ID: Hey, Vinayak, Cool product names, man! You have my moral support. I vote for the next one being called Merlin - a wizard - get it? I must rethink better names for my own products :) Cheers ... On 10/22/18, Vinayak Mehta wrote: > Hi Anthony! > > Yes it's on PyPI! You can install it using "pip install camelot-py" > > Also, I just released a web interface for the library! You can check it out > here: https://github.com/camelot-dev/excalibur > > You can install it using "pip install excalibur-py" or download the > Windows/Linux executable from the releases page. > > Keep looking up! > > Vinayak > > On Sat, Oct 20, 2018 at 5:36 PM Anthony Flury via PSF-Community < > psf-community at python.org> wrote: > >> Have you published it on pypi ? Make it really easy for people to install >> it if they need it. >> On 28/09/2018 07:31, Vinayak Mehta wrote: >> >> Hello everyone! >> >> I recently released a Python library which lets users extract data tables >> out of PDF files, my first open source library! Here's the link: >> https://github.com/socialcopsdev/camelot >> >> I've created a wiki page >> >> comparing it to other open source PDF table extraction tools. I'm >> currently >> working on porting it to Python3! >> >> I would be really grateful if you could check it out and see if its >> useful >> to you and give me any feedback that may help me improve it, by replying >> here, opening an issue or a pull request! >> >> Looking forward to hearing from you all! >> >> Thanks for your time! >> >> Vinayak >> >> _______________________________________________ >> PSF-Community mailing >> listPSF-Community at python.orghttps://mail.python.org/mailman/listinfo/psf-community >> >> -- >> Anthony Flury >> *Email* : anthony.flury at btinternet.com >> *Twitter* : @TonyFlury >> _______________________________________________ >> PSF-Community mailing list >> PSF-Community at python.org >> https://mail.python.org/mailman/listinfo/psf-community >> > -- vi quickstart: https://gumroad.com/l/vi_quick Web site: https://vasudevram.github.io Blog: https://jugad2.blogspot.com Products: https://gumroad.com/vasudevram From vmehta94 at gmail.com Mon Oct 22 20:51:34 2018 From: vmehta94 at gmail.com (Vinayak Mehta) Date: Tue, 23 Oct 2018 06:21:34 +0530 Subject: [PSF-Community] Python library to extract data tables from PDF files In-Reply-To: References: Message-ID: Thanks for the support Vasudev! They're both themed after the Arthurian legend. https://excalibur-py.readthedocs.io/en/master/user/intro.html#what-s-in-a-name On Mon, Oct 22, 2018 at 11:02 PM Vasudev Ram wrote: > Hey, Vinayak, > > Cool product names, man! > > You have my moral support. > > I vote for the next one being called Merlin - a wizard - get it? > > I must rethink better names for my own products :) > > Cheers ... > > On 10/22/18, Vinayak Mehta wrote: > > Hi Anthony! > > > > Yes it's on PyPI! You can install it using "pip install camelot-py" > > > > Also, I just released a web interface for the library! You can check it > out > > here: https://github.com/camelot-dev/excalibur > > > > You can install it using "pip install excalibur-py" or download the > > Windows/Linux executable from the releases page. > > > > Keep looking up! > > > > Vinayak > > > > On Sat, Oct 20, 2018 at 5:36 PM Anthony Flury via PSF-Community < > > psf-community at python.org> wrote: > > > >> Have you published it on pypi ? Make it really easy for people to > install > >> it if they need it. > >> On 28/09/2018 07:31, Vinayak Mehta wrote: > >> > >> Hello everyone! > >> > >> I recently released a Python library which lets users extract data > tables > >> out of PDF files, my first open source library! Here's the link: > >> https://github.com/socialcopsdev/camelot > >> > >> I've created a wiki page > >> < > https://github.com/socialcopsdev/camelot/wiki/Comparison-with-other-PDF-Table-Extraction-libraries-and-tools > > > >> comparing it to other open source PDF table extraction tools. I'm > >> currently > >> working on porting it to Python3! > >> > >> I would be really grateful if you could check it out and see if its > >> useful > >> to you and give me any feedback that may help me improve it, by replying > >> here, opening an issue or a pull request! > >> > >> Looking forward to hearing from you all! > >> > >> Thanks for your time! > >> > >> Vinayak > >> > >> _______________________________________________ > >> PSF-Community mailing > >> listPSF-Community at python.orghttps:// > mail.python.org/mailman/listinfo/psf-community > >> > >> -- > >> Anthony Flury > >> *Email* : anthony.flury at btinternet.com > >> *Twitter* : @TonyFlury > >> _______________________________________________ > >> PSF-Community mailing list > >> PSF-Community at python.org > >> https://mail.python.org/mailman/listinfo/psf-community > >> > > > > > -- > vi quickstart: https://gumroad.com/l/vi_quick > Web site: https://vasudevram.github.io > Blog: https://jugad2.blogspot.com > Products: https://gumroad.com/vasudevram > -------------- next part -------------- An HTML attachment was scrubbed... URL: From betsy at python.org Tue Oct 23 10:45:15 2018 From: betsy at python.org (Betsy Waliszewski) Date: Tue, 23 Oct 2018 07:45:15 -0700 Subject: [PSF-Community] Recurring Giving Campaign Message-ID: Hello, fellow Pythonistas! Our last fundraiser of the year started yesterday. We need your help promoting it via social media, to your friends, and co-workers :-). Some companies have a matching gift program, so if yours does, please think about taking advantage of it. You can also create your own "match". Here is the link to the campaign: https://www.python.org/psf/donations/2018-q4-drive/ If you could share the tweet below, we'd appreciate it. ========================== Contribute to our Recurring Giving Campaign & help us reach our goal of $30K. The PSF is a non-profit organization entirely supported by its sponsors, members & the public. https://www.python.org/psf/donations/2018-q4-drive/ #idonatedtothepsf #ijoinedthepsf ========================== Thanks to everyone for your support of Python! Cheers, Betsy -------------- next part -------------- An HTML attachment was scrubbed... URL: From betsy at python.org Tue Oct 30 13:09:28 2018 From: betsy at python.org (Betsy Waliszewski) Date: Tue, 30 Oct 2018 10:09:28 -0700 Subject: [PSF-Community] 2018 PSF Recurring Giving Campaign Message-ID: Hi everyone, Just a reminder that our recurring giving campaign is live: https://www.python.org/psf/donations/2018-q4-drive/ I would love your help promoting it via social media, to your friends and co-workers :-). If your company has a matching gift program, please let me know and we can promote it. Feel free to use the following suggested tweet: Contribute to our Recurring Giving Campaign & help us reach our goal of $30K. The PSF is a non-profit organization entirely supported by its sponsors, members & the public. https://www.python.org/psf/donations/2018-q4-drive/ #idonatedtothepsf #ijoinedthepsf Thanks so much! Cheers, Betsy -------------- next part -------------- An HTML attachment was scrubbed... URL: