[BangPypers] Idea
Dorai Thodla
dorai at thodla.com
Sat Mar 8 03:07:43 CET 2008
Awesome. Great going.
I also read the interview on Spike Developer Zone.
Dorai
On Fri, Mar 7, 2008 at 9:58 PM, Anand Balachandran Pillai <
abpillai at gmail.com> wrote:
> I actually went ahead and did this today. I registered a new blog
> at http://pythonjobs.blogspot.com . It took me roughly 3 hours to
> write a custom crawler using HarvestMan to crawl monthly archives
> of bangpypers and post Jobs automatically to blogger. It uses
> the Google blogger API in gdata-python-client library.
>
> http://code.google.com/p/gdata-python-client/
>
> If someone wants to see the code of the custom crawler
> it is available in the HarvestMan-2.0 trunk.
>
>
> http://svn.eiao.net/robacc/experimental/HarvestMan-2.0/harvestman/apps/postingcrawler.py
>
> I wrote a custom blogger module by using sample code from the google
> blogger
> API. Since it contains google's code, I have not checked it into the
> subversion trunk.
> If someone wants the code, let me know.
>
> To make sure your jobs are in the Blog, just ensure that you make your
> job posts with [JOB] in the title. That is all the crawler looks for.
>
> Regards,
> --Anand
>
>
> On Fri, Mar 7, 2008 at 6:32 PM, Anand Balachandran Pillai
> <abpillai at gmail.com> wrote:
> > On Fri, Mar 7, 2008 at 6:30 PM, Anand Balachandran Pillai
> > <abpillai at gmail.com> wrote:
> > >
> > > On Fri, Mar 7, 2008 at 6:05 PM, Harish Krishnan <
> bugsy.seigel at gmail.com> wrote:
> > > >
> > > >
> > > > On 07-Mar-08, at 4:57 PM, Anand Balachandran Pillai wrote:
> > > >
> > > >
> > > > 1. Automate blog posting backend when a mail which seems to
> mention a new
> > > > job posting is posted. This can be done bye requiring specific
> keyword(s)
> > > > in
> > > > the subject for job postings such as [JOB]. I am not sure, but
> mailman
> > > > might
> > > > allow such customizations in the backend.
> > > >
> > > > Sounds like a nice idea. It would also be good if we have a policy
> for not
> > > > posting jobs directly on the mailing list else it will lead to
> duplication.
> > > >
> > > >
> > > >
> > > > 2. An incremental crawler (always!) which monitors the group for
> postings
> > > > and
> > > > automatically fetches JOB posting posts (similar approach, use
> keywords or
> > > > naive bayesian classification!) and post it to a specific blog.
> > > >
> > > >
> > > >
> > > > This is even better. what does it take for this to work?
> > > >
> > >
> > > Nothing much. Just give me half a day to create a custom crawler for
> this
> > > on top of HarvestMan :)
> > Ok, this is not posturing :) If someone can register an appropriate
> blog and
> > send me the URL and the auth credentials I will create the "job
> > posting crawler".
> > Only that someone has to bear the responsibility of running it on
> > a frequent basis.
> >
> > gnuyoga, can you do this ? It would be a nice exercise to write a
> custom
> > crawler for this...
> >
> > >
> > > > Harish
> > >
> > >
> > > >
> > > >
> > > > _______________________________________________
> > > > BangPypers mailing list
> > > > BangPypers at python.org
> > > > http://mail.python.org/mailman/listinfo/bangpypers
> > > >
> > > >
> > >
> > >
> > >
> > > --
> > > -Anand
> > >
> >
> > Thanks
> >
> > --
> > -Anand
> >
>
>
>
> --
> -Anand
> _______________________________________________
> BangPypers mailing list
> BangPypers at python.org
> http://mail.python.org/mailman/listinfo/bangpypers
>
--
Dorai Thodla (http://www.thodla.com)
US: 650-206-2688
India: 98408 89258
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/bangpypers/attachments/20080308/3551dd52/attachment.htm
More information about the BangPypers
mailing list