From econdr at googlemail.com Thu May 2 16:31:49 2019 From: econdr at googlemail.com (Eoin C) Date: Thu, 2 May 2019 21:31:49 +0100 Subject: [Pandas-dev] Submitting PR for the first time Message-ID: <51E8A655-0588-4356-8842-24AE83FA22FA@googlemail.com> Hi, I would like to submit my first PR for the Pandas project. My Github user name is eoincondron and the PR is related to issue #26210. How do I get started? Regards, Eoin -------------- next part -------------- An HTML attachment was scrubbed... URL: From william.ayd at icloud.com Thu May 2 16:42:56 2019 From: william.ayd at icloud.com (William Ayd) Date: Thu, 2 May 2019 13:42:56 -0700 Subject: [Pandas-dev] Submitting PR for the first time In-Reply-To: <51E8A655-0588-4356-8842-24AE83FA22FA@googlemail.com> References: <51E8A655-0588-4356-8842-24AE83FA22FA@googlemail.com> Message-ID: <2EB06DF6-A86F-4473-A186-DEE38F18A65A@icloud.com> Hi Eoin, Thanks for the interest in contributing! The documentation houses a contributing guide which is pretty thoroughly documented; I?d suggest giving that a look: https://pandas.pydata.org/pandas-docs/stable/development/contributing.html If you have one-off questions from there feel free to reach out via Gitter. - Will > On May 2, 2019, at 1:31 PM, Eoin C via Pandas-dev wrote: > > Hi, > > I would like to submit my first PR for the Pandas project. My Github user name is eoincondron and the PR is related to issue #26210. > How do I get started? > > Regards, > > Eoin > > > > _______________________________________________ > Pandas-dev mailing list > Pandas-dev at python.org > https://mail.python.org/mailman/listinfo/pandas-dev William Ayd william.ayd at icloud.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From jorisvandenbossche at gmail.com Tue May 14 07:27:39 2019 From: jorisvandenbossche at gmail.com (Joris Van den Bossche) Date: Tue, 14 May 2019 13:27:39 +0200 Subject: [Pandas-dev] Pandas development hangout - Thursday May 16 at 16:30 UTC Message-ID: Hi everyone, We're having a dev chat this week Thursday to check the status of the coming releases and other discussion items. It takes place Thursday (May 16) at 16:30 UTC (I *think* this is 9:30 Pacific / 12:30 Eastern / 17:30 UTC / 18:30 central Europe). All are welcome to attend! Calendar invite: https://calendar.google.com/event?action=TEMPLATE&tmeid=N2VscjBkZnA2OWtmMzlzMHVramVkZ25rcTkgam9yaXN2YW5kZW5ib3NzY2hlQG0&tmsrc=jorisvandenbossche%40gmail.com (will update later with video meeting link) Agenda/Minutes: https://docs.google.com/document/d/1tGbTiYORHiSPgVMXawiweGJlBw5dOkVJLY-licoBmBU/edit?usp=sharing Joris -------------- next part -------------- An HTML attachment was scrubbed... URL: From jorisvandenbossche at gmail.com Tue May 14 07:50:26 2019 From: jorisvandenbossche at gmail.com (Joris Van den Bossche) Date: Tue, 14 May 2019 13:50:26 +0200 Subject: [Pandas-dev] Pandas dev sprint June 27-30 @ Nashville Message-ID: Dear all, We are planning to do a pandas sprint end of June in Nashville (Tennessee, USA): June 27-30. We will be meeting with some of the core devs (so not a sprint to jump-start newcomers in this case), but sending this to the mailing list to invite other pandas (or related libraries) contributors. The exact planning of the sprint still needs to be discussed, but we will probably be hacking and discussing on pandas, extension arrays, next versions of pandas, etc. *So if you are interested, let me know something! *We want to keep the number of participants somewhat limited, and also need to plan the location and funding, so please state your interest *before May 30*. If you would like to participate, but not sure if you would fit at such a sprint, don't hesitate to mail me personally. Best, Joris -------------- next part -------------- An HTML attachment was scrubbed... URL: From changshe at gmail.com Tue May 14 16:59:11 2019 From: changshe at gmail.com (Chang She) Date: Tue, 14 May 2019 13:59:11 -0700 Subject: [Pandas-dev] Pandas dev sprint June 27-30 @ Nashville In-Reply-To: References: Message-ID: I'll be out of the country during those dates. Can I still join in remotely? Here's what I'd be interested in working on if there's appetite for these to be part of pandas and friends: 1. A stale PR on Series.explode I haven't had any time to finish up ( https://github.com/pandas-dev/pandas/pull/24366). 2. Open sourcing an improvement to the pandas-redshift connector that speeds up the ingestion of medium amounts of data using a combination of unload + read_csv + multiprocessing. 3. A minor improvement to allow read_parquet to work with globs directly. This makes it a lot easier for pandas to read parquet generated by Spark. On Tue, May 14, 2019 at 4:50 AM Joris Van den Bossche < jorisvandenbossche at gmail.com> wrote: > Dear all, > > We are planning to do a pandas sprint end of June in Nashville (Tennessee, > USA): June 27-30. We will be meeting with some of the core devs (so not a > sprint to jump-start newcomers in this case), but sending this to the > mailing list to invite other pandas (or related libraries) contributors. > The exact planning of the sprint still needs to be discussed, but we will > probably be hacking and discussing on pandas, extension arrays, next > versions of pandas, etc. > > *So if you are interested, let me know something! *We want to keep the > number of participants somewhat limited, and also need to plan the location > and funding, so please state your interest *before May 30*. > If you would like to participate, but not sure if you would fit at such a > sprint, don't hesitate to mail me personally. > > Best, > Joris > > > _______________________________________________ > Pandas-dev mailing list > Pandas-dev at python.org > https://mail.python.org/mailman/listinfo/pandas-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jorisvandenbossche at gmail.com Thu May 16 12:30:53 2019 From: jorisvandenbossche at gmail.com (Joris Van den Bossche) Date: Thu, 16 May 2019 18:30:53 +0200 Subject: [Pandas-dev] Pandas development hangout - Thursday May 16 at 16:30 UTC In-Reply-To: References: Message-ID: I updated the event with the video link: Hangout: https://meet.google.com/cmr-mtsx-khs Phone: ?+1 727-258-2318? PIN: ?701 610 593?# Op di 14 mei 2019 om 13:27 schreef Joris Van den Bossche < jorisvandenbossche at gmail.com>: > Hi everyone, > > We're having a dev chat this week Thursday to check the status of the > coming releases and other discussion items. It takes place Thursday (May > 16) at 16:30 UTC (I *think* this is 9:30 Pacific / 12:30 Eastern / 17:30 > UTC / 18:30 central Europe). > All are welcome to attend! > > Calendar invite: > https://calendar.google.com/event?action=TEMPLATE&tmeid=N2VscjBkZnA2OWtmMzlzMHVramVkZ25rcTkgam9yaXN2YW5kZW5ib3NzY2hlQG0&tmsrc=jorisvandenbossche%40gmail.com > > (will update later with video meeting link) > > Agenda/Minutes: > https://docs.google.com/document/d/1tGbTiYORHiSPgVMXawiweGJlBw5dOkVJLY-licoBmBU/edit?usp=sharing > > Joris > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From maxhalford25 at gmail.com Tue May 21 12:15:50 2019 From: maxhalford25 at gmail.com (Max Halford) Date: Tue, 21 May 2019 18:15:50 +0200 Subject: [Pandas-dev] Skyline queries Message-ID: Hello everyone, I was wondering if there was any interest to code a skyline operator in pandas? I just wrote a short blog post on the subject. I can't decide if this is part of pandas' scope, hence the email. However skyline are useful for many data mining applications. The idea would be to have .skyline(to_minimize, to_maximize) function where a user could specify which attributes he wishes to minimize and maximize. The function would return the subset of rows that belong to the skyline (which is slightly different from what I did in my blog post). Kind regards. -- Max Halford +336 28 25 13 38 -------------- next part -------------- An HTML attachment was scrubbed... URL: From tom.augspurger88 at gmail.com Tue May 21 14:20:05 2019 From: tom.augspurger88 at gmail.com (Tom Augspurger) Date: Tue, 21 May 2019 13:20:05 -0500 Subject: [Pandas-dev] Skyline queries In-Reply-To: References: Message-ID: That seems like it's a bit out of pandas' scope. But if that gets added to another package then a PR adding it to pandas' ecosystem page would certainly be welcome. Tom On Tue, May 21, 2019 at 11:33 AM Max Halford wrote: > Hello everyone, > > I was wondering if there was any interest to code a skyline operator > in pandas? I just wrote a short > blog post > on the subject. I can't decide if this is part of pandas' scope, hence the > email. However skyline are useful for many data mining applications. The > idea would be to have .skyline(to_minimize, to_maximize) function where a > user could specify which attributes he wishes to minimize and maximize. The > function would return the subset of rows that belong to the skyline (which > is slightly different from what I did in my blog post). > > Kind regards. > > -- > Max Halford > +336 28 25 13 38 > _______________________________________________ > Pandas-dev mailing list > Pandas-dev at python.org > https://mail.python.org/mailman/listinfo/pandas-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From tmrsg11 at gmail.com Sat May 25 00:37:26 2019 From: tmrsg11 at gmail.com (C W) Date: Sat, 25 May 2019 00:37:26 -0400 Subject: [Pandas-dev] Was the range() function ever created in Pandas? Message-ID: Hello all, I am want to calculate the range of a vector. I saw that someone asked for range() in 2011, but was it ever created? https://github.com/pandas-dev/pandas/issues/288 Response at the time was to use df.describe(). But df.describe() gives all the 5-number summary statistics, but I DON'T WANT wall the extra stuff I didn't ask for. I was expecting a numerical number. I can use that to feed into another function. It exists in Matlab and R, why not in Python? I'm quite frustrated every time I need to calculate the range. Thanks in advance. -------------- next part -------------- An HTML attachment was scrubbed... URL: From jorisvandenbossche at gmail.com Sat May 25 08:41:59 2019 From: jorisvandenbossche at gmail.com (Joris Van den Bossche) Date: Sat, 25 May 2019 14:41:59 +0200 Subject: [Pandas-dev] Was the range() function ever created in Pandas? In-Reply-To: References: Message-ID: The issue you mention was actually closed by a commit adding a value_range function, but it was later deprecated and removed again. If you don't want everything that describe() gives, you can calculate (series.min(), series.max()) both manually. If you regularly need this, I would recommend writing a small function yourself that you can easily reuse. What is and what is not included in pandas is always a trade-off: a balance between more functionality out of the box vs an overload of functionality (and especially an too many methods on the Series/DataFrame objects) in which many users get lost. Currently, we already have a lot of methods on Series/DataFrame, and adding an extra needs a very good justification. Personally, I don't think that a range function is worth it. Best, Joris Op za 25 mei 2019 om 09:15 schreef C W : > Hello all, > > I am want to calculate the range of a vector. I saw that someone asked for > range() in 2011, but was it ever created? > https://github.com/pandas-dev/pandas/issues/288 > > Response at the time was to use df.describe(). But df.describe() gives all > the 5-number summary statistics, but I DON'T WANT wall the extra stuff I > didn't ask for. I was expecting a numerical number. I can use that to feed > into another function. > > It exists in Matlab and R, why not in Python? I'm quite frustrated every > time I need to calculate the range. > > Thanks in advance. > _______________________________________________ > Pandas-dev mailing list > Pandas-dev at python.org > https://mail.python.org/mailman/listinfo/pandas-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jorisvandenbossche at gmail.com Sat May 25 08:54:26 2019 From: jorisvandenbossche at gmail.com (Joris Van den Bossche) Date: Sat, 25 May 2019 14:54:26 +0200 Subject: [Pandas-dev] Pandas dev sprint June 27-30 @ Nashville In-Reply-To: References: Message-ID: You are certainly welcome to sprint those days as well. I only can't promise that we can do any things to improve remote participation such as video meetings (but there should be a bunch of core devs active, which might give faster feedback on PRs if needed). Op di 14 mei 2019 om 22:59 schreef Chang She : > I'll be out of the country during those dates. Can I still join in > remotely? > > Here's what I'd be interested in working on if there's appetite for these > to be part of pandas and friends: > > 1. A stale PR on Series.explode I haven't had any time to finish up ( > https://github.com/pandas-dev/pandas/pull/24366). > I think there is still certainly interest in such a function (I have regularly needed something like that myself). > 2. Open sourcing an improvement to the pandas-redshift connector that > speeds up the ingestion of medium amounts of data using a combination of > unload + read_csv + multiprocessing. > That reminds me: it might be good to mention the pandas-redshift packages somewhere in the docs (in the ecosystem page or in the sql docs). > 3. A minor improvement to allow read_parquet to work with globs directly. > This makes it a lot easier for pandas to read parquet generated by Spark. > > Do you know if there is already an open issue about this? Best, Joris > > > On Tue, May 14, 2019 at 4:50 AM Joris Van den Bossche < > jorisvandenbossche at gmail.com> wrote: > >> Dear all, >> >> We are planning to do a pandas sprint end of June in Nashville >> (Tennessee, USA): June 27-30. We will be meeting with some of the core devs >> (so not a sprint to jump-start newcomers in this case), but sending this to >> the mailing list to invite other pandas (or related libraries) contributors. >> The exact planning of the sprint still needs to be discussed, but we will >> probably be hacking and discussing on pandas, extension arrays, next >> versions of pandas, etc. >> >> *So if you are interested, let me know something! *We want to keep the >> number of participants somewhat limited, and also need to plan the location >> and funding, so please state your interest *before May 30*. >> If you would like to participate, but not sure if you would fit at such a >> sprint, don't hesitate to mail me personally. >> >> Best, >> Joris >> >> >> _______________________________________________ >> Pandas-dev mailing list >> Pandas-dev at python.org >> https://mail.python.org/mailman/listinfo/pandas-dev >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From changshe at gmail.com Sat May 25 16:43:45 2019 From: changshe at gmail.com (Chang She) Date: Sat, 25 May 2019 13:43:45 -0700 Subject: [Pandas-dev] Pandas dev sprint June 27-30 @ Nashville In-Reply-To: References: Message-ID: Oh this was more me just carving time out as a forcing function for myself. I?ll be +13 hour ahead so certainly not expecting video conferences. Code reviews would be appreciated. As for the read_parquet item, I will either attach to an existing issue or open a new one so discussion can happen on github. Thanks. On Saturday, May 25, 2019, Joris Van den Bossche < jorisvandenbossche at gmail.com> wrote: > You are certainly welcome to sprint those days as well. I only can't > promise that we can do any things to improve remote participation such as > video meetings (but there should be a bunch of core devs active, which > might give faster feedback on PRs if needed). > > Op di 14 mei 2019 om 22:59 schreef Chang She : > >> I'll be out of the country during those dates. Can I still join in >> remotely? >> >> Here's what I'd be interested in working on if there's appetite for these >> to be part of pandas and friends: >> >> 1. A stale PR on Series.explode I haven't had any time to finish up ( >> https://github.com/pandas-dev/pandas/pull/24366). >> > > I think there is still certainly interest in such a function (I have > regularly needed something like that myself). > > >> 2. Open sourcing an improvement to the pandas-redshift connector that >> speeds up the ingestion of medium amounts of data using a combination of >> unload + read_csv + multiprocessing. >> > > That reminds me: it might be good to mention the pandas-redshift packages > somewhere in the docs (in the ecosystem page or in the sql docs). > > >> 3. A minor improvement to allow read_parquet to work with globs directly. >> This makes it a lot easier for pandas to read parquet generated by Spark. >> >> Do you know if there is already an open issue about this? > > Best, > Joris > > >> >> >> On Tue, May 14, 2019 at 4:50 AM Joris Van den Bossche < >> jorisvandenbossche at gmail.com> wrote: >> >>> Dear all, >>> >>> We are planning to do a pandas sprint end of June in Nashville >>> (Tennessee, USA): June 27-30. We will be meeting with some of the core devs >>> (so not a sprint to jump-start newcomers in this case), but sending this to >>> the mailing list to invite other pandas (or related libraries) contributors. >>> The exact planning of the sprint still needs to be discussed, but we >>> will probably be hacking and discussing on pandas, extension arrays, next >>> versions of pandas, etc. >>> >>> *So if you are interested, let me know something! *We want to keep the >>> number of participants somewhat limited, and also need to plan the location >>> and funding, so please state your interest *before May 30*. >>> If you would like to participate, but not sure if you would fit at such >>> a sprint, don't hesitate to mail me personally. >>> >>> Best, >>> Joris >>> >>> >>> _______________________________________________ >>> Pandas-dev mailing list >>> Pandas-dev at python.org >>> https://mail.python.org/mailman/listinfo/pandas-dev >>> >> -------------- next part -------------- An HTML attachment was scrubbed... URL: From tmrsg11 at gmail.com Sat May 25 14:41:40 2019 From: tmrsg11 at gmail.com (C W) Date: Sat, 25 May 2019 14:41:40 -0400 Subject: [Pandas-dev] Was the range() function ever created in Pandas? In-Reply-To: References: Message-ID: Thanks Joris. I'm from the statistical side, not cs. Range is very fundamental --- part of the 5-summary statistics. My understanding is that Pandas is the R equivalent. Even the documentation on github claims some methods are just R function written in Python. Best! On Sat, May 25, 2019 at 8:42 AM Joris Van den Bossche < jorisvandenbossche at gmail.com> wrote: > The issue you mention was actually closed by a commit adding a value_range > function, but it was later deprecated and removed again. > > If you don't want everything that describe() gives, you can calculate (series.min(), > series.max()) both manually. > If you regularly need this, I would recommend writing a small function > yourself that you can easily reuse. > > What is and what is not included in pandas is always a trade-off: a > balance between more functionality out of the box vs an overload of > functionality (and especially an too many methods on the Series/DataFrame > objects) in which many users get lost. > Currently, we already have a lot of methods on Series/DataFrame, and > adding an extra needs a very good justification. Personally, I don't think > that a range function is worth it. > > Best, > Joris > > Op za 25 mei 2019 om 09:15 schreef C W : > >> Hello all, >> >> I am want to calculate the range of a vector. I saw that someone asked >> for range() in 2011, but was it ever created? >> https://github.com/pandas-dev/pandas/issues/288 >> >> Response at the time was to use df.describe(). But df.describe() gives >> all the 5-number summary statistics, but I DON'T WANT wall the extra stuff >> I didn't ask for. I was expecting a numerical number. I can use that to >> feed into another function. >> >> It exists in Matlab and R, why not in Python? I'm quite frustrated every >> time I need to calculate the range. >> >> Thanks in advance. >> _______________________________________________ >> Pandas-dev mailing list >> Pandas-dev at python.org >> https://mail.python.org/mailman/listinfo/pandas-dev >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From tmrsg11 at gmail.com Sun May 26 23:51:16 2019 From: tmrsg11 at gmail.com (C W) Date: Sun, 26 May 2019 23:51:16 -0400 Subject: [Pandas-dev] Was the range() function ever created in Pandas? In-Reply-To: References: Message-ID: C W Sat, May 25, 2:41 PM (1 day ago) to Joris, pandas-dev Why is my reply not getting posted? I am posting again below. Thanks Joris. I'm from the statistical side, not cs. Range is very fundamental --- part of the 5-summary statistics. My understanding is that Pandas is the R equivalent. Even the documentation on github claims some methods are just R function written in Python. Best! On Sat, May 25, 2019 at 8:42 AM Joris Van den Bossche < jorisvandenbossche at gmail.com> wrote: > The issue you mention was actually closed by a commit adding a value_range > function, but it was later deprecated and removed again. > > If you don't want everything that describe() gives, you can calculate (series.min(), > series.max()) both manually. > If you regularly need this, I would recommend writing a small function > yourself that you can easily reuse. > > What is and what is not included in pandas is always a trade-off: a > balance between more functionality out of the box vs an overload of > functionality (and especially an too many methods on the Series/DataFrame > objects) in which many users get lost. > Currently, we already have a lot of methods on Series/DataFrame, and > adding an extra needs a very good justification. Personally, I don't think > that a range function is worth it. > > Best, > Joris > > Op za 25 mei 2019 om 09:15 schreef C W : > >> Hello all, >> >> I am want to calculate the range of a vector. I saw that someone asked >> for range() in 2011, but was it ever created? >> https://github.com/pandas-dev/pandas/issues/288 >> >> Response at the time was to use df.describe(). But df.describe() gives >> all the 5-number summary statistics, but I DON'T WANT wall the extra stuff >> I didn't ask for. I was expecting a numerical number. I can use that to >> feed into another function. >> >> It exists in Matlab and R, why not in Python? I'm quite frustrated every >> time I need to calculate the range. >> >> Thanks in advance. >> _______________________________________________ >> Pandas-dev mailing list >> Pandas-dev at python.org >> https://mail.python.org/mailman/listinfo/pandas-dev >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jreback at yahoo.com Wed May 29 17:59:20 2019 From: jreback at yahoo.com (Jeff Reback) Date: Wed, 29 May 2019 17:59:20 -0400 Subject: [Pandas-dev] welcome to new committers Message-ID: I would like to warmly welcome 2 new committees to the pandas core team: Simon Hawkins (simonjayhawkins) Terji Peterson (topper-123) They have consistently pushed quality PRs and have helped out other contributors. We hope that they can help make pandas even better going forward. Welcome and thanks Jeff From jorisvandenbossche at gmail.com Thu May 30 12:43:19 2019 From: jorisvandenbossche at gmail.com (Joris Van den Bossche) Date: Thu, 30 May 2019 18:43:19 +0200 Subject: [Pandas-dev] welcome to new committers In-Reply-To: References: Message-ID: Congratulations Simon and Terji, and thanks for all your contributions! Op wo 29 mei 2019 om 23:59 schreef Jeff Reback via Pandas-dev < pandas-dev at python.org>: > I would like to warmly welcome 2 new committees to the pandas core team: > > Simon Hawkins (simonjayhawkins) > Terji Peterson (topper-123) > > They have consistently pushed quality PRs and have helped out other > contributors. We hope that they can help make pandas even better going > forward. > > Welcome and thanks > Jeff > _______________________________________________ > Pandas-dev mailing list > Pandas-dev at python.org > https://mail.python.org/mailman/listinfo/pandas-dev > -------------- next part -------------- An HTML attachment was scrubbed... URL: From wesmckinn at gmail.com Thu May 30 13:03:29 2019 From: wesmckinn at gmail.com (Wes McKinney) Date: Thu, 30 May 2019 12:03:29 -0500 Subject: [Pandas-dev] welcome to new committers In-Reply-To: References: Message-ID: Welcome to the team, and thank you for being involved with the project! On Thu, May 30, 2019 at 11:43 AM Joris Van den Bossche wrote: > > Congratulations Simon and Terji, and thanks for all your contributions! > > Op wo 29 mei 2019 om 23:59 schreef Jeff Reback via Pandas-dev : >> >> I would like to warmly welcome 2 new committees to the pandas core team: >> >> Simon Hawkins (simonjayhawkins) >> Terji Peterson (topper-123) >> >> They have consistently pushed quality PRs and have helped out other contributors. We hope that they can help make pandas even better going forward. >> >> Welcome and thanks >> Jeff >> _______________________________________________ >> Pandas-dev mailing list >> Pandas-dev at python.org >> https://mail.python.org/mailman/listinfo/pandas-dev > > _______________________________________________ > Pandas-dev mailing list > Pandas-dev at python.org > https://mail.python.org/mailman/listinfo/pandas-dev From wesmckinn at gmail.com Thu May 30 13:56:27 2019 From: wesmckinn at gmail.com (Wes McKinney) Date: Thu, 30 May 2019 12:56:27 -0500 Subject: [Pandas-dev] Pandas dev sprint June 27-30 @ Nashville In-Reply-To: References: Message-ID: @Joris if you have a complete headcount for the meeting please let me know so I can work on securing a space for us to work. Just to confirm it's the 27th through the 30th inclusive, so 4 full days of workspace required? Thanks On Sat, May 25, 2019 at 3:44 PM Chang She wrote: > > Oh this was more me just carving time out as a forcing function for myself. I?ll be +13 hour ahead so certainly not expecting video conferences. Code reviews would be appreciated. > > As for the read_parquet item, I will either attach to an existing issue or open a new one so discussion can happen on github. > > Thanks. > > On Saturday, May 25, 2019, Joris Van den Bossche wrote: >> >> You are certainly welcome to sprint those days as well. I only can't promise that we can do any things to improve remote participation such as video meetings (but there should be a bunch of core devs active, which might give faster feedback on PRs if needed). >> >> Op di 14 mei 2019 om 22:59 schreef Chang She : >>> >>> I'll be out of the country during those dates. Can I still join in remotely? >>> >>> Here's what I'd be interested in working on if there's appetite for these to be part of pandas and friends: >>> >>> 1. A stale PR on Series.explode I haven't had any time to finish up (https://github.com/pandas-dev/pandas/pull/24366). >> >> >> I think there is still certainly interest in such a function (I have regularly needed something like that myself). >> >>> >>> 2. Open sourcing an improvement to the pandas-redshift connector that speeds up the ingestion of medium amounts of data using a combination of unload + read_csv + multiprocessing. >> >> >> That reminds me: it might be good to mention the pandas-redshift packages somewhere in the docs (in the ecosystem page or in the sql docs). >> >>> >>> 3. A minor improvement to allow read_parquet to work with globs directly. This makes it a lot easier for pandas to read parquet generated by Spark. >>> >> Do you know if there is already an open issue about this? >> >> Best, >> Joris >> >>> >>> >>> >>> On Tue, May 14, 2019 at 4:50 AM Joris Van den Bossche wrote: >>>> >>>> Dear all, >>>> >>>> We are planning to do a pandas sprint end of June in Nashville (Tennessee, USA): June 27-30. We will be meeting with some of the core devs (so not a sprint to jump-start newcomers in this case), but sending this to the mailing list to invite other pandas (or related libraries) contributors. >>>> The exact planning of the sprint still needs to be discussed, but we will probably be hacking and discussing on pandas, extension arrays, next versions of pandas, etc. >>>> >>>> So if you are interested, let me know something! We want to keep the number of participants somewhat limited, and also need to plan the location and funding, so please state your interest before May 30. >>>> If you would like to participate, but not sure if you would fit at such a sprint, don't hesitate to mail me personally. >>>> >>>> Best, >>>> Joris >>>> >>>> >>>> _______________________________________________ >>>> Pandas-dev mailing list >>>> Pandas-dev at python.org >>>> https://mail.python.org/mailman/listinfo/pandas-dev > > _______________________________________________ > Pandas-dev mailing list > Pandas-dev at python.org > https://mail.python.org/mailman/listinfo/pandas-dev