[Pandas-dev] PDEPs: pandas enhancement proposals

Matthew Roeschke emailformattr at gmail.com
Sat Jun 25 15:18:49 EDT 2022


I find Marc's arguments regarding general simplicity of PDEP flow
(publishing to website & integration to the main repo) a strong argument to
keep these in the main repo.

Since there is a dependency between PDEP development and the pandas-dev
repo development, having them separated may lead to similar workflow
challenges with the MacPython/pandas-wheels repo for example (where
ciwheelbuild
being integrated into the main repo
<https://github.com/pandas-dev/pandas/issues/44027> is considered a benefit
due to tighter integration).

I agree PDEP visibility from notifications is important, but notification
priority and channels can differ person-to-person. For example, I just
manage my GIthub notifications in Github, not email.

On Sat, Jun 25, 2022 at 10:50 AM Tom Augspurger <tom.w.augspurger at gmail.com>
wrote:

> For me, notifications are the big thing. Having the emails come from a
> separate repo would make following things much easier for those who can’t
> keep up with the main repo.
>
> Tom
>
> On Jun 25, 2022, at 12:04 PM, Marc Garcia <garcia.marc at gmail.com> wrote:
>
> 
> Thanks for the feedback. I understand your point about using a different
> repo, but I see several advantages on the current approach, so maybe worth
> discussing a bit further what are the exact pain points, to see if a
> separate repo is really the best solution.
>
> Let me know if I miss something, but I see three different ways in which
> we'll be interacting with PDEPs:
>
> a) Via their rendered version. Not sure if you checked it, but the current
> rendered page from the PDEP PR (attached) is equivalent to the home of the
> scikit-learn SLEP proposals [1]. The main difference is that with the
> current approach we have it integrated with the website, which I personally
> think it's an advantage.
>
> b) Via the list of PDEP PRs to review. In this case, to see only PDEP PRs,
> if we use the main pandas repo, this is just a label filter [2]. To me
> personally quicker than having to go to another repo, but no big difference
> about one or the other.
>
> c) Notifications. I guess this is the main thing. I think one concern is
> that notifications from PDEPs get lost in the rest of the repo
> notifications. I assume you're using your email client filters, and if the
> notifications come from another repo, you can change the rules easily. I
> guess the solution here would be to use something like PDEP in the title
> and use that as a rule. Or we can try to find something more reliable, if
> that's the main concern.
>
> Personally, I don't see the advantages of having the proposals in a
> separate repo very significant. And by keeping things the way they're
> implemented in the PR, I do see some advantages:
> - No need to maintain a separate repo, CI workflow, jobs to publish the
> build, sphinx (or equivalent) project... Nothing too complex, by why having
> to implement and maintain all that if our website is already prepared to
> handle it. And in particular, with Sphinx is not as easy as with out
> website to fetch the open PRs and render them.
> - Integrated UX of the PDEPs into our website. I think this gives it more
> visibility, and a better using experience than having to jump from one
> website to another.
> - One of my concerns is that being in a separate repo we forget about
> them. We're used to check PRs in the pandas repo, and we'll keep coming
> back to PRs about PDEPs until they're merged if they are in the main repo,
> but feels like being in a separate repo is easier to forget them when there
> is no recent activity and notifications.
>
> It would be good to know if I miss any of your concerns. If I didn't, I'd
> say we can start with what's already implemented, which is almost ready to
> get merged, and if in the future you still think we can do better by using
> a separate repo, you can implement it, we have a discussion about it, and
> we move PDEPs to a separate repo if that makes sense. What do you think?
>
> Cheers,
> Marc
>
> 1. https://scikit-learn-enhancement-proposals.readthedocs.io/en/latest/
> 2.
> https://github.com/pandas-dev/pandas/pulls?q=is%3Aopen+is%3Apr+label%3APDEP
>
>
> On Sat, Jun 25, 2022 at 7:05 AM Jeff Reback <jeffreback at gmail.com> wrote:
>
>> +1 in using a separate repo (under pandas-dev) for this
>>
>>
>> On Jun 24, 2022, at 5:05 PM, Joris Van den Bossche <
>> jorisvandenbossche at gmail.com> wrote:
>>
>> 
>> Thanks for starting this proposal, Marc!
>>
>> I have already been doing this in some ad-hoc way with eg the Copy/View
>> proposal (writing an actual proposal document), so I am very much in favor
>> of formalizing this a bit more.
>>
>> Personally, I would prefer that we use a more dedicated home for this
>> instead of using the existing pandas repo (e.g. a separate repo in the
>> pandas-dev org). The main pandas repo has nowadays such a high volume in
>> issue and PR comments, that it becomes difficult to follow this or notice
>> specific issues. While there are certainly ways to deal with this (e.g.
>> consistently using a specific label and title, ensuring we always notify
>> the mailing list as well, ...), IMO it would make it more accessible to
>> follow and have an overview of those discussions in e.g. a separate repo.
>>
>> (there are examples of both in other projects, for example scikit-learn
>> has a separate repo, while bumpy uses the main repo I think)
>>
>> Joris
>>
>> Op di 21 jun. 2022 09:46 schreef Marc Garcia <garcia.marc at gmail.com>:
>>
>>> We're in the process of implementing PDEPs, equivalent to Python's PEPs
>>> and NumPy's NEPs, but for pandas. This should help build the roadmap, make
>>> discussions more efficient, obtain more structured feedback from the
>>> community, and add visibility to agreed future plans for pandas.
>>>
>>> The initial implementation (workflow) is a bit simpler than PEP or NEP,
>>> but we'll iterate in the future as convenient.
>>>
>>> You can see the PR for PDEP-1 with the purpose, scope and guidelines
>>> here: https://github.com/pandas-dev/pandas/pull/47444
>>>
>>> Feedback is very welcome.
>>> _______________________________________________
>>> Pandas-dev mailing list
>>> Pandas-dev at python.org
>>> https://mail.python.org/mailman/listinfo/pandas-dev
>>>
>> _______________________________________________
>> Pandas-dev mailing list
>> Pandas-dev at python.org
>> https://mail.python.org/mailman/listinfo/pandas-dev
>>
>> [image: Screenshot at 2022-06-25 22-20-50.png]
> _______________________________________________
> Pandas-dev mailing list
> Pandas-dev at python.org
> https://mail.python.org/mailman/listinfo/pandas-dev
>
> _______________________________________________
> Pandas-dev mailing list
> Pandas-dev at python.org
> https://mail.python.org/mailman/listinfo/pandas-dev
>


-- 
Matthew Roeschke
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.python.org/pipermail/pandas-dev/attachments/20220625/88087a71/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Screenshot at 2022-06-25 22-20-50.png
Type: image/png
Size: 192689 bytes
Desc: not available
URL: <https://mail.python.org/pipermail/pandas-dev/attachments/20220625/88087a71/attachment-0001.png>


More information about the Pandas-dev mailing list