From matematica.a3k at gmail.com Thu Nov 4 10:18:04 2021 From: matematica.a3k at gmail.com (=?UTF-8?Q?Matem=C3=A1tica_A3K?=) Date: Thu, 4 Nov 2021 11:18:04 -0300 Subject: [scikit-learn] Help interpreting decision function plot In-Reply-To: References: Message-ID: Is this better suited for office hours? On Sun, Oct 31, 2021 at 2:07 PM Matem?tica A3K wrote: > Hi! > > I have been building a tool that integrates statistical engines - > specially scikit-learn - with django called django-ai > . > > With that tool, I have built another, covid-ht > , which should showcase the power > of those together. > > That tool is meant to help health professionals with classification tasks > based on measurements > . > > The tool is heading to its first release as a technology preview, and in > this process I have faced a release-blocker issue for which I would like to > ask for your help: I can't find a consistent interpretation of the graphs. > > The graphs are called "conditional decision functions > ", > where each one is the contour of the decision function of a classifier for > an observation in 2 variables while leaving the others fixed. > > The graphs show classification regions as expected, but my initial > interpretation seems wrong (commented out > > ). > > If that explanation was good, I would expect that perturbing one variable > in a direction where the graph shows another class should switch the > classification, as the remaining variables are fixed and that should be the > value that the classifier uses to decide - which is plotted in that plane. > > That is not happening, as you may check here > (the classifier being used is an > Histogram-based Gradient Boosting Classification Tree). > > Any insight about the situation will be highly appreciated and thankful in > advance. > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From g.lemaitre58 at gmail.com Sun Nov 7 08:35:36 2021 From: g.lemaitre58 at gmail.com (=?utf-8?Q?Guillaume_Lema=C3=AEtre?=) Date: Sun, 7 Nov 2021 14:35:36 +0100 Subject: [scikit-learn] scikit-learn office hours on Monday Nov. 8 2021 Message-ID: <7B815880-7B17-4C7A-B5DE-7267EF4E6794@gmail.com> Hi all, Some of us will be online on the scikit-learn discord this Monday at 11:00 ET / 16:00 UTC / 17:00 CET for about an hour or so. First time and occasional contributors are welcome to join us to discord using this invitation link: https://discord.gg/YBdN45kD The focus of these office hour sessions is to answer questions about contributing to scikit-learn. We can also split into break out audio/text channels and do pair programming or live reviewing of forgotten pull requests with screen sharing. We can also try to assist you into crafting minimal reproduction cases for bug reports to get a higher likelihood of resolution (e.g. https://matthewrocklin.com/blog/work/2018/02/28/minimal-bug-reports ). Please note, our Code of Conduct applies: https://github.com/scikit-learn/scikit-learn/blob/main/CODE_OF_CONDUCT.md If this experiment is successful, we will probably hold this kind of office hours on a regular basis. See you soon on discord! -- Guillaume Lemaitre Scikit-learn @ Inria Foundation https://glemaitre.github.io/ -------------- next part -------------- An HTML attachment was scrubbed... URL: From g.lemaitre58 at gmail.com Sun Nov 7 11:46:45 2021 From: g.lemaitre58 at gmail.com (=?utf-8?Q?Guillaume_Lema=C3=AEtre?=) Date: Sun, 7 Nov 2021 17:46:45 +0100 Subject: [scikit-learn] scikit-learn office hours on Monday Nov. 8 2021 In-Reply-To: <7B815880-7B17-4C7A-B5DE-7267EF4E6794@gmail.com> References: <7B815880-7B17-4C7A-B5DE-7267EF4E6794@gmail.com> Message-ID: <41F49D98-D4AF-4968-8665-6EEAD96E8557@gmail.com> Dear all, Please find a new discord invite since the previous invitation expired: https://discord.gg/84atnsdjTa Cheers, -- Guillaume Lemaitre Scikit-learn @ Inria Foundation https://glemaitre.github.io/ > On 7 Nov 2021, at 14:35, Guillaume Lema?tre wrote: > > Hi all, > > Some of us will be online on the scikit-learn discord this Monday at > 11:00 ET / 16:00 UTC / 17:00 CET for about an hour or so. > > First time and occasional contributors are welcome to join us to > discord using this invitation link: > https://discord.gg/YBdN45kD > > The focus of these office hour sessions is to answer questions about > contributing to scikit-learn. We can also split into break out > audio/text channels and do pair programming or live reviewing of > forgotten pull requests with screen sharing. > > We can also try to assist you into crafting minimal reproduction cases > for bug reports to get a higher likelihood of resolution (e.g. > https://matthewrocklin.com/blog/work/2018/02/28/minimal-bug-reports ). > > Please note, our Code of Conduct applies: > https://github.com/scikit-learn/scikit-learn/blob/main/CODE_OF_CONDUCT.md > > If this experiment is successful, we will probably hold this kind of > office hours on a regular basis. > > See you soon on discord! > -- > Guillaume Lemaitre > Scikit-learn @ Inria Foundation > https://glemaitre.github.io/ -------------- next part -------------- An HTML attachment was scrubbed... URL: From yanghaha217 at gmail.com Wed Nov 10 09:19:26 2021 From: yanghaha217 at gmail.com (=?UTF-8?B?5p2o5ZOI5ZOI?=) Date: Wed, 10 Nov 2021 22:19:26 +0800 Subject: [scikit-learn] Fwd: In-Reply-To: References: Message-ID: ---------- ????? --------- ???? ??? ???2021?11?10? ????10:08 ??? ???? Excuse me, I'm a Chinese student, Yang.I want to cite sklearn package in my paper, because it's very efficient to machine learning. But I'm doubt about references in official webpage. For example, the references in '*https://scikit-learn.org/stable/modules/generated/sklearn.svm.SVC.html#sklearn.svm.SVC *' is *LIBSVM:A Library for Support Vector Machines* and *Probabilistic outputs for support vector machines and comparison to regularizedlikelihood methods. * May I ask you what are these references for?For the webpage or the package? Or I just need to cite *Scikit-learn: Machine Learning in Python, Pedregosa et al., JMLR 12, pp. 2825-2830, 2011 *in* 'https://scikit-learn.org/stable/about.html#citing-scikit-learn '?* I would be appreciated if you can reply me. -------------- next part -------------- An HTML attachment was scrubbed... URL: From g.lemaitre58 at gmail.com Wed Nov 10 09:44:30 2021 From: g.lemaitre58 at gmail.com (=?utf-8?Q?Guillaume_Lema=C3=AEtre?=) Date: Wed, 10 Nov 2021 15:44:30 +0100 Subject: [scikit-learn] (no subject) In-Reply-To: References: Message-ID: <861DD81D-BA23-46B2-805E-2A2F22148C38@gmail.com> You can refer to https://scikit-learn.org/stable/about.html#citing-scikit-learn depending what is the scope of your research paper. -- Guillaume Lemaitre Scikit-learn @ Inria Foundation https://glemaitre.github.io/ > On 10 Nov 2021, at 15:19, ??? wrote: > > > > ---------- ????? --------- > ???? ??? > > ???2021?11?10? ????10:08 > ??? > ???? > > > > Excuse me, I'm a Chinese student, Yang.I want to cite sklearn package in my paper, because it's very efficient to machine learning. > But I'm doubt about references in official webpage. For example, the references in 'https://scikit-learn.org/stable/modules/generated/sklearn.svm.SVC.html#sklearn.svm.SVC ' is LIBSVM:A Library for Support Vector Machines and Probabilistic outputs for support vector machines and comparison to regularizedlikelihood methods. > May I ask you what are these references for?For the webpage or the package? Or I just need to cite Scikit-learn: Machine Learning in Python, Pedregosa et al., JMLR 12, pp. 2825-2830, 2011 in 'https://scikit-learn.org/stable/about.html#citing-scikit-learn '? > I would be appreciated if you can reply me. > > _______________________________________________ > scikit-learn mailing list > scikit-learn at python.org > https://mail.python.org/mailman/listinfo/scikit-learn -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Wed Nov 17 12:46:45 2021 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Wed, 17 Nov 2021 18:46:45 +0100 Subject: [scikit-learn] request for feedback on cross-library design proposal Message-ID: (Apologies for the cross-posting, if you follow more than one of these lists, you may see this multiple times.) Hi all, I'd like to solicit the feedback of maintainers of NumPy, SciPy, scikit-learn, and scikit-image, and other interested community members on this blog post and proposed design to enable multiple array types in SciPy, scikit-learn, scikit-image and other libraries in this ecosystem: https://labs.quansight.org/blog/2021/11/pydata-extensibility-vision/. Because this design is meant to be shared across projects, it would be great to discuss it in a central place. The new https://discuss.scientific-python.org/ seems like a good venue for that. Here are two separate forum threads for this: - for the big picture: https://discuss.scientific-python.org/t/a-proposed-design-for-supporting-multiple-array-types-across-scipy-scikit-learn-scikit-image-and-beyond/131 - for the most tricky design decision (user-facing switch & default behavior): https://discuss.scientific-python.org/t/default-dispatching-behavior-for-supporting-multiple-array-types-across-scipy-scikit-learn-scikit-image/135 Thanks in advance for any input! Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From thomasjpfan at gmail.com Sat Nov 20 11:52:26 2021 From: thomasjpfan at gmail.com (Thomas J. Fan) Date: Sat, 20 Nov 2021 08:52:26 -0800 Subject: [scikit-learn] scikit-learn monthly developer meeting: Monday November 29th 2021 Message-ID: Dear all, The scikit-learn developer monthly meeting will take place on Monday November 29 at 22:00 UTC. - Video call link: https://meet.google.com/ews-uszu-djs - Meeting notes / agenda: https://hackmd.io/0yokz72CTZSny8y3Re648Q - Local times: https://www.timeanddate.com/worldclock/meetingdetails.html?year=2021&month=11&day=29&hour=22&min=0&sec=0&p1=1440&p2=240&p3=248&p4=195&p5=179&p6=224 The goal of this meeting is to discuss ongoing development topics for the project. Everybody is welcome. As usual, please follow the code of conduct of the project: https://github.com/scikit-learn/scikit-learn/blob/main/CODE_OF_CONDUCT.md Regards, Thomas -------------- next part -------------- An HTML attachment was scrubbed... URL: From reshama.stat at gmail.com Mon Nov 22 06:00:00 2021 From: reshama.stat at gmail.com (Reshama Shaikh) Date: Mon, 22 Nov 2021 06:00:00 -0500 Subject: [scikit-learn] [Data Umbrella] AFME (Africa & Middle East) scikit-learn open source sprint (scikit-learn) In-Reply-To: References: Message-ID: Hello, The report from the Data Umbrella Africa & Middle East sprint is here [a]. SUMMARY - 40 people joined - 17 countries represented - 57% were returning contributors There are a lot of good plots in the report. This is one of the first times I've examined attrition more closely, related to gender and country. Thanks to everyone on the Data Umbrella and scikit-learn teams for their assistance in making this happen! [a]: https://blog.dataumbrella.org/data-umbrella-afme2-2021-scikit-learn-sprint-report Best, Reshama --- Reshama Shaikh she/her Blog | Twitter | LinkedIn | GitHub Data Umbrella NYC PyLadies On Mon, Oct 11, 2021 at 8:00 AM Reshama Shaikh wrote: > Hello, > At this time, we have a few spots open for the upcoming October 23 > online scikit-learn sprint organized by Data Umbrella. > > If you reside outside of the Africa and Middle East region, you are now > able to apply. > https://afme2021rc.dataumbrella.org/home > > Note 1: we offer a stipend of $10 USD to cover the cost of internet > access, and you can indicate such on your application. > > Note 2: if you need a translator, please indicate so on your > application. > > Key Notes: > a) There is a pre-sprint event on Saturday October 16 from 5-6pm EAT. > This pre-sprint event is *optional* and an opportunity to answer any > questions in general and aid in setting up your virtual environment. > > b) Sprint is on *Saturday, October 23 at 5pm - 9pm EAT (East Africa > Time) *on our Discord server. > > c) There is a post-sprint event on Saturday November 23 from 5-6pm EAT. > This post-sprint event is *optional* and an opportunity to ask the core > devs questions on open pull requests. > > d) There is 3-4 hours of pre-work for the sprint. Here is the > checklist: https://afme2021rc.dataumbrella.org/about/prep-work > > Please feel free to send any questions to me off the mailing list. > > Best, > Reshama > Reshama Shaikh > she/her > Blog | Twitter > | LinkedIn | GitHub > > > Data Umbrella > NYC PyLadies > > > > On Sat, Sep 25, 2021 at 5:05 PM Reshama Shaikh > wrote: > >> Hello, >> >> Data Umbrella is organizing a scikit-learn sprint for this October 23, >> with a focus on **Africa and the Middle East**. This event is free. >> >> A sprint is a 4-hour hands-on hackathon where we work on beginner issues >> in the scikit-learn GitHub repository. Participants will be paired with >> another person. There will be core contributors available to answer any >> questions. >> >> Event website is: https://afme2021rc.dataumbrella.org >> We encourage folks to read the website and then complete the application. >> >> The event can be shared in these ways: >> - Retweet: https://twitter.com/DataUmbrella/status/1435972074842034184 >> - Share post on LinkedIn: >> https://www.linkedin.com/feed/update/urn:li:activity:6841738994305294336/ >> >> Please feel free to contact me if you have any questions. >> >> Cheers, >> Reshama Shaikh >> she/her >> >> -------------- next part -------------- An HTML attachment was scrubbed... URL: From adrin.jalali at gmail.com Mon Nov 22 17:33:53 2021 From: adrin.jalali at gmail.com (Adrin) Date: Mon, 22 Nov 2021 23:33:53 +0100 Subject: [scikit-learn] [Data Umbrella] AFME (Africa & Middle East) scikit-learn open source sprint (scikit-learn) In-Reply-To: References: Message-ID: Thanks Reshama, That's a really nice report! On Mon, Nov 22, 2021 at 12:01 PM Reshama Shaikh wrote: > Hello, > The report from the Data Umbrella Africa & Middle East sprint is here [a]. > > SUMMARY > - 40 people joined > - 17 countries represented > - 57% were returning contributors > > There are a lot of good plots in the report. This is one of the first > times I've examined attrition more closely, related to gender and country. > > Thanks to everyone on the Data Umbrella and scikit-learn teams for their > assistance in making this happen! > > [a]: > https://blog.dataumbrella.org/data-umbrella-afme2-2021-scikit-learn-sprint-report > > Best, > Reshama > --- > Reshama Shaikh > she/her > Blog | Twitter > | LinkedIn | GitHub > > > Data Umbrella > NYC PyLadies > > > > On Mon, Oct 11, 2021 at 8:00 AM Reshama Shaikh > wrote: > >> Hello, >> At this time, we have a few spots open for the upcoming October 23 >> online scikit-learn sprint organized by Data Umbrella. >> >> If you reside outside of the Africa and Middle East region, you are now >> able to apply. >> https://afme2021rc.dataumbrella.org/home >> >> Note 1: we offer a stipend of $10 USD to cover the cost of internet >> access, and you can indicate such on your application. >> >> Note 2: if you need a translator, please indicate so on your >> application. >> >> Key Notes: >> a) There is a pre-sprint event on Saturday October 16 from 5-6pm EAT. >> This pre-sprint event is *optional* and an opportunity to answer any >> questions in general and aid in setting up your virtual environment. >> >> b) Sprint is on *Saturday, October 23 at 5pm - 9pm EAT (East Africa >> Time) *on our Discord server. >> >> c) There is a post-sprint event on Saturday November 23 from 5-6pm EAT. >> This post-sprint event is *optional* and an opportunity to ask the core >> devs questions on open pull requests. >> >> d) There is 3-4 hours of pre-work for the sprint. Here is the >> checklist: https://afme2021rc.dataumbrella.org/about/prep-work >> >> Please feel free to send any questions to me off the mailing list. >> >> Best, >> Reshama >> Reshama Shaikh >> she/her >> Blog | Twitter >> | LinkedIn >> | GitHub >> >> >> Data Umbrella >> NYC PyLadies >> >> >> >> On Sat, Sep 25, 2021 at 5:05 PM Reshama Shaikh >> wrote: >> >>> Hello, >>> >>> Data Umbrella is organizing a scikit-learn sprint for this October 23, >>> with a focus on **Africa and the Middle East**. This event is free. >>> >>> A sprint is a 4-hour hands-on hackathon where we work on beginner issues >>> in the scikit-learn GitHub repository. Participants will be paired with >>> another person. There will be core contributors available to answer any >>> questions. >>> >>> Event website is: https://afme2021rc.dataumbrella.org >>> We encourage folks to read the website and then complete the application. >>> >>> The event can be shared in these ways: >>> - Retweet: https://twitter.com/DataUmbrella/status/1435972074842034184 >>> - Share post on LinkedIn: >>> https://www.linkedin.com/feed/update/urn:li:activity:6841738994305294336/ >>> >>> Please feel free to contact me if you have any questions. >>> >>> Cheers, >>> Reshama Shaikh >>> she/her >>> >>> _______________________________________________ > scikit-learn mailing list > scikit-learn at python.org > https://mail.python.org/mailman/listinfo/scikit-learn > -------------- next part -------------- An HTML attachment was scrubbed... URL: