From ml_news at posteo.de Sun Dec 1 03:29:47 2019 From: ml_news at posteo.de (Manfred Lotz) Date: Sun, 1 Dec 2019 09:29:47 +0100 Subject: "Don't install on the system Python" References: <0a065288-f851-4afc-9104-f5b5ca5188f9@googlegroups.com> Message-ID: <20191201092947.556bceab@arcor.com> On Sat, 30 Nov 2019 20:42:21 -0800 (PST) John Ladasky wrote: > Long-time Ubuntu user here. > > For years, I've read warnings about not installing one's personal > stack of Python modules on top of the system Python. It is possible > to corrupt the OS, or so I've gathered. > This is nonsense as you presumably have no permission to change anything Python related in /usr. The only possiblity I can imagine is that you somehow screw up your personal Python related setting in your home directory tree. But I have never (in the short period of time I've been using Python) encountered anything like this. -- Manfred From cs at cskk.id.au Sun Dec 1 03:41:37 2019 From: cs at cskk.id.au (Cameron Simpson) Date: Sun, 1 Dec 2019 19:41:37 +1100 Subject: "Don't install on the system Python" In-Reply-To: <20191201092947.556bceab@arcor.com> References: <20191201092947.556bceab@arcor.com> Message-ID: <20191201084137.GA84892@cskk.homeip.net> On 01Dec2019 09:29, Manfred Lotz wrote: >On Sat, 30 Nov 2019 20:42:21 -0800 (PST) >John Ladasky wrote: >> For years, I've read warnings about not installing one's personal >> stack of Python modules on top of the system Python. It is possible >> to corrupt the OS, or so I've gathered. > >This is nonsense as you presumably have no permission to change >anything Python related in /usr. > >The only possiblity I can imagine is that you somehow screw up your >personal Python related setting in your home directory tree. But I have >never (in the short period of time I've been using Python) encountered >anything like this. What is to be avoided: Some people run pip as root and install in the vendor/supplier controlled space. This can lead to various problems, as it can conflict with or simply vary the system installed packages. Provided the OP is using pip in its (modern default) "install in my home directory" mode, they should be fine. Cheers, Cameron Simpson From __peter__ at web.de Sun Dec 1 04:26:58 2019 From: __peter__ at web.de (Peter Otten) Date: Sun, 01 Dec 2019 10:26:58 +0100 Subject: ModuleNotFoundError with click module References: <32cc3ffc-bb43-284f-dfd8-7451ba100ec7@akwebsoft.com> Message-ID: Tim Johnson wrote: > Using linux ubuntu 16.04 with bash shell. > Am retired python programmer, but not terribly current. > I have moderate bash experience. > > When trying to install pgadmin4 via apt I get the following error > traceback when pgadmin4 is invoked: > > Traceback (most recent call last): > File "setup.py", line 17, in > from pgadmin.model import db, User, Version, ServerGroup, Server, \ > File "/usr/share/pgadmin4/web/pgadmin/__init__.py", line 19, in > from flask import Flask, abort, request, current_app, session, url_for > File "/usr/local/lib/python3.7/site-packages/flask/__init__.py", line > 21, in > from .app import Flask > File "/usr/local/lib/python3.7/site-packages/flask/app.py", line 34, > in > from . import cli > File "/usr/local/lib/python3.7/site-packages/flask/cli.py", line 25, in > > import click > ModuleNotFoundError: No module named 'click' > > > If I invoke python3 (/usr/local/bin/python3), version 3.7.2 and invoke > >>> import click > click is imported successfully. > > In this invocation, sys.path is: > ['', '/usr/local/lib/python37.zip', '/usr/local/lib/python3.7', > '/usr/local/lib/python3.7/lib-dynload', > '/home/tim/.local/lib/python3.7/site-packages', > '/usr/local/lib/python3.7/site-packages'] > > $PYTHONPATH is empty when the bash shell is invoked > > $PATH as follows: > /home/tim/bin:/home/tim/.local/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/games:/snap/bin > > click.py can be found at > /usr/local/lib/python3.7/site-packages/pipenv/patched/piptools/ > in turn click.py imports click, presumably as the package, > which appears to be at > /usr/local/lib/python3.7/site-packages/pipenv/vendor/click > > Any number of settings of PYTHONPATH to the various paths above has > failed to resolve the ModuleNotFoundError > Same issues with attempting install from a virtual environment. > > Any help will be appreciated. > thanks > tim > I'm too lazy to look into the details of your paths -- I'd just make sure that click is installed with the same interpreter and user as pgadmin4, e. g. globally $ sudo /usr/local/bin/python3 -m pip install click $ sudo /usr/local/bin/python3 path/to/setup.py install # or whatever it takes to install pgadmin4 or (better) in a virtual environment $ /usr/local/bin/python3 -m venv whatever $ cd whatever $ . bin/activate $ pip install click $ python path/to/setup.py From john_ladasky at sbcglobal.net Sun Dec 1 04:33:50 2019 From: john_ladasky at sbcglobal.net (John Ladasky) Date: Sun, 1 Dec 2019 01:33:50 -0800 (PST) Subject: "Don't install on the system Python" In-Reply-To: References: <20191201092947.556bceab@arcor.com> <20191201084137.GA84892@cskk.homeip.net> Message-ID: On Sunday, December 1, 2019 at 12:47:43 AM UTC-8, Cameron Simpson wrote: > On 01Dec2019 09:29, Manfred Lotz <... at posteo.de> wrote: > >On Sat, 30 Nov 2019 20:42:21 -0800 (PST) > >John Ladasky <... at sbcglobal.net> wrote: > >> For years, I've read warnings about not installing one's personal > >> stack of Python modules on top of the system Python. It is possible > >> to corrupt the OS, or so I've gathered. > > > >This is nonsense as you presumably have no permission to change > >anything Python related in /usr. > > > >The only possiblity I can imagine is that you somehow screw up your > >personal Python related setting in your home directory tree. But I have > >never (in the short period of time I've been using Python) encountered > >anything like this. > > What is to be avoided: Some people run pip as root and install in the > vendor/supplier controlled space. This can lead to various problems, as > it can conflict with or simply vary the system installed packages. > > Provided the OP is using pip in its (modern default) "install in my home > directory" mode, they should be fine. > > Cheers, > Cameron Simpson <... at cskk.id.au> The only thing I must install with pip is tensorflow-gpu. For everything else, I make use of the Ubuntu repositories. The Synaptic package manager installs packages (including Python modules) for all user accounts at the same time, which I like. When I installed tensorflow-gpu using pip, I was in fact frustrated because I couldn't figure out how to deploy it across multiple user accounts at one time. I ended up installing it three times, once in each account. You're suggesting that's actually preferred, at least when pip is performing the installation. OK, I will endure the repetition. From musbur at posteo.org Sun Dec 1 06:24:11 2019 From: musbur at posteo.org (musbur at posteo.org) Date: Sun, 1 Dec 2019 12:24:11 +0100 Subject: "Don't install on the system Python" In-Reply-To: References: <20191201092947.556bceab@arcor.com> <20191201084137.GA84892@cskk.homeip.net>

Message-ID: <20191201122411.3d01a97c@nxp10225> On Sun, 1 Dec 2019 01:33:50 -0800 (PST) John Ladasky wrote: > The only thing I must install with pip is tensorflow-gpu. For > everything else, I make use of the Ubuntu repositories. The Synaptic > package manager installs packages (including Python modules) for all > user accounts at the same time, which I like. > > When I installed tensorflow-gpu using pip, I was in fact frustrated > because I couldn't figure out how to deploy it across multiple user > accounts at one time. I ended up installing it three times, once in > each account. You're suggesting that's actually preferred, at least > when pip is performing the installation. OK, I will endure the > repetition. You can set up a system-wide virtualenv (for instance in /usr/local/lib/myenv) and use pip install as root to set up everything into that. All the normal users have to do then is prepend /usr/local/lib/myenv/bin to their PATH. After that, you have a system-wide consistent distribution of all your needed Python packages. You can then uninstall all python packages provided by the Linux distro which you don't need. At the moment it seems as if all you need to install locally with pip is tensorflow-gpu. This will change once some future version of tensorflow-gpu depends on newer versions of the system-provided packges. When that happens, pip will pull all those packages into the user's local venv, and it will have to do that individually for each user. BTW, it took me a long time to embrace Python's "virtualenv" concept because I had a hard time figuring out what it was and how it worked. Turns out that there is no magic involved, and that "virtual environment" is a misnomer. It is simply a full Python environment in a separate location on your system. Nothing virtual about it. From chema at rinzewind.org Sun Dec 1 09:39:56 2019 From: chema at rinzewind.org (=?iso-8859-1?Q?Jos=E9_Mar=EDa?= Mateos) Date: Sun, 1 Dec 2019 09:39:56 -0500 Subject: Pickle caching objects? In-Reply-To: References: <20191130220545.GA12875@equipaje> Message-ID: <20191201143956.GB3751@equipaje> On Sun, Dec 01, 2019 at 12:26:15PM +1100, Chris Angelico wrote: >I can't answer your question authoritatively, but I can suggest a >place to look. Python's memory allocator doesn't always return memory >to the system when the objects are freed up, for various reasons >including the way that memory pages get allocated from. But it >internally knows which parts are in use and which parts aren't. You're >seeing the RSS go down slightly at some points, which would be the >times when entire pages can be released; but other than that, what >you'll end up with is a sort of high-water-mark with lots of unused >space inside it. > >So what you're seeing isn't actual objects being cached, but just >memory ready to be populated with future objects. Thank you and Richard for your responses, this makes perfect sense now. Cheers, -- Jos? Mar?a (Chema) Mateos || https://rinzewind.org/ From torriem at gmail.com Sun Dec 1 10:41:01 2019 From: torriem at gmail.com (Michael Torrie) Date: Sun, 1 Dec 2019 08:41:01 -0700 Subject: "Don't install on the system Python" In-Reply-To: <0a065288-f851-4afc-9104-f5b5ca5188f9@googlegroups.com> References: <0a065288-f851-4afc-9104-f5b5ca5188f9@googlegroups.com> Message-ID: <6519290e-942a-340d-1eac-ee9f5a2c858f@gmail.com> On 11/30/19 9:42 PM, John Ladasky wrote: > Can anyone provide concrete examples of problems arising from > installing modules on top of the system Python? Am I courting > disaster? No you aren't. I've also never had any problems. I've installed many things into my root system Python installation with pip including PyQt5. It's just easier for me to have them in the system installation. I'm on a CentOS 7 box, which depends on Python 2 for a lot of system functions. I've moved to Python 3 now, so I mess with the system Python less and less. I understand that I probably should be using a virtualenv and pip installing into that, but I'm just too lazy. A couple of years ago I even managed to upgrade the system Python from 2.6 to 2.7 without any issues. I ended up making an RPM that neatly upgraded the system one. I also have upgraded the gtk2 bindings and the GTK2 library itself (again building RPMs), and everything worked fine, even the graphical centos utilities. The only problems I've ever heard of come from trying to manually remove stuff from Python that the system depended on. I've never heard of any problems installing additional modules. 90% of the time the module you need will be in the repositories, so no worries there at all. And not much to worry about for the rest. From tim at akwebsoft.com Sun Dec 1 11:27:26 2019 From: tim at akwebsoft.com (Tim Johnson) Date: Sun, 1 Dec 2019 07:27:26 -0900 Subject: ModuleNotFoundError with click module In-Reply-To: References: <32cc3ffc-bb43-284f-dfd8-7451ba100ec7@akwebsoft.com> Message-ID: On 12/1/19 12:26 AM, Peter Otten wrote: > Tim Johnson wrote: > >> Using linux ubuntu 16.04 with bash shell. >> Am retired python programmer, but not terribly current. >> I have moderate bash experience. >> >> When trying to install pgadmin4 via apt I get the following error >> traceback when pgadmin4 is invoked: >> >> Traceback (most recent call last): snipped ... >> File "/usr/local/lib/python3.7/site-packages/flask/cli.py", line 25, in >> >> import click >> ModuleNotFoundError: No module named 'click' >> >> >> If I invoke python3 (/usr/local/bin/python3), version 3.7.2 and invoke >> >>> import click >> click is imported successfully. > ... > I'm too lazy to look into the details of your paths -- I'd just make sure > that click is installed with the same interpreter and user as pgadmin4, e. > g. globally > > $ sudo /usr/local/bin/python3 -m pip install click > $ sudo /usr/local/bin/python3 path/to/setup.py install # or whatever it > takes to install pgadmin4 Like I said, I'm not current. Yikes. Now I have /usr/local/lib/python3.7/site-packages/clic-0.1.3.dist-info/ After I have my coffee I will attempt to proceed from there with whatever it takes to finalize thanks -- Tim tj49.com From rmlibre at riseup.net Sun Dec 1 13:10:20 2019 From: rmlibre at riseup.net (rmlibre at riseup.net) Date: Sun, 01 Dec 2019 10:10:20 -0800 Subject: tab replace to space 4 (rmlibre) In-Reply-To: References: Message-ID: <761066994586319cea87cbd542adeeec@riseup.net> > Its just that I've just began to touch tkinter, and would like to know of > bug-related pitfalls before I waste energy on trying to figure out what I > did wrong. :-\ One thing which is not obvious or easy to debug: Text widgets have some kind of inefficiency related to really long lines that don't have line breaks. The result is that your cpu will be completely gobbled up if you have a sufficiently long line. I haven't run any tests to get an accurate view of what the breaking point is. But one the cores in my cpu was hold up at 100% when displaying many many thousands of characters without line breaks. After lots of debugging and optimization finding, I just happened to google for "cpu load issue with" each of the widgets I was using and discovered that this indeed was the cause of the cpu load. Swapping in line breaks at regular, known intervals quickly fixed this. tl;dr Google for "issue with tkinter " for the widgets you'll be using, or " with tkinter" if you're experiencing a particular issue and are not sure if it could be caused for some reason by tkinter. From Richard at Damon-Family.org Sun Dec 1 13:35:31 2019 From: Richard at Damon-Family.org (Richard Damon) Date: Sun, 1 Dec 2019 13:35:31 -0500 Subject: "Don't install on the system Python" In-Reply-To: <6519290e-942a-340d-1eac-ee9f5a2c858f@gmail.com> References: <0a065288-f851-4afc-9104-f5b5ca5188f9@googlegroups.com> <6519290e-942a-340d-1eac-ee9f5a2c858f@gmail.com> Message-ID: <7fa2eec8-b5f7-45bf-79d0-4c4cb624e1a9@Damon-Family.org> On 12/1/19 10:41 AM, Michael Torrie wrote: > On 11/30/19 9:42 PM, John Ladasky wrote: >> Can anyone provide concrete examples of problems arising from >> installing modules on top of the system Python? Am I courting >> disaster? > No you aren't. I've also never had any problems. I've installed many > things into my root system Python installation with pip including PyQt5. > It's just easier for me to have them in the system installation. I'm > on a CentOS 7 box, which depends on Python 2 for a lot of system > functions. I've moved to Python 3 now, so I mess with the system Python > less and less. I understand that I probably should be using a > virtualenv and pip installing into that, but I'm just too lazy. > > A couple of years ago I even managed to upgrade the system Python from > 2.6 to 2.7 without any issues. I ended up making an RPM that neatly > upgraded the system one. I also have upgraded the gtk2 bindings and the > GTK2 library itself (again building RPMs), and everything worked fine, > even the graphical centos utilities. > > The only problems I've ever heard of come from trying to manually remove > stuff from Python that the system depended on. I've never heard of any > problems installing additional modules. 90% of the time the module you > need will be in the repositories, so no worries there at all. And not > much to worry about for the rest. My guess is that the issue is with some more complicated/esoteric packages. Especially if there are ones that don't maintain strict backwards compatibility, so that some packages using them have a maximum usable version as well as a minimum usable version. This can lead to troubles as some packages become incompatible because one needs a version greater than x, while another needs a version less than x. -- Richard Damon From rosuav at gmail.com Sun Dec 1 13:44:10 2019 From: rosuav at gmail.com (Chris Angelico) Date: Mon, 2 Dec 2019 05:44:10 +1100 Subject: "Don't install on the system Python" In-Reply-To: <0a065288-f851-4afc-9104-f5b5ca5188f9@googlegroups.com> References: <0a065288-f851-4afc-9104-f5b5ca5188f9@googlegroups.com> Message-ID: On Sun, Dec 1, 2019 at 3:46 PM John Ladasky wrote: > > Long-time Ubuntu user here. > > For years, I've read warnings about not installing one's personal stack of Python modules on top of the system Python. It is possible to corrupt the OS, or so I've gathered. > > Can anyone provide concrete examples of problems arising from installing modules on top of the system Python? Am I courting disaster? > I'm going to start by separating out two concepts that are often, but not always, the same. The "system Python" is the one that the OS depends on. On my Debian Stretch, that's /usr/bin/python3 (symlink to /usr/bin/python3.5). On older systems, that might be a Python 2.7 (or worse). This is the installation of Python that is managed by your OS package manager, and - more importantly - is the one that any OS-provided scripts will depend on. The "default Python" is the one you get when you type "python3" at the shell. Often this is the same as the system Python, but there's no requirement for this to be the case. On many of my systems, I compile and install a new build of Python periodically (usually from the master branch, so it's a pre-alpha), and I'm happy for that to take over the name "python3". Currently, on my main system (the aforementioned Debian Stretch), that's /usr/local/bin/python3 (a symlink to /usr/local/bin/python3.9). Since the system Python is managed by your OS package manager, you need to be careful about using pip to install packages into it. For instance, if I were to "/usr/bin/python3 -m pip install psycopg2", it would potentially conflict with "sudo apt install python3-psycopg2". You MAY be safe using pip to install something that isn't available in your package manager, but it's possible to run into dependency versioning conflicts. Be careful. But installing into your default Python, if it's not the system Python, is absolutely safe. Or rather, if it breaks anything, then it's not your fault - it's the fault of something depending on the system Python but not using an absolute shebang :) ChrisA From tim at akwebsoft.com Sun Dec 1 19:41:28 2019 From: tim at akwebsoft.com (Tim Johnson) Date: Sun, 1 Dec 2019 15:41:28 -0900 Subject: ModuleNotFoundError with click module In-Reply-To: References: <32cc3ffc-bb43-284f-dfd8-7451ba100ec7@akwebsoft.com> Message-ID: <49e16310-3ef8-acf2-9583-6843a81f1bd6@akwebsoft.com> On 12/1/19 12:26 AM, Peter Otten wrote: > Tim Johnson wrote: > >> Using linux ubuntu 16.04 with bash shell. >> Am retired python programmer, but not terribly current. >> I have moderate bash experience. >> >> When trying to install pgadmin4 via apt I get the following error >> traceback when pgadmin4 is invoked: >> >> Traceback (most recent call last): >> File "setup.py", line 17, in >> from pgadmin.model import db, User, Version, ServerGroup, Server, \ >> File "/usr/share/pgadmin4/web/pgadmin/__init__.py", line 19, in >> from flask import Flask, abort, request, current_app, session, url_for >> File "/usr/local/lib/python3.7/site-packages/flask/__init__.py", line >> 21, in >> from .app import Flask >> File "/usr/local/lib/python3.7/site-packages/flask/app.py", line 34, >> in >> from . import cli >> File "/usr/local/lib/python3.7/site-packages/flask/cli.py", line 25, in >> >> import click >> ModuleNotFoundError: No module named 'click' >> >> >> If I invoke python3 (/usr/local/bin/python3), version 3.7.2 and invoke >> >>> import click >> click is imported successfully. >> >> In this invocation, sys.path is: >> ['', '/usr/local/lib/python37.zip', '/usr/local/lib/python3.7', >> '/usr/local/lib/python3.7/lib-dynload', >> '/home/tim/.local/lib/python3.7/site-packages', >> '/usr/local/lib/python3.7/site-packages'] >> >> $PYTHONPATH is empty when the bash shell is invoked >> >> $PATH as follows: >> > /home/tim/bin:/home/tim/.local/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/games:/snap/bin >> click.py can be found at >> /usr/local/lib/python3.7/site-packages/pipenv/patched/piptools/ >> in turn click.py imports click, presumably as the package, >> which appears to be at >> /usr/local/lib/python3.7/site-packages/pipenv/vendor/click >> >> Any number of settings of PYTHONPATH to the various paths above has >> failed to resolve the ModuleNotFoundError >> Same issues with attempting install from a virtual environment. >> >> Any help will be appreciated. >> thanks >> tim >> > I'm too lazy to look into the details of your paths -- I'd just make sure > that click is installed with the same interpreter and user as pgadmin4, e. > g. globally > > $ sudo /usr/local/bin/python3 -m pip install click > $ sudo /usr/local/bin/python3 path/to/setup.py install # or whatever it > takes to install pgadmin4 OK. Now I have /usr/local/lib/python3.7/site-packages/Click-7.0.dist-info/ which holds the following files: INSTALLER? LICENSE.txt? METADATA? RECORD? top_level.txt? WHEEL I haven't a clue as to how to proceed! Never seen this before ... Furthermore, google is offering me nothing conclusive. Where to go from here! -- Tim tj49.com From tim at akwebsoft.com Sun Dec 1 20:19:45 2019 From: tim at akwebsoft.com (Tim Johnson) Date: Sun, 1 Dec 2019 16:19:45 -0900 Subject: ModuleNotFoundError with click module In-Reply-To: <49e16310-3ef8-acf2-9583-6843a81f1bd6@akwebsoft.com> References: <32cc3ffc-bb43-284f-dfd8-7451ba100ec7@akwebsoft.com> <49e16310-3ef8-acf2-9583-6843a81f1bd6@akwebsoft.com> Message-ID: <510be00b-d87c-8054-cb5d-efb9179bd43b@akwebsoft.com> On 12/1/19 3:41 PM, Tim Johnson wrote: > > On 12/1/19 12:26 AM, Peter Otten wrote: >> Tim Johnson wrote: >> >>> Using linux ubuntu 16.04 with bash shell. >>> Am retired python programmer, but not terribly current. >>> I have moderate bash experience. >>> >>> When trying to install pgadmin4 via apt I get the following error >>> traceback when pgadmin4 is invoked: >>> >>> Traceback (most recent call last): >>> ? File "setup.py", line 17, in >>> ? from pgadmin.model import db, User, Version, ServerGroup, Server, \ >>> ? File "/usr/share/pgadmin4/web/pgadmin/__init__.py", line 19, in >>> >>> ? from flask import Flask, abort, request, current_app, session, >>> url_for >>> ? File "/usr/local/lib/python3.7/site-packages/flask/__init__.py", line >>> 21, in >>> ? from .app import Flask >>> ? File "/usr/local/lib/python3.7/site-packages/flask/app.py", line 34, >>> in >>> ? from . import cli >>> File "/usr/local/lib/python3.7/site-packages/flask/cli.py", line 25, in >>> >>> import click >>> ModuleNotFoundError: No module named 'click' >>> >>> >>> If I invoke python3 (/usr/local/bin/python3), version 3.7.2 and invoke >>> ? >>> import click >>> click is imported successfully. >>> >>> In this invocation, sys.path is: >>> ['', '/usr/local/lib/python37.zip', '/usr/local/lib/python3.7', >>> '/usr/local/lib/python3.7/lib-dynload', >>> '/home/tim/.local/lib/python3.7/site-packages', >>> '/usr/local/lib/python3.7/site-packages'] >>> >>> $PYTHONPATH is empty when the bash shell is invoked >>> >>> $PATH as follows: >>> >> /home/tim/bin:/home/tim/.local/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/games:/snap/bin >> >>> click.py can be found at >>> /usr/local/lib/python3.7/site-packages/pipenv/patched/piptools/ >>> in turn click.py imports click, presumably as the package, >>> which appears to be at >>> /usr/local/lib/python3.7/site-packages/pipenv/vendor/click >>> >>> Any number of settings of PYTHONPATH to the various paths above has >>> failed to resolve the ModuleNotFoundError >>> Same issues with attempting install from a virtual environment. >>> >>> Any help will be appreciated. >>> thanks >>> tim >>> >> I'm too lazy to look into the details of your paths -- I'd just make >> sure >> that click is installed with the same interpreter and user as >> pgadmin4, e. >> g. globally >> >> $ sudo /usr/local/bin/python3 -m pip install click >> $ sudo /usr/local/bin/python3 path/to/setup.py install? # or whatever it >> takes to install pgadmin4 > > OK. Now I have > > /usr/local/lib/python3.7/site-packages/Click-7.0.dist-info/ > > which holds the following files: > > INSTALLER? LICENSE.txt? METADATA? RECORD? top_level.txt? WHEEL > > I haven't a clue as to how to proceed! Never seen this before ... > > Furthermore, google is offering me nothing conclusive. > > Where to go from here! P.S. It looks like that directory is sort of a stub; regardless of my take on it I am no longer having the ModuleNotFoundError. Peter has a been a great help. Couldn't have done it without him. cheers -- Tim tj49.com From python.list at tim.thechases.com Sun Dec 1 21:50:16 2019 From: python.list at tim.thechases.com (Tim Chase) Date: Sun, 1 Dec 2019 20:50:16 -0600 Subject: increasing the page size of a dbm store? In-Reply-To: <0EAFC9E0-02D2-4745-99B7-654B7A445EDD@barrys-emacs.org> References: <20191126192432.535ee241@bigbox.attlocal.net> <0EAFC9E0-02D2-4745-99B7-654B7A445EDD@barrys-emacs.org> Message-ID: <20191201205016.577673f9@bigbox.attlocal.net> > Maybe port to SQLite? I would not choose dbm these days. After sparring with it a while, I tweaked the existing job so that it chunked things into dbm-appropriate sizes to limp through; for the subsequent job (where I would have used dbm again) I went ahead and switched to sqlite and had no further issues. I'm not sure if it's worth mentioning the issue in the docs for the dbm module so others don't bump against it. I'm not sure if the limit is on sum(size(key) for key in db) or the number of keys total. Just not the sort of thing I'd want someone to be depending on, unaware of the potential pitfalls. Thanks, -tkc From niktnobodynikt at gmail.com Mon Dec 2 02:52:44 2019 From: niktnobodynikt at gmail.com (niktnobodynikt at gmail.com) Date: Sun, 1 Dec 2019 23:52:44 -0800 (PST) Subject: "Don't install on the system Python" In-Reply-To: <0a065288-f851-4afc-9104-f5b5ca5188f9@googlegroups.com> References: <0a065288-f851-4afc-9104-f5b5ca5188f9@googlegroups.com> Message-ID: W dniu niedziela, 1 grudnia 2019 05:42:35 UTC+1 u?ytkownik John Ladasky napisa?: > For years, I've read warnings about not installing one's personal stack of Python modules on top of the system Python. It is possible to corrupt the OS, or so I've gathered. > > Well, I've never heeded this advice, and so far nothing bad has happened to me. I don't like Anaconda, or virtual environments in general. I don't like heavyweight IDE's. I like to be able to type "python3" at the command prompt and be sure what I'll be getting. I have multiple user accounts on a system that I manage, and I want every user account to have access to the same modules. > > Maybe the modules that I require are safe to install on the system Python, I'm not sure. My must-haves are mostly scientific computing and data management modules: Numpy, Scipy, Scikit-learn, Matplotlib, Pandas, Biopython, and Tensorflow. I also use PyQt5 from time to time. > > Can anyone provide concrete examples of problems arising from installing modules on top of the system Python? Am I courting disaster? I did not heard of such problems. But I have another warning. Ubuntu 18.4 uses Python 3.6. Do not try to install 3.7 or 3.8 as systemwide python3 version. It breaks some programs including standard terminal. From rosuav at gmail.com Mon Dec 2 03:15:58 2019 From: rosuav at gmail.com (Chris Angelico) Date: Mon, 2 Dec 2019 19:15:58 +1100 Subject: "Don't install on the system Python" In-Reply-To: References: <0a065288-f851-4afc-9104-f5b5ca5188f9@googlegroups.com> Message-ID: On Mon, Dec 2, 2019 at 6:56 PM wrote: > > W dniu niedziela, 1 grudnia 2019 05:42:35 UTC+1 u?ytkownik John Ladasky napisa?: > > > For years, I've read warnings about not installing one's personal stack of Python modules on top of the system Python. It is possible to corrupt the OS, or so I've gathered. > > > > Well, I've never heeded this advice, and so far nothing bad has happened to me. I don't like Anaconda, or virtual environments in general. I don't like heavyweight IDE's. I like to be able to type "python3" at the command prompt and be sure what I'll be getting. I have multiple user accounts on a system that I manage, and I want every user account to have access to the same modules. > > > > Maybe the modules that I require are safe to install on the system Python, I'm not sure. My must-haves are mostly scientific computing and data management modules: Numpy, Scipy, Scikit-learn, Matplotlib, Pandas, Biopython, and Tensorflow. I also use PyQt5 from time to time. > > > > Can anyone provide concrete examples of problems arising from installing modules on top of the system Python? Am I courting disaster? > > > I did not heard of such problems. But I have another warning. Ubuntu 18.4 uses Python 3.6. Do not try to install 3.7 or 3.8 as systemwide python3 version. It breaks some programs including standard terminal. > And that's where the distinction between "system Python" and "default Python" comes in. You are absolutely right that you shouldn't replace the *system* Python. However, changing what the command "python3" runs is safe. ChrisA From __peter__ at web.de Mon Dec 2 03:46:37 2019 From: __peter__ at web.de (Peter Otten) Date: Mon, 02 Dec 2019 09:46:37 +0100 Subject: ModuleNotFoundError with click module References: <32cc3ffc-bb43-284f-dfd8-7451ba100ec7@akwebsoft.com> <49e16310-3ef8-acf2-9583-6843a81f1bd6@akwebsoft.com> <510be00b-d87c-8054-cb5d-efb9179bd43b@akwebsoft.com> Message-ID: Tim Johnson wrote: >> OK. Now I have >> >> /usr/local/lib/python3.7/site-packages/Click-7.0.dist-info/ >> >> which holds the following files: >> >> INSTALLER LICENSE.txt METADATA RECORD top_level.txt WHEEL >> >> I haven't a clue as to how to proceed! Never seen this before ... Just leave it alone ;) >> Furthermore, google is offering me nothing conclusive. >> >> Where to go from here! > > P.S. It looks like that directory is sort of a stub; regardless of my > take on it I am no longer having the ModuleNotFoundError. Once you can import it you can find the actual module or package with $ /usr/bin/python3.7 -c 'import click; print(click.__file__)' In this case it's a package, so you'll probably see (something like) /usr/local/lib/python3.7/site-packages/click/__init__.py rather than /usr/local/lib/python3.7/site-packages/click.py /usr/local/lib/python3.7/site-packages/ From soyeomul at doraji.xyz Mon Dec 2 03:58:42 2019 From: soyeomul at doraji.xyz (=?utf-8?B?7Zmp67OR7Z2s?=) Date: Mon, 02 Dec 2019 17:58:42 +0900 Subject: tab replace to space 4 References: Message-ID: Hi, Gilmeh^^^ > We are Python people, aren't we? Looks good, i did copy it [1], and thanks^^^ [1] https://gitlab.com/soyeomul/test/blob/master/untabify.py Sincerely, -- ^????? _????_ ?????_^))// From aishan0403 at gmail.com Mon Dec 2 10:40:41 2019 From: aishan0403 at gmail.com (A S) Date: Mon, 2 Dec 2019 07:40:41 -0800 (PST) Subject: Extract sentences in nested parentheses using Python Message-ID: <7a365fa0-e721-4a95-8c30-d3661301a7b2@googlegroups.com> I am trying to extract all strings in nested parentheses (along with the parentheses itself) in my .txt file. Please see the sample .txt file that I have used in this example here: (https://drive.google.com/open?id=1UKc0ZgY9Fsz5O1rSeBCLqt5dwZkMaQgr). I have tried and done up three different codes but none of them seems to be able to extract all the nested parentheses. They can only extract a portion of the nested parentheses. Any advice on what I've done wrong could really help! Here are the three codes I have done so far: 1st attempt: import re from os.path import join def balanced_braces(args): parts = [] for arg in args: if '(' not in arg: continue chars = [] n = 0 for c in arg: if c == '(': if n > 0: chars.append(c) n += 1 elif c == ')': n -= 1 if n > 0: chars.append(c) elif n == 0: parts.append(''.join(chars).lstrip().rstrip()) chars = [] elif n > 0: chars.append(c) return parts with open('lan sample text file.txt','r') as fd: #for words in fd.readlines(): t1 = balanced_braces(fd); print(t1) Output: ['"xE\'", PUT(xx.xxxx.),"\'"', '"TRUuuuth"', "xxx(xx_ix as format 'xxxx-xx') lec &jgjsd_vnv.", '"xE\'", PUT(xx.xxxx.),"\'"', '"CUuuiiiiuth"', "xxx(xx_ix as format 'xxxx-xx') lec &jgjsd_vnv."] 2nd attempt: from pyparsing import nestedExpr matchedParens = nestedExpr('(',')') with open('lan sample text file.txt','r') as fd: for words in fd.readlines(): for e in matchedParens.searchString(words): print(e) Output: [['"xE\'"', ',', 'PUT', ['xx.xxxx.'], ',', '"\'"']] [['"TRUuuuth"']] [['xxx', ['xx_ix', 'as', 'format', "'xxxx-xx'"], 'gff', '&jfjfsj_jfjfj.']] [['xxx', ['xx_ix', 'as', 'format', "'xxxx-xx'"], 'lec', '&jgjsd_vnv.']] [['"xE\'"', ',', 'PUT', ['xx.xxxx.'], ',', '"\'"']] [['"CUuuiiiiuth"']] [['xxx', ['xx_ix', 'as', 'format', "'xxxx-xx'"], 'gff', '&jfjfsj_jfjfj.']] [['xxx', ['xx_ix', 'as', 'format', "'xxxx-xx'"], 'lec', '&jgjsd_vnv.']] 3rd attempt: def parse_segments(source, recurse=False): unmatched_count = 0 start_pos = 0 opened = False open_pos = 0 cur_pos = 0 finished = [] segments = [] for character in source: #scan for mismatched parenthesis: if character == '(': unmatched_count += 1 if not opened: open_pos = cur_pos opened = True if character == ')': unmatched_count -= 1 if opened and unmatched_count == 0: segment = source[open_pos:cur_pos+1] segments.append(segment) clean = source[start_pos:open_pos] if clean: finished.append(clean) opened = False start_pos = cur_pos+1 cur_pos += 1 # assert unmatched_count == 0 if start_pos != cur_pos: #get anything that was left over here finished.append(source[start_pos:cur_pos]) #now check on recursion: for item in segments: #get rid of bounding parentheses: pruned = item[1:-1] if recurse: results = parse_tags(pruned, recurse) finished.expand(results) else: finished.append(pruned) return finished with open('lan sample text file.txt','r') as fd: for words in fd.readlines(): t = parse_segments(words) print(t) Output: ['kkkkk;\n'] ['\n'] [' select xx', ' jdfjhf:jhfjj from xxxx_x_xx_L ;\n', '"xE\'", PUT(xx.xxxx.),"\'"'] ['quit; \n'] ['\n'] ['/* 1.xxxxx FROM xxxx_x_Ex_x */ \n'] ['proc sql; ', ';\n', '"TRUuuuth"'] ['hhhjhfjs as fdsjfsj:\n'] ['select * from djfkjd to jfkjs\n'] ['(\n'] ['SELECT abc AS abc1, abc_2_ AS efg, abc_fg, fkdkfj_vv, jjsflkl_ff, fjkdsf_jfkj\n'] ['\tFROM &xxx..xxx_xxx_xxE\n'] ["where ((xxx(xx_ix as format 'xxxx-xx') gff &jfjfsj_jfjfj.) and \n"] [' ', ')\n', "xxx(xx_ix as format 'xxxx-xx') lec &jgjsd_vnv."] [' );\n'] ['\n'] ['\n'] ['jjjjjj;\n'] ['\n'] [' select xx', ' jdfjhf:jhfjj from xxxx_x_xx_L ;\n', '"xE\'", PUT(xx.xxxx.),"\'"'] ['quit; \n'] ['\n'] ['/* 1.xxxxx FROM xxxx_x_Ex_x */ \n'] ['proc sql; ', ';\n', '"CUuuiiiiuth"'] ['hhhjhfjs as fdsjfsj:\n'] ['select * from djfkjd to jfkjs\n'] ['(SELECT abc AS abc1, abc_2_ AS efg, abc_fg, fkdkfj_vv, jjsflkl_ff, fjkdsf_jfkj\n'] ['\tFROM &xxx..xxx_xxx_xxE\n'] ["where ((xxx(xx_ix as format 'xxxx-xx') gff &jfjfsj_jfjfj.) and \n"] [' ', ')\n', "xxx(xx_ix as format 'xxxx-xx') lec &jgjsd_vnv."] [' );'] My intended Output that I am unable to get should look something like this: ("xE'", PUT(xx.xxxx.),"'") ("TRUuuuth") ( SELECT abc AS abc1, abc_2_ AS efg, abc_fg, fkdkfj_vv, jjsflkl_ff, fjkdsf_jfkj FROM &xxx..xxx_xxx_xxE where ((xxx(xx_ix as format 'xxxx-xx') gff &jfjfsj_jfjfj.) and (xxx(xx_ix as format 'xxxx-xx') lec &jgjsd_vnv.)) ) ("xE'", PUT(xx.xxxx.),"'") ("CUuuiiiiuth") (SELECT abc AS abc1, abc_2_ AS efg, abc_fg, fkdkfj_vv, jjsflkl_ff, fjkdsf_jfkj FROM &xxx..xxx_xxx_xxE where ((xxx(xx_ix as format 'xxxx-xx') gff &jfjfsj_jfjfj.) and (xxx(xx_ix as format 'xxxx-xx') lec &jgjsd_vnv.))(( )) ) From __peter__ at web.de Mon Dec 2 12:00:50 2019 From: __peter__ at web.de (Peter Otten) Date: Mon, 02 Dec 2019 18:00:50 +0100 Subject: Extract sentences in nested parentheses using Python References: <7a365fa0-e721-4a95-8c30-d3661301a7b2@googlegroups.com> Message-ID: A S wrote: I think I've seen this question before ;) > I am trying to extract all strings in nested parentheses (along with the > parentheses itself) in my .txt file. Please see the sample .txt file that > I have used in this example here: > (https://drive.google.com/open?id=1UKc0ZgY9Fsz5O1rSeBCLqt5dwZkMaQgr). > > I have tried and done up three different codes but none of them seems to > be able to extract all the nested parentheses. They can only extract a > portion of the nested parentheses. Any advice on what I've done wrong > could really help! > > Here are the three codes I have done so far: > > 1st attempt: > > import re > from os.path import join > > def balanced_braces(args): > parts = [] > for arg in args: > if '(' not in arg: > continue There could still be a ")" that you miss > chars = [] > n = 0 > for c in arg: > if c == '(': > if n > 0: > chars.append(c) > n += 1 > elif c == ')': > n -= 1 > if n > 0: > chars.append(c) > elif n == 0: > parts.append(''.join(chars).lstrip().rstrip()) > chars = [] > elif n > 0: > chars.append(c) > return parts It's probably easier to understand and implement when you process the complete text at once. Then arbitrary splits don't get in the way of your quest for ( and ). You just have to remember the position of the first opening ( and number of opening parens that have to be closed before you take the complete expression: level: 00011112222100 text: abc(def(gh))ij when we are here^ we need^ A tentative implementation: $ cat parse.py import re NOT_SET = object() def scan(text): level = 0 start = NOT_SET for m in re.compile("[()]").finditer(text): if m.group() == ")": level -= 1 if level < 0: raise ValueError("underflow: more closing than opening parens") if level == 0: # outermost closing parenthesis: # deliver enclosed string including parens. yield text[start:m.end()] start = NOT_SET elif m.group() == "(": if level == 0: # outermost opening parenthesis: remember position. assert start is NOT_SET start = m.start() level += 1 else: assert False if level > 0: raise ValueError("unclosed parens remain") if __name__ == "__main__": with open("lan sample text file.txt") as instream: text = instream.read() for chunk in scan(text): print(chunk) $ python3 parse.py ("xE'", PUT(xx.xxxx.),"'") ("TRUuuuth") From Chris.Clark at actian.com Mon Dec 2 12:26:00 2019 From: Chris.Clark at actian.com (Chris Clark) Date: Mon, 2 Dec 2019 17:26:00 +0000 Subject: array and struct 64-bit Linux change in behavior Python 3.7 and 2.7 Message-ID: Test case: import array array.array('L', [0]) # x.itemsize == 8 rather than 4 This works fine (returns 4) under Windows Python 3.7.3 64-bit build. Under Ubuntu; Python 2.7.15rc1, 3.6.5, 3.70b3 64-bit this returns 8. Documentation at https://docs.python.org/3/library/array.html explicitly states 'L' is for size 4. It impacts all uses types of array (e.g. reading from byte strings). The struct module is a little different: import struct x = struct.pack('L', 0) # len(x) ===8 rather than 4 This can be worked around by using '=L' - which is not well documented - so this maybe a doc issue. Wanted to post here for comments before opening a bug at https://bugs.python.org/ Is anyone seeing this under Debian/Ubuntu? Chris From Richard at Damon-family.org Mon Dec 2 12:47:06 2019 From: Richard at Damon-family.org (Richard Damon) Date: Mon, 2 Dec 2019 12:47:06 -0500 Subject: array and struct 64-bit Linux change in behavior Python 3.7 and 2.7 In-Reply-To: References: Message-ID: <0BD00D4B-5445-46F8-BDE8-168939332F63@Damon-family.org> On Dec 2, 2019, at 12:32 PM, Chris Clark wrote: > > ?Test case: > > import array > array.array('L', [0]) > # x.itemsize == 8 rather than 4 > > This works fine (returns 4) under Windows Python 3.7.3 64-bit build. > > Under Ubuntu; Python 2.7.15rc1, 3.6.5, 3.70b3 64-bit this returns 8. Documentation at https://docs.python.org/3/library/array.html explicitly states 'L' is for size 4. > It impacts all uses types of array (e.g. reading from byte strings). > > The struct module is a little different: > > import struct > x = struct.pack('L', 0) > # len(x) ===8 rather than 4 > > This can be worked around by using '=L' - which is not well documented - so this maybe a doc issue. > > Wanted to post here for comments before opening a bug at https://bugs.python.org/ > > Is anyone seeing this under Debian/Ubuntu? > > > Chris > Documentation that I see says *Minimum* size is 4, nothing says that it will be 4 I wouldn?t be surprized if ?I? gave you a size of 4 on that platform (and maybe even on many 32 bit platforms too) From rgaddi at highlandtechnology.invalid Mon Dec 2 12:55:16 2019 From: rgaddi at highlandtechnology.invalid (Rob Gaddi) Date: Mon, 2 Dec 2019 09:55:16 -0800 Subject: array and struct 64-bit Linux change in behavior Python 3.7 and 2.7 In-Reply-To: References:

Message-ID: On 12/2/19 9:26 AM, Chris Clark wrote: > Test case: > > import array > array.array('L', [0]) > # x.itemsize == 8 rather than 4 > > This works fine (returns 4) under Windows Python 3.7.3 64-bit build. > > Under Ubuntu; Python 2.7.15rc1, 3.6.5, 3.70b3 64-bit this returns 8. Documentation at https://docs.python.org/3/library/array.html explicitly states 'L' is for size 4. > It impacts all uses types of array (e.g. reading from byte strings). > > The struct module is a little different: > > import struct > x = struct.pack('L', 0) > # len(x) ===8 rather than 4 > > This can be worked around by using '=L' - which is not well documented - so this maybe a doc issue. > > Wanted to post here for comments before opening a bug at https://bugs.python.org/ > > Is anyone seeing this under Debian/Ubuntu? > > > Chris > I'd say not a bug, at least in array. Reading that array documentation you linked, 4 is explicitly the MINIMUM size in bytes, not the guaranteed size. The struct situation is, as you said, a bit different. I believe that with the default native alignment @, you're seeing 4-byte data padded to an 8-byte alignment, not 8-byte data. That does seem to go against what the struct documentation says, "Padding is only automatically added between successive structure members. No padding is added at the beginning or the end of the encoded struct." = alignment is documented as having the platform native byte-order, but the size and alignment is standardized as having no padding, which is exactly the behavior you seem to want. The documentation is a bit obtuse and scattered, but no more than any other. From tim at akwebsoft.com Mon Dec 2 14:48:39 2019 From: tim at akwebsoft.com (Tim Johnson) Date: Mon, 2 Dec 2019 10:48:39 -0900 Subject: ModuleNotFoundError with click module In-Reply-To: References: <32cc3ffc-bb43-284f-dfd8-7451ba100ec7@akwebsoft.com> <49e16310-3ef8-acf2-9583-6843a81f1bd6@akwebsoft.com> <510be00b-d87c-8054-cb5d-efb9179bd43b@akwebsoft.com> Message-ID: On 12/1/19 11:46 PM, Peter Otten wrote: > Tim Johnson wrote: > >>> OK. Now I have >>> >>> /usr/local/lib/python3.7/site-packages/Click-7.0.dist-info/ >>> >>> which holds the following files: >>> >>> INSTALLER LICENSE.txt METADATA RECORD top_level.txt WHEEL >>> >>> I haven't a clue as to how to proceed! Never seen this before ... > Just leave it alone ;) > >>> Furthermore, google is offering me nothing conclusive. >>> >>> Where to go from here! >> P.S. It looks like that directory is sort of a stub; regardless of my >> take on it I am no longer having the ModuleNotFoundError. > Once you can import it you can find the actual module or package with > > $ /usr/bin/python3.7 -c 'import click; print(click.__file__)' > > In this case it's a package, so you'll probably see (something like) > > /usr/local/lib/python3.7/site-packages/click/__init__.py > > rather than > > /usr/local/lib/python3.7/site-packages/click.py > /usr/local/lib/python3.7/site-packages/ Good Explanation. Thanks again Peter. new since python 1.5 -- Tim tj49.com From barry at barrys-emacs.org Mon Dec 2 16:25:05 2019 From: barry at barrys-emacs.org (Barry Scott) Date: Mon, 2 Dec 2019 21:25:05 +0000 Subject: array and struct 64-bit Linux change in behavior Python 3.7 and 2.7 In-Reply-To: References:

Message-ID: <475624D9-15D8-4A82-A8B4-5A174BA87DCD@barrys-emacs.org> > On 2 Dec 2019, at 17:55, Rob Gaddi wrote: > > On 12/2/19 9:26 AM, Chris Clark wrote: >> Test case: >> import array >> array.array('L', [0]) >> # x.itemsize == 8 rather than 4 >> This works fine (returns 4) under Windows Python 3.7.3 64-bit build. >> Under Ubuntu; Python 2.7.15rc1, 3.6.5, 3.70b3 64-bit this returns 8. Documentation at https://docs.python.org/3/library/array.html explicitly states 'L' is for size 4. >> It impacts all uses types of array (e.g. reading from byte strings). >> The struct module is a little different: >> import struct >> x = struct.pack('L', 0) >> # len(x) ===8 rather than 4 >> This can be worked around by using '=L' - which is not well documented - so this maybe a doc issue. >> Wanted to post here for comments before opening a bug at https://bugs.python.org/ >> Is anyone seeing this under Debian/Ubuntu? >> Chris > > I'd say not a bug, at least in array. Reading that array documentation you linked, 4 is explicitly the MINIMUM size in bytes, not the guaranteed size. I'm wondering how useful it is that for array you can read from a file but have no ideas how many bytes each item needs. If I have a file with int32_t in it I cannot from the docs know how to read that file into an array. > > The struct situation is, as you said, a bit different. I believe that with the default native alignment @, you're seeing 4-byte data padded to an 8-byte alignment, not 8-byte data. That does seem to go against what the struct documentation says, "Padding is only automatically added between successive structure members. No padding is added at the beginning or the end of the encoded struct." The 'L' in struct is documented for 3.7 to use 4 bytes, but in fact uses 8, on fedora 31. Doc bug? >>> x=struct.pack('L',0x102030405) >>> x b'\x05\x04\x03\x02\x01\x00\x00\x00' Given I have exact control with b, h, i, and q but L is not fixed in size I'm not sure how it can be used with certainty across OS and versions. Barry > > = alignment is documented as having the platform native byte-order, but the size and alignment is standardized as having no padding, which is exactly the behavior you seem to want. The documentation is a bit obtuse and scattered, but no more than any other. > -- > https://mail.python.org/mailman/listinfo/python-list From post at tinita.de Mon Dec 2 17:04:21 2019 From: post at tinita.de (=?ISO-8859-15?Q?Tina_M=FCller?=) Date: Mon, 2 Dec 2019 23:04:21 +0100 (CET) Subject: [ANN] PyYAML-5.2: YAML parser and emitter for Python Message-ID: ======================= Announcing PyYAML-5.2 ======================= A new release of PyYAML is now available: https://pypi.org/project/PyYAML/ This fixes some incompatibilities introduced in version 5.1 and also removes another possibility of loading arbitrary code. Changes ======= * Repair incompatibilities introduced with 5.1. The default Loader was changed, but several methods like add_constructor still used the old default https://github.com/yaml/pyyaml/pull/279 -- A more flexible fix for custom tag constructors https://github.com/yaml/pyyaml/pull/287 -- Change default loader for yaml.add_constructor https://github.com/yaml/pyyaml/pull/305 -- Change default loader for add_implicit_resolver, add_path_resolver * Make FullLoader safer by removing python/object/apply from the default FullLoader https://github.com/yaml/pyyaml/pull/347 -- Move constructor for object/apply to UnsafeConstructor * Fix bug introduced in 5.1 where quoting went wrong on systems with sys.maxunicode <= 0xffff https://github.com/yaml/pyyaml/pull/276 -- Fix logic for quoting special characters * Other PRs: https://github.com/yaml/pyyaml/pull/280 -- Update CHANGES for 5.1 Resources ========= PyYAML IRC Channel: #pyyaml on irc.freenode.net PyYAML homepage: https://github.com/yaml/pyyaml PyYAML documentation: http://pyyaml.org/wiki/PyYAMLDocumentation Source and binary installers: https://pypi.org/project/PyYAML/ GitHub repository: https://github.com/yaml/pyyaml/ Bug tracking: https://github.com/yaml/pyyaml/issues YAML homepage: http://yaml.org/ YAML-core mailing list: http://lists.sourceforge.net/lists/listinfo/yaml-core About PyYAML ============ YAML is a data serialization format designed for human readability and interaction with scripting languages. PyYAML is a YAML parser and emitter for Python. PyYAML features a complete YAML 1.1 parser, Unicode support, pickle support, capable extension API, and sensible error messages. PyYAML supports standard YAML tags and provides Python-specific tags that allow to represent an arbitrary Python object. PyYAML is applicable for a broad range of tasks from complex configuration files to object serialization and persistence. Example ======= >>> import yaml >>> yaml.full_load(""" ... name: PyYAML ... description: YAML parser and emitter for Python ... homepage: https://github.com/yaml/pyyaml ... keywords: [YAML, serialization, configuration, persistence, pickle] ... """) {'keywords': ['YAML', 'serialization', 'configuration', 'persistence', 'pickle'], 'homepage': 'https://github.com/yaml/pyyaml', 'description': 'YAML parser and emitter for Python', 'name': 'PyYAML'} >>> print(yaml.dump(_)) name: PyYAML homepage: https://github.com/yaml/pyyaml description: YAML parser and emitter for Python keywords: [YAML, serialization, configuration, persistence, pickle] Maintainers =========== The following people are currently responsible for maintaining PyYAML: * Ingy d?t Net * Tina Mueller * Matt Davis and many thanks to all who have contribributed! See: https://github.com/yaml/pyyaml/pulls Copyright ========= Copyright (c) 2017-2019 Ingy d?t Net Copyright (c) 2006-2016 Kirill Simonov The PyYAML module was written by Kirill Simonov . It is currently maintained by the YAML and Python communities. PyYAML is released under the MIT license. See the file LICENSE for more details. From PythonList at DancesWithMice.info Mon Dec 2 17:22:19 2019 From: PythonList at DancesWithMice.info (DL Neil) Date: Tue, 3 Dec 2019 11:22:19 +1300 Subject: Extract sentences in nested parentheses using Python In-Reply-To: References: <7a365fa0-e721-4a95-8c30-d3661301a7b2@googlegroups.com> Message-ID: <1d689634-c3db-dd3f-c432-111604d5a784@DancesWithMice.info> On 3/12/19 6:00 AM, Peter Otten wrote: > A S wrote: > I think I've seen this question before ;) In addition to 'other reasons' for @Peter's comment, it is a common ComSc worked-problem or assignment. (in which case, we'd appreciate being told that you/OP is asking for help with "homework") >> I am trying to extract all strings in nested parentheses (along with the >> parentheses itself) in my .txt file. Please see the sample .txt file that >> I have used in this example here: >> (https://drive.google.com/open?id=1UKc0ZgY9Fsz5O1rSeBCLqt5dwZkMaQgr). >> >> I have tried and done up three different codes but none of them seems to >> be able to extract all the nested parentheses. They can only extract a >> portion of the nested parentheses. Any advice on what I've done wrong >> could really help! One approach is to research in the hope that there are already existing tools or techniques which may help/save you from 'reinventing the wheel' - when you think about it, a re-statement of open-source objectives. How does the Python interpreter break-down Python (text) code into its constituent parts ("tokens") *including* parentheses? Such are made available in (perhaps) a lesser-known corner of the PSL (Python Standard Library). Might you be able to use one such tool? The ComSc technique which sprang to mind involves "stacks" (a LIFO data structure) and "RPN" (Reverse Polish Notation). Whereas we like people to take their turn when it comes to limited resources, eg to form a "queue" to purchase/pay for goods at the local store, which is "FIFO" (first-in, first-out); a "stack"/LIFO (last-in, first-out) can be problematic to put into practical application. There are plenty of Python implementations or you can 'roll your own' with a list. Again, I'd likely employ a "deque" from the PSL's Collections library (although as a "stack" rather than as a "double-ended queue"), because the optimisation comes "free". (to my laziness, but after some kind soul sweated-bullets to make it fast (in both senses) for 'the rest of us'!) > It's probably easier to understand and implement when you process the > complete text at once. Then arbitrary splits don't get in the way of your > quest for ( and ). You just have to remember the position of the first > opening ( and number of opening parens that have to be closed before you > take the complete expression: +1 but as a 'silver surfer', I don't like to be asked to "remember" anything! > level: 00011112222100 > we need^ Consider: original_text (the contents of the .txt file - add buffering if volumes are huge) current_text (the characters we have processed/"recognised" so-far) stack (what an original name for such a data-structure! Which will contain each of the partial parenthetical expressions found - but yet to be proven/made complete) set current_text to NULSTRING for each current_character in original_text: if current_character is LEFT_PARENTHESIS: push current_text to stack set current_text to LEFT_PARENTHESIS concatenate current_character with current_text if current_character is RIGHT_PARENTHESIS: # current_text is a parenthetical expression # do with it what you will pop the stack set current_text to the ex-stack string \ concat current_text's p-expn Once working: cover 'special cases' (after above loop), eg original_text which doesn't begin and/or end with parentheses; and error cases, eg pop-ping a NULSTRING, or thinking things are finished but the stack is not yet empty - likely events from unbalanced parentheses! original text = "abc(def(gh))ij" event 1: in-turn, concatenate characters "abc" as current_text event 2: locate (first) left-parenthesis, push current_text to stack(&) event 3: concatenate "(def" event 4: push, likewise event 5: concatenate "(gh" event 6: locate (first) right-parenthesis (matches to left-parenthesis begining the current_string!) result?: ?print current_text? event 7: pop stack and redefine current_text as "(def(gh)" event 8: repeat, per event 6 event 9: current_text will now become "(def(gh))" event 10: (special case) run-out of input at "(def(gh)ij" event 11: (special case) pop (!any) stack and report "abc(def(gh)" NB not sure of need for a "level" number; but if required, you can infer that at any time, from the depth/len() of the stack! NBB being a simple-boy, my preference is for a 'single layer' of code, cf @Peter's generator. Regardless the processes are "tokenisation" and "recognition". At the back of my mind, was the notion that you may (eventually) be required to work with more than parentheses, eg pair-wise square-brackets and/or quotation-marks. In which case, you will need to also 'push' the token and check for token-pairs when 'pop-ping', as well as (perhaps) recognising lists of tokens to tokenise instead of the two parenthesis characters alone. In which case, I'd take a serious look at the Python Language Services rather than taking a 'roll your own' approach! Contrarily, if this spec is 'it', then you might consider optimising the search processes which 'trigger' the two stack operations, by re-working the for-loop and utilising string.find() - prioritising whichever parenthesis is left-most/comes-first - assuming LtR text. (apologies if you have already tried this in one of your previous approaches) Unfortunately, such likely results in 'layers' of code, and a generator might well become the tool-of-choice (I say this before @Peter comes back and (quite deservedly) flays me alive!). WebRefs: Python Language Services: https://docs.python.org/3/library/language.html collections ? Container datatypes: https://docs.python.org/3/library/collections.html See also your ComSc text/reference materials. -- Regards =dn From christian at python.org Mon Dec 2 17:37:45 2019 From: christian at python.org (Christian Heimes) Date: Mon, 2 Dec 2019 23:37:45 +0100 Subject: array and struct 64-bit Linux change in behavior Python 3.7 and 2.7 In-Reply-To: <475624D9-15D8-4A82-A8B4-5A174BA87DCD@barrys-emacs.org> References:

<475624D9-15D8-4A82-A8B4-5A174BA87DCD@barrys-emacs.org> Message-ID: <94e9ab79-12dd-e891-be80-7df913d5fdf4@python.org> On 02/12/2019 22.25, Barry Scott wrote: > > >> On 2 Dec 2019, at 17:55, Rob Gaddi wrote: >> >> On 12/2/19 9:26 AM, Chris Clark wrote: >>> Test case: >>> import array >>> array.array('L', [0]) >>> # x.itemsize == 8 rather than 4 >>> This works fine (returns 4) under Windows Python 3.7.3 64-bit build. >>> Under Ubuntu; Python 2.7.15rc1, 3.6.5, 3.70b3 64-bit this returns 8. Documentation at https://docs.python.org/3/library/array.html explicitly states 'L' is for size 4. >>> It impacts all uses types of array (e.g. reading from byte strings). >>> The struct module is a little different: >>> import struct >>> x = struct.pack('L', 0) >>> # len(x) ===8 rather than 4 >>> This can be worked around by using '=L' - which is not well documented - so this maybe a doc issue. >>> Wanted to post here for comments before opening a bug at https://bugs.python.org/ >>> Is anyone seeing this under Debian/Ubuntu? >>> Chris >> >> I'd say not a bug, at least in array. Reading that array documentation you linked, 4 is explicitly the MINIMUM size in bytes, not the guaranteed size. > > I'm wondering how useful it is that for array you can read from a file but have no ideas how many bytes each item needs. > If I have a file with int32_t in it I cannot from the docs know how to read that file into an array. > >> >> The struct situation is, as you said, a bit different. I believe that with the default native alignment @, you're seeing 4-byte data padded to an 8-byte alignment, not 8-byte data. That does seem to go against what the struct documentation says, "Padding is only automatically added between successive structure members. No padding is added at the beginning or the end of the encoded struct." > > The 'L' in struct is documented for 3.7 to use 4 bytes, but in fact uses 8, on fedora 31. Doc bug? The documentation of the struct and array module carefully speak of "minimum size in bytes", "standard size" and "native size". It's easy to miss that it doesn't state just "size" for a reason. A long is not int32_t. The actual size of long and unsigned long depend on the ABI of your platform. The standard defined a long as *at least* 4 bytes. On Windows it's always a 32 bit data type. On POSIX sizeof(long) is usually the same as the size of a pointer, 4 bytes on 32 bit platforms and 8 bytes on 64 bit platforms. https://en.wikipedia.org/wiki/Integer_(computer_science)#Long_integer I agree that the behavior is confusing, even for C programmers. Please feel free to open a ticket and request an improvement of the documentation. Christian From torriem at gmail.com Mon Dec 2 18:49:17 2019 From: torriem at gmail.com (Michael Torrie) Date: Mon, 2 Dec 2019 16:49:17 -0700 Subject: increasing the page size of a dbm store? In-Reply-To: <20191201205016.577673f9@bigbox.attlocal.net> References: <20191126192432.535ee241@bigbox.attlocal.net> <0EAFC9E0-02D2-4745-99B7-654B7A445EDD@barrys-emacs.org> <20191201205016.577673f9@bigbox.attlocal.net> Message-ID: <4dd1ac29-8803-a1bd-3158-0cd90f392c48@gmail.com> On 12/1/19 7:50 PM, Tim Chase wrote: > After sparring with it a while, I tweaked the existing job so that it > chunked things into dbm-appropriate sizes to limp through; for the > subsequent job (where I would have used dbm again) I went ahead and > switched to sqlite and had no further issues. How did you replace a key/value store with a relational database? Is a SQLite database fast enough at this sort of thing that it wasn't really designed for? From Richard at Damon-Family.org Mon Dec 2 20:50:16 2019 From: Richard at Damon-Family.org (Richard Damon) Date: Mon, 2 Dec 2019 20:50:16 -0500 Subject: array and struct 64-bit Linux change in behavior Python 3.7 and 2.7 In-Reply-To: <475624D9-15D8-4A82-A8B4-5A174BA87DCD@barrys-emacs.org> References:

<475624D9-15D8-4A82-A8B4-5A174BA87DCD@barrys-emacs.org> Message-ID: <2b3d0c1a-aab4-9084-1011-975f911352d8@Damon-Family.org> On 12/2/19 4:25 PM, Barry Scott wrote: > >> On 2 Dec 2019, at 17:55, Rob Gaddi wrote: >> >> On 12/2/19 9:26 AM, Chris Clark wrote: >>> Test case: >>> import array >>> array.array('L', [0]) >>> # x.itemsize == 8 rather than 4 >>> This works fine (returns 4) under Windows Python 3.7.3 64-bit build. >>> Under Ubuntu; Python 2.7.15rc1, 3.6.5, 3.70b3 64-bit this returns 8. Documentation at https://docs.python.org/3/library/array.html explicitly states 'L' is for size 4. >>> It impacts all uses types of array (e.g. reading from byte strings). >>> The struct module is a little different: >>> import struct >>> x = struct.pack('L', 0) >>> # len(x) ===8 rather than 4 >>> This can be worked around by using '=L' - which is not well documented - so this maybe a doc issue. >>> Wanted to post here for comments before opening a bug at https://bugs.python.org/ >>> Is anyone seeing this under Debian/Ubuntu? >>> Chris >> I'd say not a bug, at least in array. Reading that array documentation you linked, 4 is explicitly the MINIMUM size in bytes, not the guaranteed size. > I'm wondering how useful it is that for array you can read from a file but have no ideas how many bytes each item needs. > If I have a file with int32_t in it I cannot from the docs know how to read that file into an array. > >> The struct situation is, as you said, a bit different. I believe that with the default native alignment @, you're seeing 4-byte data padded to an 8-byte alignment, not 8-byte data. That does seem to go against what the struct documentation says, "Padding is only automatically added between successive structure members. No padding is added at the beginning or the end of the encoded struct." > The 'L' in struct is documented for 3.7 to use 4 bytes, but in fact uses 8, on fedora 31. Doc bug? > >>>> x=struct.pack('L',0x102030405) >>>> x > b'\x05\x04\x03\x02\x01\x00\x00\x00' > > Given I have exact control with b, h, i, and q but L is not fixed in size I'm not sure how it can be used with certainty across OS and versions. > > Barry > Actually, you DON'T have exact control with those sizes, it just happens that all the platforms you are using happen to have the same size for those types. Welcome to the ambiguity in the C type system, the basic types are NOT fixed in size. L means 'Long' and as Christian said, that is 8 byte long on Linux-64 bit. 'L' is exactly the right type for interfacing with a routine defined as taking a long. The issue is that you don't know what type a int32_t will be (it might be int, or it might be long, and long might not be 32 bits, it will be at least 32 bits). Perhaps array could be extended so that it took '4' for a 4 byte integer and '8' for an 8 byte integer (maybe 'U4' and 'U8' for unsigned). Might as well also allow 1 and 2 for completeness for char and short (but those are currently consistent). -- Richard Damon From maisarah at avlinfotech.net Mon Dec 2 21:29:11 2019 From: maisarah at avlinfotech.net (Maisarah) Date: Tue, 03 Dec 2019 10:29:11 +0800 Subject: Python Error Still Occured on sklearn In-Reply-To: <16ec5f25a9c.f89474fd188788.8117256455982324362@avlinfotech.net> References: <16ec5f25a9c.f89474fd188788.8117256455982324362@avlinfotech.net> Message-ID: <16ec99711a1.129d57e36335551.7298109250136335338@avlinfotech.net> Thank you. Maisarah Binti Mohd Yusak Certified CPRE-FL & CTFL Software Tester, IT Team. AVL Infotech (Malaysia) Sdn. Bhd. ? L2-I-3, Enterprise - 4 , Technology Park Malaysia, Bukit Jalil, Kuala Lumpur, Malaysia -57000 Mobile: +6016 507 3051 Mail:?mailto:maisarah at avlinfotech.net LinkedIn: https://my.linkedin.com/in/maimoyu Web:?http://www.avlinfotech.net/ ---- On Mon, 02 Dec 2019 17:30:24 +0800 Maisarah wrote ---- Dear Admin, I have install and upgrade Cython as well.? I have modified and repaired and even update the library but error is still occurred: C:\Windows\system32>pip install -U scikit-learn Collecting scikit-learn ? Using cached https://files.pythonhosted.org/packages/1e/ce/9d8c88e68af0a5b5c5d78d8d2b7bcadfd45e1d6afc863ccb9aee30765b06/scikit-learn-0.21.3.tar.gz Requirement already satisfied, skipping upgrade: numpy>=1.11.0 in c:\users\user\appdata\local\programs\python\python38\lib\site-packages (from scikit-learn) (1.17.4) Requirement already satisfied, skipping upgrade: scipy>=0.17.0 in c:\users\user\appdata\local\programs\python\python38\lib\site-packages (from scikit-learn) (1.3.3) Requirement already satisfied, skipping upgrade: joblib>=0.11 in c:\users\user\appdata\local\programs\python\python38\lib\site-packages (from scikit-learn) (0.14.0) Installing collected packages: scikit-learn ??? Running setup.py install for scikit-learn ... error ??? ERROR: Command errored out with exit status 1: ???? command: 'c:\users\user\appdata\local\programs\python\python38\python.exe' -u -c 'import sys, setuptools, tokenize; sys.argv[0] = '"'"'C:\\Users\\user\\AppData\\Local\\Temp\\pip-install-caioz9bv\\scikit-learn\\setup.py'"'"'; __file__='"'"'C:\\Users\\user\\AppData\\Local\\Temp\\pip-install-caioz9bv\\scikit-learn\\setup.py'"'"';f=getattr(tokenize, '"'"'open'"'"', open)(__file__);code=f.read().replace('"'"'\r\n'"'"', '"'"'\n'"'"');f.close();exec(compile(code, __file__, '"'"'exec'"'"'))' install --record 'C:\Users\user\AppData\Local\Temp\pip-record-o9y5q4bk\install-record.txt' --single-version-externally-managed --compile ???????? cwd: C:\Users\user\AppData\Local\Temp\pip-install-caioz9bv\scikit-learn\ ??? Complete output (44 lines): ??? Partial import of sklearn during the build process. ??? No module named 'numpy.distutils._msvccompiler' in numpy.distutils; trying from distutils ??? Traceback (most recent call last): ????? File "c:\users\user\appdata\local\programs\python\python38\lib\site-packages\setuptools\msvc.py", line 489, in _find_latest_available_vc_ver ??????? return self.find_available_vc_vers()[-1] ??? IndexError: list index out of range ??? During handling of the above exception, another exception occurred: ??? Traceback (most recent call last): ????? File "", line 1, in ????? File "C:\Users\user\AppData\Local\Temp\pip-install-caioz9bv\scikit-learn\setup.py", line 290, in ??????? setup_package() ????? File "C:\Users\user\AppData\Local\Temp\pip-install-caioz9bv\scikit-learn\setup.py", line 286, in setup_package ??????? setup(**metadata) ????? File "c:\users\user\appdata\local\programs\python\python38\lib\site-packages\numpy\distutils\core.py", line 137, in setup ??????? config = configuration() ????? File "C:\Users\user\AppData\Local\Temp\pip-install-caioz9bv\scikit-learn\setup.py", line 174, in configuration ??????? config.add_subpackage('sklearn') ????? File "c:\users\user\appdata\local\programs\python\python38\lib\site-packages\numpy\distutils\misc_util.py", line 1033, in add_subpackage ??????? config_list = self.get_subpackage(subpackage_name, subpackage_path, ????? File "c:\users\user\appdata\local\programs\python\python38\lib\site-packages\numpy\distutils\misc_util.py", line 999, in get_subpackage ??????? config = self._get_configuration_from_setup_py( ????? File "c:\users\user\appdata\local\programs\python\python38\lib\site-packages\numpy\distutils\misc_util.py", line 941, in _get_configuration_from_setup_py ??????? config = setup_module.configuration(*args) ????? File "sklearn\setup.py", line 76, in configuration ??????? maybe_cythonize_extensions(top_path, config) ????? File "C:\Users\user\AppData\Local\Temp\pip-install-caioz9bv\scikit-learn\sklearn\_build_utils\__init__.py", line 42, in maybe_cythonize_extensions ??????? with_openmp = check_openmp_support() ????? File "C:\Users\user\AppData\Local\Temp\pip-install-caioz9bv\scikit-learn\sklearn\_build_utils\openmp_helpers.py", line 83, in check_openmp_support ??????? ccompiler.compile(['test_openmp.c'], output_dir='objects', ????? File "c:\users\user\appdata\local\programs\python\python38\lib\distutils\_msvccompiler.py", line 360, in compile ??????? self.initialize() ????? File "c:\users\user\appdata\local\programs\python\python38\lib\distutils\_msvccompiler.py", line 253, in initialize ??????? vc_env = _get_vc_env(plat_spec) ????? File "c:\users\user\appdata\local\programs\python\python38\lib\site-packages\setuptools\msvc.py", line 185, in msvc14_get_vc_env ??????? return EnvironmentInfo(plat_spec, vc_min_ver=14.0).return_env() ????? File "c:\users\user\appdata\local\programs\python\python38\lib\site-packages\setuptools\msvc.py", line 843, in __init__ ??????? self.si = SystemInfo(self.ri, vc_ver) ????? File "c:\users\user\appdata\local\programs\python\python38\lib\site-packages\setuptools\msvc.py", line 485, in __init__ ??????? self.vc_ver = vc_ver or self._find_latest_available_vc_ver() ????? File "c:\users\user\appdata\local\programs\python\python38\lib\site-packages\setuptools\msvc.py", line 492, in _find_latest_available_vc_ver ??????? raise distutils.errors.DistutilsPlatformError(err) ??? distutils.errors.DistutilsPlatformError: Microsoft Visual C++ 14.0 is required. Get it with "Microsoft Visual C++ Build Tools": https://visualstudio.microsoft.com/downloads/ ??? ---------------------------------------- ERROR: Command errored out with exit status 1: 'c:\users\user\appdata\local\programs\python\python38\python.exe' -u -c 'import sys, setuptools, tokenize; sys.argv[0] = '"'"'C:\\Users\\user\\AppData\\Local\\Temp\\pip-install-caioz9bv\\scikit-learn\\setup.py'"'"'; __file__='"'"'C:\\Users\\user\\AppData\\Local\\Temp\\pip-install-caioz9bv\\scikit-learn\\setup.py'"'"';f=getattr(tokenize, '"'"'open'"'"', open)(__file__);code=f.read().replace('"'"'\r\n'"'"', '"'"'\n'"'"');f.close();exec(compile(code, __file__, '"'"'exec'"'"'))' install --record 'C:\Users\user\AppData\Local\Temp\pip-record-o9y5q4bk\install-record.txt' --single-version-externally-managed --compile Check the logs for full command output. C:\Windows\system32>pip install --upgrade -U scikit-learn Collecting scikit-learn ? Using cached https://files.pythonhosted.org/packages/1e/ce/9d8c88e68af0a5b5c5d78d8d2b7bcadfd45e1d6afc863ccb9aee30765b06/scikit-learn-0.21.3.tar.gz Requirement already satisfied, skipping upgrade: numpy>=1.11.0 in c:\users\user\appdata\local\programs\python\python38\lib\site-packages (from scikit-learn) (1.17.4) Requirement already satisfied, skipping upgrade: scipy>=0.17.0 in c:\users\user\appdata\local\programs\python\python38\lib\site-packages (from scikit-learn) (1.3.3) Requirement already satisfied, skipping upgrade: joblib>=0.11 in c:\users\user\appdata\local\programs\python\python38\lib\site-packages (from scikit-learn) (0.14.0) Installing collected packages: scikit-learn ??? Running setup.py install for scikit-learn ... error ??? ERROR: Command errored out with exit status 1: ???? command: 'c:\users\user\appdata\local\programs\python\python38\python.exe' -u -c 'import sys, setuptools, tokenize; sys.argv[0] = '"'"'C:\\Users\\user\\AppData\\Local\\Temp\\pip-install-48dlq_5d\\scikit-learn\\setup.py'"'"'; __file__='"'"'C:\\Users\\user\\AppData\\Local\\Temp\\pip-install-48dlq_5d\\scikit-learn\\setup.py'"'"';f=getattr(tokenize, '"'"'open'"'"', open)(__file__);code=f.read().replace('"'"'\r\n'"'"', '"'"'\n'"'"');f.close();exec(compile(code, __file__, '"'"'exec'"'"'))' install --record 'C:\Users\user\AppData\Local\Temp\pip-record-arwze1t4\install-record.txt' --single-version-externally-managed --compile ???????? cwd: C:\Users\user\AppData\Local\Temp\pip-install-48dlq_5d\scikit-learn\ ??? Complete output (44 lines): ??? Partial import of sklearn during the build process. ??? No module named 'numpy.distutils._msvccompiler' in numpy.distutils; trying from distutils ??? Traceback (most recent call last): ????? File "c:\users\user\appdata\local\programs\python\python38\lib\site-packages\setuptools\msvc.py", line 489, in _find_latest_available_vc_ver ??????? return self.find_available_vc_vers()[-1] ??? IndexError: list index out of range ??? During handling of the above exception, another exception occurred: ??? Traceback (most recent call last): ????? File "", line 1, in ????? File "C:\Users\user\AppData\Local\Temp\pip-install-48dlq_5d\scikit-learn\setup.py", line 290, in ??????? setup_package() ????? File "C:\Users\user\AppData\Local\Temp\pip-install-48dlq_5d\scikit-learn\setup.py", line 286, in setup_package ??????? setup(**metadata) ????? File "c:\users\user\appdata\local\programs\python\python38\lib\site-packages\numpy\distutils\core.py", line 137, in setup ??????? config = configuration() ????? File "C:\Users\user\AppData\Local\Temp\pip-install-48dlq_5d\scikit-learn\setup.py", line 174, in configuration ??????? config.add_subpackage('sklearn') ????? File "c:\users\user\appdata\local\programs\python\python38\lib\site-packages\numpy\distutils\misc_util.py", line 1033, in add_subpackage ??????? config_list = self.get_subpackage(subpackage_name, subpackage_path, ????? File "c:\users\user\appdata\local\programs\python\python38\lib\site-packages\numpy\distutils\misc_util.py", line 999, in get_subpackage ??????? config = self._get_configuration_from_setup_py( ????? File "c:\users\user\appdata\local\programs\python\python38\lib\site-packages\numpy\distutils\misc_util.py", line 941, in _get_configuration_from_setup_py ??????? config = setup_module.configuration(*args) ????? File "sklearn\setup.py", line 76, in configuration ??????? maybe_cythonize_extensions(top_path, config) ????? File "C:\Users\user\AppData\Local\Temp\pip-install-48dlq_5d\scikit-learn\sklearn\_build_utils\__init__.py", line 42, in maybe_cythonize_extensions ??????? with_openmp = check_openmp_support() ????? File "C:\Users\user\AppData\Local\Temp\pip-install-48dlq_5d\scikit-learn\sklearn\_build_utils\openmp_helpers.py", line 83, in check_openmp_support ??????? ccompiler.compile(['test_openmp.c'], output_dir='objects', ????? File "c:\users\user\appdata\local\programs\python\python38\lib\distutils\_msvccompiler.py", line 360, in compile ??????? self.initialize() ????? File "c:\users\user\appdata\local\programs\python\python38\lib\distutils\_msvccompiler.py", line 253, in initialize ??????? vc_env = _get_vc_env(plat_spec) ????? File "c:\users\user\appdata\local\programs\python\python38\lib\site-packages\setuptools\msvc.py", line 185, in msvc14_get_vc_env ??????? return EnvironmentInfo(plat_spec, vc_min_ver=14.0).return_env() ????? File "c:\users\user\appdata\local\programs\python\python38\lib\site-packages\setuptools\msvc.py", line 843, in __init__ ??????? self.si = SystemInfo(self.ri, vc_ver) ????? File "c:\users\user\appdata\local\programs\python\python38\lib\site-packages\setuptools\msvc.py", line 485, in __init__ ??????? self.vc_ver = vc_ver or self._find_latest_available_vc_ver() ????? File "c:\users\user\appdata\local\programs\python\python38\lib\site-packages\setuptools\msvc.py", line 492, in _find_latest_available_vc_ver ??????? raise distutils.errors.DistutilsPlatformError(err) ??? distutils.errors.DistutilsPlatformError: Microsoft Visual C++ 14.0 is required. Get it with "Microsoft Visual C++ Build Tools": https://visualstudio.microsoft.com/downloads/ ??? ---------------------------------------- ERROR: Command errored out with exit status 1: 'c:\users\user\appdata\local\programs\python\python38\python.exe' -u -c 'import sys, setuptools, tokenize; sys.argv[0] = '"'"'C:\\Users\\user\\AppData\\Local\\Temp\\pip-install-48dlq_5d\\scikit-learn\\setup.py'"'"'; __file__='"'"'C:\\Users\\user\\AppData\\Local\\Temp\\pip-install-48dlq_5d\\scikit-learn\\setup.py'"'"';f=getattr(tokenize, '"'"'open'"'"', open)(__file__);code=f.read().replace('"'"'\r\n'"'"', '"'"'\n'"'"');f.close();exec(compile(code, __file__, '"'"'exec'"'"'))' install --record 'C:\Users\user\AppData\Local\Temp\pip-record-arwze1t4\install-record.txt' --single-version-externally-managed --compile Check the logs for full command output. C:\Windows\system32> Kindly advice in English language, please. Thank you. Maisarah Binti Mohd Yusak Certified CPRE-FL & CTFL Software Tester, IT Team. AVL Infotech (Malaysia) Sdn. Bhd. ? L2-I-3, Enterprise - 4 , Technology Park Malaysia, Bukit Jalil, Kuala Lumpur, Malaysia -57000 Mobile: +6016 507 3051 Mail:?mailto:maisarah at avlinfotech.net LinkedIn: https://my.linkedin.com/in/maimoyu Web:?http://www.avlinfotech.net/ From veek at dont-use-this.com Tue Dec 3 04:19:28 2019 From: veek at dont-use-this.com (Veek M) Date: Tue, 3 Dec 2019 09:19:28 -0000 (UTC) Subject: Extending property using a Subclass - single method - why Super(Baz, Baz).name.__set__ ? Message-ID: class Foo(object): @property def name(self): if hasattr(self, '_name'): print('Foo name', self._name) return self._name else: return 'default' @name.setter def name(self, value): print('Foo', self) self._name = value print(self._name) @name.deleter def name(self): print('del') self._name = None print('Foo', name) class Baz(Foo): @property def name(self): print('Baz wrapper around getter') return super().name @Foo.name.setter def name(self, value): print('Baz wrapper around setter') print(self) print(super(Baz,Baz).name, value) return super(Baz, Baz).name.__set__(self, value) b = Baz() print('print', b.name) b.name = 'v' print(b.name) Why do we user super(Baz, Baz) - are we setting a class variable called Baz.name which would trigger Baz._name = value? We are essentially doing: Foo.name.__set__(Baz, value) ? How come 'self' is not used.. like in the traditional property way where we pass an instance reference instead of a class? From __peter__ at web.de Tue Dec 3 06:16:12 2019 From: __peter__ at web.de (Peter Otten) Date: Tue, 03 Dec 2019 12:16:12 +0100 Subject: Extending property using a Subclass - single method - why Super(Baz, Baz).name.__set__ ? References: Message-ID: Veek M wrote: > class Foo(object): > @property > def name(self): > if hasattr(self, '_name'): > print('Foo name', self._name) > return self._name > else: > return 'default' > > @name.setter > def name(self, value): > print('Foo', self) > self._name = value > print(self._name) > > @name.deleter > def name(self): > print('del') > self._name = None > > print('Foo', name) > > class Baz(Foo): > @property > def name(self): > print('Baz wrapper around getter') > return super().name > > @Foo.name.setter This looks like a bug as the read-only property defined above is overwritten by a copy of the name poperty of the base class (with an updated setter). > def name(self, value): > print('Baz wrapper around setter') > print(self) > print(super(Baz,Baz).name, value) > return super(Baz, Baz).name.__set__(self, value) When you want to invoke a method of the base class property you have to look up that property in the base class. This is what super(Baz, Baz).name does. super().name for read access is an exception. The __get__() method is invoked implicitly which is probably also the reason why you cannot write super().name.somepropertymethod(...) for anything else. > > b = Baz() > print('print', b.name) > b.name = 'v' > print(b.name) > > Why do we user super(Baz, Baz) - are we setting a class variable called > Baz.name which would trigger Baz._name = value? > > We are essentially doing: > Foo.name.__set__(Baz, value) ? > > How come 'self' is not used.. like in the traditional property way where > we pass an instance reference instead of a class? From veek at dont-use-this.com Tue Dec 3 08:36:55 2019 From: veek at dont-use-this.com (Veek M) Date: Tue, 3 Dec 2019 13:36:55 -0000 (UTC) Subject: Extending property using a Subclass - single method - why Super(Baz, Baz).name.__set__ ? References:

Message-ID: you've misunderstood my question, let me try again: So this is a simple descriptor class and as you can see, dunder-set needs 3 args: the descriptor CONTAINER/Bar-instance is the first arg, then a reference to the using instance/Foo-instance class Bar(object): def __set__(self, instance, value): #b-instance of Bar, f-instance of Foo, value print(self, instance, value) class Foo(object): b = Bar() f = Foo() print(f) f.b = 10 1. Now when we create/use @property.. what is the first and second argument to dunder-set (my guess is, the @property is the first arg and the second arg is 'Foo' IF you do class Foo(object): @property def whatever.. Am I right? Is there a way to check? 2. The Class Bar/descriptor acts a wrapper/protector for some sekret _var and therefore it gets all the data needed to make a judgement call.. that is, it's own name/instance-ref and the using class/instance-name-ref Note that he's receiving instance-references therefore when I start sub-classing a property why does he then switch to class-references/class-variables From aishan0403 at gmail.com Tue Dec 3 08:41:18 2019 From: aishan0403 at gmail.com (A S) Date: Tue, 3 Dec 2019 05:41:18 -0800 (PST) Subject: Extract sentences in nested parentheses using Python In-Reply-To: References: <7a365fa0-e721-4a95-8c30-d3661301a7b2@googlegroups.com>

Message-ID: On Tuesday, 3 December 2019 01:01:25 UTC+8, Peter Otten wrote: > A S wrote: > > I think I've seen this question before ;) > > > I am trying to extract all strings in nested parentheses (along with the > > parentheses itself) in my .txt file. Please see the sample .txt file that > > I have used in this example here: > > (https://drive.google.com/open?id=1UKc0ZgY9Fsz5O1rSeBCLqt5dwZkMaQgr). > > > > I have tried and done up three different codes but none of them seems to > > be able to extract all the nested parentheses. They can only extract a > > portion of the nested parentheses. Any advice on what I've done wrong > > could really help! > > > > Here are the three codes I have done so far: > > > > 1st attempt: > > > > import re > > from os.path import join > > > > def balanced_braces(args): > > parts = [] > > for arg in args: > > if '(' not in arg: > > continue > > There could still be a ")" that you miss > > > chars = [] > > n = 0 > > for c in arg: > > if c == '(': > > if n > 0: > > chars.append(c) > > n += 1 > > elif c == ')': > > n -= 1 > > if n > 0: > > chars.append(c) > > elif n == 0: > > parts.append(''.join(chars).lstrip().rstrip()) > > chars = [] > > elif n > 0: > > chars.append(c) > > return parts > > It's probably easier to understand and implement when you process the > complete text at once. Then arbitrary splits don't get in the way of your > quest for ( and ). You just have to remember the position of the first > opening ( and number of opening parens that have to be closed before you > take the complete expression: > > level: 00011112222100 > text: abc(def(gh))ij > when we are here^ > we need^ > > A tentative implementation: > > $ cat parse.py > import re > > NOT_SET = object() > > def scan(text): > level = 0 > start = NOT_SET > for m in re.compile("[()]").finditer(text): > if m.group() == ")": > level -= 1 > if level < 0: > raise ValueError("underflow: more closing than opening > parens") > if level == 0: > # outermost closing parenthesis: > # deliver enclosed string including parens. > yield text[start:m.end()] > start = NOT_SET > elif m.group() == "(": > if level == 0: > # outermost opening parenthesis: remember position. > assert start is NOT_SET > start = m.start() > level += 1 > else: > assert False > if level > 0: > raise ValueError("unclosed parens remain") > > > if __name__ == "__main__": > with open("lan sample text file.txt") as instream: > text = instream.read() > for chunk in scan(text): > print(chunk) > $ python3 parse.py > ("xE'", PUT(xx.xxxx.),"'") > ("TRUuuuth") Hello Peter! I tried this on my actual working files and it returned this error: "unclosed parens remain". In this case, how can I continue to parse through my text files by only extracting those with balanced parentheses and ignore those that are incomplete? From __peter__ at web.de Tue Dec 3 10:47:49 2019 From: __peter__ at web.de (Peter Otten) Date: Tue, 03 Dec 2019 16:47:49 +0100 Subject: Extract sentences in nested parentheses using Python References: <7a365fa0-e721-4a95-8c30-d3661301a7b2@googlegroups.com>

Message-ID: A S wrote: > On Tuesday, 3 December 2019 01:01:25 UTC+8, Peter Otten wrote: >> A S wrote: >> >> I think I've seen this question before ;) >> >> > I am trying to extract all strings in nested parentheses (along with >> > the parentheses itself) in my .txt file. Please see the sample .txt >> > file that I have used in this example here: >> > (https://drive.google.com/open?id=1UKc0ZgY9Fsz5O1rSeBCLqt5dwZkMaQgr). >> > >> > I have tried and done up three different codes but none of them seems >> > to be able to extract all the nested parentheses. They can only extract >> > a portion of the nested parentheses. Any advice on what I've done wrong >> > could really help! >> > >> > Here are the three codes I have done so far: >> > >> > 1st attempt: >> > >> > import re >> > from os.path import join >> > >> > def balanced_braces(args): >> > parts = [] >> > for arg in args: >> > if '(' not in arg: >> > continue >> >> There could still be a ")" that you miss >> >> > chars = [] >> > n = 0 >> > for c in arg: >> > if c == '(': >> > if n > 0: >> > chars.append(c) >> > n += 1 >> > elif c == ')': >> > n -= 1 >> > if n > 0: >> > chars.append(c) >> > elif n == 0: >> > parts.append(''.join(chars).lstrip().rstrip()) >> > chars = [] >> > elif n > 0: >> > chars.append(c) >> > return parts >> >> It's probably easier to understand and implement when you process the >> complete text at once. Then arbitrary splits don't get in the way of your >> quest for ( and ). You just have to remember the position of the first >> opening ( and number of opening parens that have to be closed before you >> take the complete expression: >> >> level: 00011112222100 >> text: abc(def(gh))ij >> when we are here^ >> we need^ >> >> A tentative implementation: >> >> $ cat parse.py >> import re >> >> NOT_SET = object() >> >> def scan(text): >> level = 0 >> start = NOT_SET >> for m in re.compile("[()]").finditer(text): >> if m.group() == ")": >> level -= 1 >> if level < 0: >> raise ValueError("underflow: more closing than opening >> parens") >> if level == 0: >> # outermost closing parenthesis: >> # deliver enclosed string including parens. >> yield text[start:m.end()] >> start = NOT_SET >> elif m.group() == "(": >> if level == 0: >> # outermost opening parenthesis: remember position. >> assert start is NOT_SET >> start = m.start() >> level += 1 >> else: >> assert False >> if level > 0: >> raise ValueError("unclosed parens remain") >> >> >> if __name__ == "__main__": >> with open("lan sample text file.txt") as instream: >> text = instream.read() >> for chunk in scan(text): >> print(chunk) >> $ python3 parse.py >> ("xE'", PUT(xx.xxxx.),"'") >> ("TRUuuuth") > > Hello Peter! I tried this on my actual working files and it returned this > error: "unclosed parens remain". In this case, how can I continue to parse > through my text files by only extracting those with balanced parentheses > and ignore those that are incomplete? filenames = ... for filename in filenames: with open(filename) as instream: text = instream.read() try: chunks = list(scan(text)) except ValueError as err: print(f"{err} in file {filename!r}", file=sys.stderr) else: for chunk in chunks: print(chunk) From geremy85 at gmail.com Mon Dec 2 23:58:11 2019 From: geremy85 at gmail.com (geremy85 at gmail.com) Date: Mon, 2 Dec 2019 20:58:11 -0800 (PST) Subject: lxml question -- creating an etree.Element attribute with ':' in the name In-Reply-To: References:

Message-ID: <0fc93031-c19f-47ea-9539-c4d94e12314a@googlegroups.com> Theanks a lot From Karsten.Hilbert at gmx.net Tue Dec 3 11:00:45 2019 From: Karsten.Hilbert at gmx.net (Karsten Hilbert) Date: Tue, 3 Dec 2019 17:00:45 +0100 Subject: lxml question -- creating an etree.Element attribute with ':' in the name In-Reply-To: <0fc93031-c19f-47ea-9539-c4d94e12314a@googlegroups.com> References:

<0fc93031-c19f-47ea-9539-c4d94e12314a@googlegroups.com> Message-ID: <20191203160044.GB1277@hermes.hilbert.loc> On Mon, Dec 02, 2019 at 08:58:11PM -0800, geremy85 at gmail.com wrote: > Date: Mon, 2 Dec 2019 20:58:11 -0800 (PST) > From: geremy85 at gmail.com > To: python-list at python.org > Subject: Re: lxml question -- creating an etree.Element attribute with ':' > in the name > User-Agent: G2/1.0 > > Theanks a lot > -- > https://mail.python.org/mailman/listinfo/python-list you are welcoem -- GPG 40BE 5B0E C98E 1713 AFA6 5BC0 3BEA AC80 7D4F C89B From __peter__ at web.de Tue Dec 3 11:24:17 2019 From: __peter__ at web.de (Peter Otten) Date: Tue, 03 Dec 2019 17:24:17 +0100 Subject: Extending property using a Subclass - single method - why Super(Baz, Baz).name.__set__ ? References:

Message-ID: Veek M wrote: > you've misunderstood my question There were a lot of foobars bazzing in my head, but at least I tried ;) > , let me try again: > > So this is a simple descriptor class and as you can see, dunder-set needs > 3 args: the descriptor CONTAINER/Bar-instance is the first arg, then a > reference to the using instance/Foo-instance > > class Bar(object): > > def __set__(self, instance, value): > #b-instance of Bar, f-instance of Foo, value > print(self, instance, value) > > > class Foo(object): > b = Bar() > > f = Foo() > print(f) > f.b = 10 > > 1. Now when we create/use @property.. > what is the first and second argument to dunder-set > > (my guess is, the @property is the first arg and the second arg is 'Foo' > IF you do > > class Foo(object): > @property > def whatever.. > > Am I right? Is there a way to check? @foo def bar(...): ... is an alternative way to writing def bar(...): ... bar = foo(bar) so you can just add conforming __init__() and __get__() methods to your Bar descriptor and see for yourself: >> class Bar: ... def __init__(self, fget): pass ... def __set__(*args): print(args) ... def __get__(*args): pass ... >>> class Foo: ... @Bar ... def whatever(self): pass ... >>> foo = Foo() >>> foo.whatever = 42 (<__main__.Bar object at 0x7f92b55d70b8>, <__main__.Foo object at 0x7f92b55d70f0>, 42) > 2. The Class Bar/descriptor acts a wrapper/protector for some sekret _var > and therefore it gets all the data needed to make a judgement call.. that > is, it's own name/instance-ref and the using class/instance-name-ref > > Note that he's receiving instance-references > > therefore when I start sub-classing a property why does he then switch to > class-references/class-variables If you were subclassing a property you'd do class my_property(property): # tinker with __get/set/whatnot__ When you want to wrap a property defined in a superclass the property instance is not part of the class hierarchy that you are interested in. Instead of calling super() it has to figure out the base class of its "host" class manually. If you want overridable properties you can devise a way to look up the method: >>> class Foo: ... @property ... def name(self): return self.get_name() ... @name.setter ... def name(self, value): self.set_name(value) ... def set_name(self, value): self._name = value ... def get_name(self): return self._name ... >>> class Bar(Foo): ... def set_name(self, value): ... print("setting") ... super().set_name(value) ... def get_name(self): ... print("getting") ... return super().get_name() ... >>> foo = Foo() >>> foo.name = "bar" >>> foo.name 'bar' >>> bar = Bar() >>> bar.name = "baz" setting >>> bar.name getting 'baz' From python.list at tim.thechases.com Tue Dec 3 11:23:08 2019 From: python.list at tim.thechases.com (Tim Chase) Date: Tue, 3 Dec 2019 10:23:08 -0600 Subject: increasing the page size of a dbm store? In-Reply-To: <4dd1ac29-8803-a1bd-3158-0cd90f392c48@gmail.com> References: <20191126192432.535ee241@bigbox.attlocal.net> <0EAFC9E0-02D2-4745-99B7-654B7A445EDD@barrys-emacs.org> <20191201205016.577673f9@bigbox.attlocal.net> <4dd1ac29-8803-a1bd-3158-0cd90f392c48@gmail.com> Message-ID: <20191203102308.1c0911c8@bigbox.attlocal.net> On 2019-12-02 16:49, Michael Torrie wrote: > On 12/1/19 7:50 PM, Tim Chase wrote: > > After sparring with it a while, I tweaked the existing job so > > that it chunked things into dbm-appropriate sizes to limp > > through; for the subsequent job (where I would have used dbm > > again) I went ahead and switched to sqlite and had no further > > issues. > > How did you replace a key/value store with a relational database? > Is a SQLite database fast enough at this sort of thing that it > wasn't really designed for? It was certainly slower, though it wasn't so bad once I had proper indexing and submitted queries that pulled back multiple results in one query. But even with the slightly slower run-time aspects, it was still faster than starting a job (expecting it to run to completion overnight), having it crash, manually deleting my cache, and manually resuming from where it left off, all multiple times. And all said, since it was network I/O bound, once I had the populated cache (resulting cache.db file was about 1TB...thank goodness for transparent compression with ZFS), turnarounds took more like 30min rather than 3 days. More the "go work on something else and come back" than the "let it run overnight". -tkc From rgaddi at highlandtechnology.invalid Tue Dec 3 13:01:22 2019 From: rgaddi at highlandtechnology.invalid (Rob Gaddi) Date: Tue, 3 Dec 2019 10:01:22 -0800 Subject: array and struct 64-bit Linux change in behavior Python 3.7 and 2.7 In-Reply-To: References:

<475624D9-15D8-4A82-A8B4-5A174BA87DCD@barrys-emacs.org> <2b3d0c1a-aab4-9084-1011-975f911352d8@Damon-Family.org> Message-ID: On 12/2/19 5:50 PM, Richard Damon wrote: > > Perhaps array could be extended so that it took '4' for a 4 byte integer > and '8' for an 8 byte integer (maybe 'U4' and 'U8' for unsigned). Might > as well also allow 1 and 2 for completeness for char and short (but > those are currently consistent). > I will note that numpy arrays give exactly this level of control, as do ctypes arrays. The standard array library might just be the wrong tool for the job of reading binary data. From Chris.Clark at actian.com Wed Dec 4 00:15:05 2019 From: Chris.Clark at actian.com (Chris Clark) Date: Wed, 4 Dec 2019 05:15:05 +0000 Subject: array and struct 64-bit Linux change in behavior Python 3.7 and 2.7 In-Reply-To: <2b3d0c1a-aab4-9084-1011-975f911352d8@Damon-Family.org> References:

<475624D9-15D8-4A82-A8B4-5A174BA87DCD@barrys-emacs.org> <2b3d0c1a-aab4-9084-1011-975f911352d8@Damon-Family.org> Message-ID: Thanks for all the replies (and apologies for top posting, I have a brain dead email client ?). I think the consensus from the various threads is that the docs are either lacking or misleading. I mentioned that this impacts bytes and the problem there is more telling as it hard fails (this is how I first discovered this was an issue): >>> array.array('L', b'\0\0\0\0') Traceback (most recent call last): File "", line 1, in ValueError: string length not a multiple of item size I don't believe the documentation is accurate by using the word "minimum". Minimum would suggest that it would accept a 4-byte value as a minimum and on 64-bit it does *not*, it hard fails. If it were to document that, "the sizes are native integer types for the platform, the table documents some typical but *not* guaranteed sizes", that would be more clear. For struct - I think the '<' and '=' non-padding docs could benefit from some explanation.. I'm not sure what yet ? I saw a few suggestions on alternatives for size specifications, I'm definitely in favor of that (right now I'm probing I and L to determine size before using them for real). I don?t think U prefix would work as array really only accepts a single specifier. If array was to be updated to use multiple character specifiers I would recommend matching the struct specifier (which it is close to at the moment) format. For my uses case I'm seriously thinking about not using array moving forward and only using struct. I briefly wondered about ctypes (it has nice names, e.g. c_int64 that are unambiguous) but then I remembered it is not available in Jython). With the benefit of hindsight it would have been better if array (and struct) used stdint.h types, those types and lengths are explicitly documented. Regarding Barry's comment, yep size consistency with array is a pain - what I implemented as workaround is below (and likely to be my solution going forward): x = array.array('L', [0]) if x.itemsize == 4: FMT_ARRAY_4BYTE = 'L' FMT_STRUCT_4BYTE = ' Sent: Monday, December 2, 2019 5:50 PM To: python-list at python.org Subject: Re: array and struct 64-bit Linux change in behavior Python 3.7 and 2.7 On 12/2/19 4:25 PM, Barry Scott wrote: > >> On 2 Dec 2019, at 17:55, Rob Gaddi wrote: >> >> On 12/2/19 9:26 AM, Chris Clark wrote: >>> Test case: >>> import array >>> array.array('L', [0]) # x.itemsize == 8 rather than >>> 4 This works fine (returns 4) under Windows Python 3.7.3 64-bit >>> build. >>> Under Ubuntu; Python 2.7.15rc1, 3.6.5, 3.70b3 64-bit this returns 8. Documentation at https://docs.python.org/3/library/array.html explicitly states 'L' is for size 4. >>> It impacts all uses types of array (e.g. reading from byte strings). >>> The struct module is a little different: >>> import struct >>> x = struct.pack('L', 0) >>> # len(x) ===8 rather than 4 >>> This can be worked around by using '=L' - which is not well documented - so this maybe a doc issue. >>> Wanted to post here for comments before opening a bug at >>> https://bugs.python.org/ >>> Is anyone seeing this under Debian/Ubuntu? >>> Chris >> I'd say not a bug, at least in array. Reading that array documentation you linked, 4 is explicitly the MINIMUM size in bytes, not the guaranteed size. > I'm wondering how useful it is that for array you can read from a file but have no ideas how many bytes each item needs. > If I have a file with int32_t in it I cannot from the docs know how to read that file into an array. > >> The struct situation is, as you said, a bit different. I believe that with the default native alignment @, you're seeing 4-byte data padded to an 8-byte alignment, not 8-byte data. That does seem to go against what the struct documentation says, "Padding is only automatically added between successive structure members. No padding is added at the beginning or the end of the encoded struct." > The 'L' in struct is documented for 3.7 to use 4 bytes, but in fact uses 8, on fedora 31. Doc bug? > >>>> x=struct.pack('L',0x102030405) >>>> x > b'\x05\x04\x03\x02\x01\x00\x00\x00' > > Given I have exact control with b, h, i, and q but L is not fixed in size I'm not sure how it can be used with certainty across OS and versions. > > Barry > Actually, you DON'T have exact control with those sizes, it just happens that all the platforms you are using happen to have the same size for those types. Welcome to the ambiguity in the C type system, the basic types are NOT fixed in size. L means 'Long' and as Christian said, that is 8 byte long on Linux-64 bit. 'L' is exactly the right type for interfacing with a routine defined as taking a long. The issue is that you don't know what type a int32_t will be (it might be int, or it might be long, and long might not be 32 bits, it will be at least 32 bits). Perhaps array could be extended so that it took '4' for a 4 byte integer and '8' for an 8 byte integer (maybe 'U4' and 'U8' for unsigned). Might as well also allow 1 and 2 for completeness for char and short (but those are currently consistent). -- Richard Damon From rosuav at gmail.com Wed Dec 4 00:48:42 2019 From: rosuav at gmail.com (Chris Angelico) Date: Wed, 4 Dec 2019 16:48:42 +1100 Subject: array and struct 64-bit Linux change in behavior Python 3.7 and 2.7 In-Reply-To: References:

<475624D9-15D8-4A82-A8B4-5A174BA87DCD@barrys-emacs.org> <2b3d0c1a-aab4-9084-1011-975f911352d8@Damon-Family.org> Message-ID: On Wed, Dec 4, 2019 at 4:16 PM Chris Clark wrote: > I think the consensus from the various threads is that the docs are either lacking or misleading. > > I mentioned that this impacts bytes and the problem there is more telling as it hard fails (this is how I first discovered this was an issue): > > >>> array.array('L', b'\0\0\0\0') > Traceback (most recent call last): > File "", line 1, in > ValueError: string length not a multiple of item size > > I don't believe the documentation is accurate by using the word "minimum". Minimum would suggest that it would accept a 4-byte value as a minimum and on 64-bit it does *not*, it hard fails. If it were to document that, "the sizes are native integer types for the platform, the table documents some typical but *not* guaranteed sizes", that would be more clear. > I think array.array() is possibly the wrong tool for this job. If you have a collection of bytes from some well-defined source (eg you're parsing a file in a known format), struct is better suited to it, because it's easy to define both the size and byte order. > For my uses case I'm seriously thinking about not using array moving forward and only using struct. I briefly wondered about ctypes (it has nice names, e.g. c_int64 that are unambiguous) but then I remembered it is not available in Jython). > I wouldn't bother with ctypes for this type of job. > Regarding Barry's comment, yep size consistency with array is a pain - what I implemented as workaround is below (and likely to be my solution going forward): > > x = array.array('L', [0]) > if x.itemsize == 4: > FMT_ARRAY_4BYTE = 'L' > FMT_STRUCT_4BYTE = ' else: > x = array.array('I', [0]) > if x.itemsize == 4: > FMT_ARRAY_4BYTE = 'I' > FMT_STRUCT_4BYTE = ' del(x) > > and then use the constants in array/struct calls where (binary) file IO is happening. Yep, looks like struct is the way to go here. (Especially since you don't have a final 'else'.) ChrisA From p4j at j4d.net Wed Dec 4 08:21:46 2019 From: p4j at j4d.net (Pankaj Jangid) Date: Wed, 04 Dec 2019 18:51:46 +0530 Subject: Developers are advised to purge these malicious packages Message-ID: ``` The Python security team removed two trojanized Python libraries from PyPI (Python Package Index) that were caught stealing SSH and GPG keys from the projects of infected developers. The first is "python3-dateutil," which imitated the popular "dateutil" library. The second is "jeIlyfish" (the first L is an I), which mimicked the "jellyfish" library. ``` https://www.zdnet.com/article/two-malicious-python-libraries-removed-from-pypi/ Regards, -- Pankaj Jangid From ast at invalid Wed Dec 4 10:18:04 2019 From: ast at invalid (ast) Date: Wed, 4 Dec 2019 16:18:04 +0100 Subject: threading Message-ID: <5de7ce2e$0$3871$426a74cc@news.free.fr> Hi An operation like x+=1 on a global variable x is not thread safe because there can be a thread switch between reading and writing to x. The correct way is to use a lock lock = threading.Lock with lock: x+=1 I tried to write a program without the lock which should fail. Here it is: import threading x = 0 def test(): global x for i in range(100): x+=1 threadings = [] for i in range(100): t = threading.Thread(target=test) threadings.append(t) t.start() for t in threadings: t.join() print(x) 10000 The result is always correct: 10000 Why ? Secondly, how the switch between threads is done by the processor ? Is there a hardware interrupt coming from a timer ? From David.Raymond at tomtom.com Wed Dec 4 10:47:57 2019 From: David.Raymond at tomtom.com (David Raymond) Date: Wed, 4 Dec 2019 15:47:57 +0000 Subject: threading In-Reply-To: <5de7ce2e$0$3871$426a74cc@news.free.fr> References: <5de7ce2e$0$3871$426a74cc@news.free.fr> Message-ID: 100 increments happen very fast, and means each thread will probably complete before the main thread has even started the next one. Bump that up to 1_000_000 or so and you'll probably trigger it. I did a test with a print(x) at the start of test() to see what the number was when each thread kicked off, and the very first thread had got it up to 655,562 by the time the second thread had started and gotten to that print statement. -----Original Message----- From: Python-list On Behalf Of ast Sent: Wednesday, December 4, 2019 10:18 AM To: python-list at python.org Subject: threading Hi An operation like x+=1 on a global variable x is not thread safe because there can be a thread switch between reading and writing to x. The correct way is to use a lock lock = threading.Lock with lock: x+=1 I tried to write a program without the lock which should fail. Here it is: import threading x = 0 def test(): global x for i in range(100): x+=1 threadings = [] for i in range(100): t = threading.Thread(target=test) threadings.append(t) t.start() for t in threadings: t.join() print(x) 10000 The result is always correct: 10000 Why ? Secondly, how the switch between threads is done by the processor ? Is there a hardware interrupt coming from a timer ? -- https://mail.python.org/mailman/listinfo/python-list From david at lowryduda.com Wed Dec 4 12:59:57 2019 From: david at lowryduda.com (David Lowry-Duda) Date: Wed, 4 Dec 2019 12:59:57 -0500 Subject: Developers are advised to purge these malicious packages In-Reply-To: References: Message-ID: <20191204175957.GA24123@mail.lowryduda.com> I notice that "python3-dateutil" is in over 4000 github repositories [1]. That sounds like a disaster. [1]: https://github.com/search?q=python3-dateutil&type=Code - DLD -- David Lowry-Duda From christian at python.org Wed Dec 4 13:17:58 2019 From: christian at python.org (Christian Heimes) Date: Wed, 4 Dec 2019 19:17:58 +0100 Subject: Developers are advised to purge these malicious packages In-Reply-To: <20191204175957.GA24123@mail.lowryduda.com> References: <20191204175957.GA24123@mail.lowryduda.com> Message-ID: On 04/12/2019 18.59, David Lowry-Duda wrote: > I notice that "python3-dateutil" is in over 4000 github repositories > [1]. That sounds like a disaster. > > [1]: https://github.com/search?q=python3-dateutil&type=Code At least the first pages are packaging files for Debian, Fedora, and other Linux distributions. Downstream distributions provide a Python package under multiple names. For example the Fedora's build spec [1] creates python2-dateutil and python3-dateutil packages from the python-dateutil upstream project. Attackers abuse the fact and try to typo-squat packages in hope that somebody uses the Linux distribution package name "python3-dateutil" instead of the upstream name "python-dateutil" in requirements.txt Christian [1] https://src.fedoraproject.org/rpms/python-dateutil/blob/master/f/python-dateutil.spec From rob at despammer.com Wed Dec 4 15:25:33 2019 From: rob at despammer.com (RobH) Date: Wed, 4 Dec 2019 20:25:33 +0000 Subject: ImportError: No module named Adafruit_SSD1306 Message-ID: I am trying to do this project on a pi zero: http://frederickvandenbosch.be/?p=1365 I copied the code to the pi zero Download folder and when I run it I get the above error at line 4 Import Adafruit_SSD1306 I am using python version 2.7.16, if that makes any difference I have the same module as the authors' link goes to : Monochrome 1.3" 128x64 OLED graphic display - STEMMA QT / Qwiic Have I missed something. From torriem at gmail.com Wed Dec 4 17:34:50 2019 From: torriem at gmail.com (Michael Torrie) Date: Wed, 4 Dec 2019 15:34:50 -0700 Subject: Developers are advised to purge these malicious packages In-Reply-To: <20191204175957.GA24123@mail.lowryduda.com> References: <20191204175957.GA24123@mail.lowryduda.com> Message-ID: <32dc24a3-1c25-4497-7bdb-428c7a516f2b@gmail.com> On 12/4/19 10:59 AM, David Lowry-Duda wrote: > I notice that "python3-dateutil" is in over 4000 github repositories > [1]. That sounds like a disaster. > > [1]: https://github.com/search?q=python3-dateutil&type=Code It's clearly not, as Christian has already said. In fact it would be very difficult to determine from a github search whether this bad package was actually deployed anywhere. Since it presents a fake "dateutil" module, imports would look the same and proper as using the correct one. The only way this package comes into play is if someone pip installed it, or had an install script that installed it, or if it were bundled in the source tree. So this is very bad indeed, but not as bad as you suggest. We're not nearly as much at risk as node.js npm users are yet. From best_lay at yahoo.com Wed Dec 4 17:33:45 2019 From: best_lay at yahoo.com (Wildman) Date: Wed, 04 Dec 2019 16:33:45 -0600 Subject: ImportError: No module named Adafruit_SSD1306 References: Message-ID: On Wed, 04 Dec 2019 20:25:33 +0000, RobH wrote: > I am trying to do this project on a pi zero: > > http://frederickvandenbosch.be/?p=1365 > > I copied the code to the pi zero Download folder and when I run it I get > the above error at line 4 > Import Adafruit_SSD1306 > > I am using python version 2.7.16, if that makes any difference > I have the same module as the authors' link goes to : > > Monochrome 1.3" 128x64 OLED graphic display - STEMMA QT / Qwiic > > Have I missed something. The error indicates that Adafruit_SSD1306 in not installed. https://github.com/adafruit/Adafruit_Python_SSD1306 -- GNU/Linux user #557453 The cow died so I don't need your bull! From rob at despammer.com Wed Dec 4 18:06:44 2019 From: rob at despammer.com (RobH) Date: Wed, 4 Dec 2019 23:06:44 +0000 Subject: ImportError: No module named Adafruit_SSD1306 In-Reply-To: References:

Message-ID: On 04/12/2019 22:33, Wildman wrote: > On Wed, 04 Dec 2019 20:25:33 +0000, RobH wrote: > >> I am trying to do this project on a pi zero: >> >> http://frederickvandenbosch.be/?p=1365 >> >> I copied the code to the pi zero Download folder and when I run it I get >> the above error at line 4 >> Import Adafruit_SSD1306 >> >> I am using python version 2.7.16, if that makes any difference >> I have the same module as the authors' link goes to : >> >> Monochrome 1.3" 128x64 OLED graphic display - STEMMA QT / Qwiic >> >> Have I missed something. > > The error indicates that Adafruit_SSD1306 in not installed. > > https://github.com/adafruit/Adafruit_Python_SSD1306 > I have the library in the same Downloads folder, but I don't know how to actually install it as it doesn't have an .sh file included From python at python.invalid Wed Dec 4 18:15:28 2019 From: python at python.invalid (Python) Date: Thu, 5 Dec 2019 00:15:28 +0100 Subject: ImportError: No module named Adafruit_SSD1306 In-Reply-To: References:

Message-ID: <5de83dea$0$31397$426a74cc@news.free.fr> Le 05/12/2019 ? 00:06, RobH a ?crit?: > On 04/12/2019 22:33, Wildman wrote: >> On Wed, 04 Dec 2019 20:25:33 +0000, RobH wrote: >> >>> I am trying to do this project on a pi zero: >>> >>> http://frederickvandenbosch.be/?p=1365 >>> >>> I copied the code to the pi zero Download folder and when I run it I get >>> the above error at line 4 >>> Import Adafruit_SSD1306 >>> >>> I am using python version 2.7.16, if that makes any difference >>> I have the same module as the authors' link goes to : >>> >>> Monochrome 1.3" 128x64 OLED graphic display - STEMMA QT / Qwiic >>> >>> Have I missed something. >> >> The error indicates that Adafruit_SSD1306 in not installed. >> >> https://github.com/adafruit/Adafruit_Python_SSD1306 >> > > I have the library in the same Downloads folder, but I don't know how to > actually install it as it doesn't have an .sh file included What cannot you understand in the Installing section of README.md? sudo python -m pip install --upgrade pip setuptools wheel sudo pip install Adafruit-SSD1306 Or alternatively: sudo python -m pip install --upgrade pip setuptools wheel git clone https://github.com/adafruit/Adafruit_Python_SSD1306.git cd Adafruit_Python_SSD1306 sudo python setup.py install even WORSE, what cannot you undertand at the top of same file? This library has been deprecated! We are leaving this up for historical and research purposes but archiving the repository. From pahome.chen at mirlab.org Wed Dec 4 22:28:51 2019 From: pahome.chen at mirlab.org (lampahome) Date: Thu, 5 Dec 2019 11:28:51 +0800 Subject: What does the blue color section in background mean? Message-ID: I tried to plot graph about a time-series with library statsmodel. I decide to plot autocorrelation function, but I don't know the blue section in the example graph mean... Can anyone tell me? The plot_acf example link: https://www.statsmodels.org/dev/generated/statsmodels.graphics.tsaplots.plot_acf.html#statsmodels.graphics.tsaplots.plot_acf The graph in example: https://www.statsmodels.org/dev/plots/graphics_tsa_plot_acf.png thx From p4j at j4d.net Thu Dec 5 04:32:47 2019 From: p4j at j4d.net (Pankaj Jangid) Date: Thu, 05 Dec 2019 15:02:47 +0530 Subject: Developers are advised to purge these malicious packages References: <20191204175957.GA24123@mail.lowryduda.com>

Message-ID: Christian Heimes writes: > On 04/12/2019 18.59, David Lowry-Duda wrote: >> I notice that "python3-dateutil" is in over 4000 github repositories >> [1]. That sounds like a disaster. >> >> [1]: https://github.com/search?q=python3-dateutil&type=Code > > At least the first pages are packaging files for Debian, Fedora, and > other Linux distributions. Downstream distributions provide a Python > package under multiple names. For example the Fedora's build spec [1] > creates python2-dateutil and python3-dateutil packages from the > python-dateutil upstream project. > > Attackers abuse the fact and try to typo-squat packages in hope that > somebody uses the Linux distribution package name "python3-dateutil" > instead of the upstream name "python-dateutil" in requirements.txt > Nice explanation. Thanks. From barry at barrys-emacs.org Thu Dec 5 04:27:43 2019 From: barry at barrys-emacs.org (Barry Scott) Date: Thu, 5 Dec 2019 09:27:43 +0000 Subject: array and struct 64-bit Linux change in behavior Python 3.7 and 2.7 In-Reply-To: <2b3d0c1a-aab4-9084-1011-975f911352d8@Damon-Family.org> References:

<475624D9-15D8-4A82-A8B4-5A174BA87DCD@barrys-emacs.org> <2b3d0c1a-aab4-9084-1011-975f911352d8@Damon-Family.org> Message-ID: <8C71B143-C11A-4A1F-901E-6CEFF6D73814@barrys-emacs.org> > On 3 Dec 2019, at 01:50, Richard Damon wrote: > > On 12/2/19 4:25 PM, Barry Scott wrote: >> >>> On 2 Dec 2019, at 17:55, Rob Gaddi wrote: >>> >>> On 12/2/19 9:26 AM, Chris Clark wrote: >>>> Test case: >>>> import array >>>> array.array('L', [0]) >>>> # x.itemsize == 8 rather than 4 >>>> This works fine (returns 4) under Windows Python 3.7.3 64-bit build. >>>> Under Ubuntu; Python 2.7.15rc1, 3.6.5, 3.70b3 64-bit this returns 8. Documentation at https://docs.python.org/3/library/array.html explicitly states 'L' is for size 4. >>>> It impacts all uses types of array (e.g. reading from byte strings). >>>> The struct module is a little different: >>>> import struct >>>> x = struct.pack('L', 0) >>>> # len(x) ===8 rather than 4 >>>> This can be worked around by using '=L' - which is not well documented - so this maybe a doc issue. >>>> Wanted to post here for comments before opening a bug at https://bugs.python.org/ >>>> Is anyone seeing this under Debian/Ubuntu? >>>> Chris >>> I'd say not a bug, at least in array. Reading that array documentation you linked, 4 is explicitly the MINIMUM size in bytes, not the guaranteed size. >> I'm wondering how useful it is that for array you can read from a file but have no ideas how many bytes each item needs. >> If I have a file with int32_t in it I cannot from the docs know how to read that file into an array. >> >>> The struct situation is, as you said, a bit different. I believe that with the default native alignment @, you're seeing 4-byte data padded to an 8-byte alignment, not 8-byte data. That does seem to go against what the struct documentation says, "Padding is only automatically added between successive structure members. No padding is added at the beginning or the end of the encoded struct." >> The 'L' in struct is documented for 3.7 to use 4 bytes, but in fact uses 8, on fedora 31. Doc bug? >> >>>>> x=struct.pack('L',0x102030405) >>>>> x >> b'\x05\x04\x03\x02\x01\x00\x00\x00' >> >> Given I have exact control with b, h, i, and q but L is not fixed in size I'm not sure how it can be used with certainty across OS and versions. >> >> Barry >> > Actually, you DON'T have exact control with those sizes, it just happens > that all the platforms you are using happen to have the same size for > those types. According to the docs for struct (python 2.7 and python 3.8) I do have exact control for the types I listed. Or did I miss a caveat on that page? The docs for array indeed show that you have no exact control and that is what I'm commenting on. As other have observed that makes array the wrong tool to read data of a fixed format. > Welcome to the ambiguity in the C type system, the basic > types are NOT fixed in size. Of course that is why int32_t etc where added to the C standards. > L means 'Long' and as Christian said, that > is 8 byte long on Linux-64 bit. 'L' is exactly the right type for > interfacing with a routine defined as taking a long. The issue is that > you don't know what type a int32_t will be (it might be int, or it might > be long, and long might not be 32 bits, it will be at least 32 bits). > > Perhaps array could be extended so that it took '4' for a 4 byte integer > and '8' for an 8 byte integer (maybe 'U4' and 'U8' for unsigned). Might > as well also allow 1 and 2 for completeness for char and short (but > those are currently consistent). Personally I have never thought to use array. I have user struct and ctypes extensively and they give me the documented control I need to work with data structures and APIs. Barry > > -- > Richard Damon > > -- > https://mail.python.org/mailman/listinfo/python-list > From rob at despammer.com Thu Dec 5 05:07:28 2019 From: rob at despammer.com (RobH) Date: Thu, 5 Dec 2019 10:07:28 +0000 Subject: ImportError: No module named Adafruit_SSD1306 In-Reply-To: <5de83dea$0$31397$426a74cc@news.free.fr> References:

<5de83dea$0$31397$426a74cc@news.free.fr> Message-ID: On 04/12/2019 23:15, Python wrote: > Le 05/12/2019 ? 00:06, RobH a ?crit?: >> On 04/12/2019 22:33, Wildman wrote: >>> On Wed, 04 Dec 2019 20:25:33 +0000, RobH wrote: >>> >>>> I am trying to do this project on a pi zero: >>>> >>>> http://frederickvandenbosch.be/?p=1365 >>>> >>>> I copied the code to the pi zero Download folder and when I run it I >>>> get >>>> the above error at line 4 >>>> Import Adafruit_SSD1306 >>>> >>>> I am using python version 2.7.16, if that makes any difference >>>> I have the same module as the authors' link goes to : >>>> >>>> Monochrome 1.3" 128x64 OLED graphic display - STEMMA QT / Qwiic >>>> >>>> Have I missed something. >>> >>> The error indicates that Adafruit_SSD1306 in not installed. >>> >>> https://github.com/adafruit/Adafruit_Python_SSD1306 >>> >> >> I have the library in the same Downloads folder, but I don't know how >> to actually install it as it doesn't have an .sh file included > > What cannot you understand in the Installing section of README.md? > > ? sudo python -m pip install --upgrade pip setuptools wheel > ? sudo pip install Adafruit-SSD1306 > > ? Or alternatively: > > ? sudo python -m pip install --upgrade pip setuptools wheel > ? git clone https://github.com/adafruit/Adafruit_Python_SSD1306.git > ? cd Adafruit_Python_SSD1306 > ? sudo python setup.py install > > even WORSE, what cannot you undertand at the top of same file? > > ? This library has been deprecated! We are leaving this up for > ? historical and research purposes but archiving the repository. > > I was looking at the wrong file previously, and got mixed up, doh! I have installed the Adafruit_Python_SSD1306 library now. (There is no mention that I can see about installing other libraries etc to get the project to work, by the author) I had to make some changes in the authors file here: import time import Adafruit_SSD1306 import RPi.GPIO as GPIO <<< disp.begin() File "build/bdist.linux-armv6l/egg/Adafruit_SSD1306/SSD1306.py", line 148, in begin File "build/bdist.linux-armv6l/egg/Adafruit_SSD1306/SSD1306.py", line 247, in _initialize File "build/bdist.linux-armv6l/egg/Adafruit_SSD1306/SSD1306.py", line 129, in command File "build/bdist.linux-armv6l/egg/Adafruit_GPIO/I2C.py", line 116, in write8 File "build/bdist.linux-armv6l/egg/Adafruit_PureIO/smbus.py", line 268, in write_byte_data IOError: [Errno 121] Remote I/O error. Maybe I have created these errors unknowingly by the said changes I made. Thanks From python at mrabarnett.plus.com Thu Dec 5 09:17:07 2019 From: python at mrabarnett.plus.com (MRAB) Date: Thu, 5 Dec 2019 14:17:07 +0000 Subject: What does the blue color section in background mean? In-Reply-To: References: Message-ID: <5982de96-8609-eee2-9b94-cbbd8dabb874@mrabarnett.plus.com> On 2019-12-05 03:28, lampahome wrote: > I tried to plot graph about a time-series with library statsmodel. > > I decide to plot autocorrelation function, but I don't know the blue > section in the example graph mean... > > Can anyone tell me? > > The plot_acf example link: > https://www.statsmodels.org/dev/generated/statsmodels.graphics.tsaplots.plot_acf.html#statsmodels.graphics.tsaplots.plot_acf > > > The graph in example: > https://www.statsmodels.org/dev/plots/graphics_tsa_plot_acf.png > I think that's the confidence interval. Here's another example that shows the same kind of thing: https://machinelearningmastery.com/gentle-introduction-autocorrelation-partial-autocorrelation/ From aishan0403 at gmail.com Thu Dec 5 10:31:54 2019 From: aishan0403 at gmail.com (A S) Date: Thu, 5 Dec 2019 07:31:54 -0800 (PST) Subject: Extract sentences in nested parentheses using Python In-Reply-To: References: <7a365fa0-e721-4a95-8c30-d3661301a7b2@googlegroups.com>

Message-ID: On Tuesday, 3 December 2019 23:48:21 UTC+8, Peter Otten wrote: > A S wrote: > > > On Tuesday, 3 December 2019 01:01:25 UTC+8, Peter Otten wrote: > >> A S wrote: > >> > >> I think I've seen this question before ;) > >> > >> > I am trying to extract all strings in nested parentheses (along with > >> > the parentheses itself) in my .txt file. Please see the sample .txt > >> > file that I have used in this example here: > >> > (https://drive.google.com/open?id=1UKc0ZgY9Fsz5O1rSeBCLqt5dwZkMaQgr). > >> > > >> > I have tried and done up three different codes but none of them seems > >> > to be able to extract all the nested parentheses. They can only extract > >> > a portion of the nested parentheses. Any advice on what I've done wrong > >> > could really help! > >> > > >> > Here are the three codes I have done so far: > >> > > >> > 1st attempt: > >> > > >> > import re > >> > from os.path import join > >> > > >> > def balanced_braces(args): > >> > parts = [] > >> > for arg in args: > >> > if '(' not in arg: > >> > continue > >> > >> There could still be a ")" that you miss > >> > >> > chars = [] > >> > n = 0 > >> > for c in arg: > >> > if c == '(': > >> > if n > 0: > >> > chars.append(c) > >> > n += 1 > >> > elif c == ')': > >> > n -= 1 > >> > if n > 0: > >> > chars.append(c) > >> > elif n == 0: > >> > parts.append(''.join(chars).lstrip().rstrip()) > >> > chars = [] > >> > elif n > 0: > >> > chars.append(c) > >> > return parts > >> > >> It's probably easier to understand and implement when you process the > >> complete text at once. Then arbitrary splits don't get in the way of your > >> quest for ( and ). You just have to remember the position of the first > >> opening ( and number of opening parens that have to be closed before you > >> take the complete expression: > >> > >> level: 00011112222100 > >> text: abc(def(gh))ij > >> when we are here^ > >> we need^ > >> > >> A tentative implementation: > >> > >> $ cat parse.py > >> import re > >> > >> NOT_SET = object() > >> > >> def scan(text): > >> level = 0 > >> start = NOT_SET > >> for m in re.compile("[()]").finditer(text): > >> if m.group() == ")": > >> level -= 1 > >> if level < 0: > >> raise ValueError("underflow: more closing than opening > >> parens") > >> if level == 0: > >> # outermost closing parenthesis: > >> # deliver enclosed string including parens. > >> yield text[start:m.end()] > >> start = NOT_SET > >> elif m.group() == "(": > >> if level == 0: > >> # outermost opening parenthesis: remember position. > >> assert start is NOT_SET > >> start = m.start() > >> level += 1 > >> else: > >> assert False > >> if level > 0: > >> raise ValueError("unclosed parens remain") > >> > >> > >> if __name__ == "__main__": > >> with open("lan sample text file.txt") as instream: > >> text = instream.read() > >> for chunk in scan(text): > >> print(chunk) > >> $ python3 parse.py > >> ("xE'", PUT(xx.xxxx.),"'") > >> ("TRUuuuth") > > > > Hello Peter! I tried this on my actual working files and it returned this > > error: "unclosed parens remain". In this case, how can I continue to parse > > through my text files by only extracting those with balanced parentheses > > and ignore those that are incomplete? > > filenames = ... > for filename in filenames: > with open(filename) as instream: > text = instream.read() > try: > chunks = list(scan(text)) > except ValueError as err: > print(f"{err} in file {filename!r}", file=sys.stderr) > else: > for chunk in chunks: > print(chunk) hey Peter, it works! Thank you :) From aishan0403 at gmail.com Thu Dec 5 10:33:41 2019 From: aishan0403 at gmail.com (A S) Date: Thu, 5 Dec 2019 07:33:41 -0800 (PST) Subject: Extract sentences in nested parentheses using Python In-Reply-To: References: <7a365fa0-e721-4a95-8c30-d3661301a7b2@googlegroups.com> <1d689634-c3db-dd3f-c432-111604d5a784@DancesWithMice.info> Message-ID: <9ea51f20-4ed7-4eaa-9042-8aa9d4d2eb6a@googlegroups.com> On Tuesday, 3 December 2019 06:22:52 UTC+8, DL Neil wrote: > On 3/12/19 6:00 AM, Peter Otten wrote: > > A S wrote: > > I think I've seen this question before ;) > > In addition to 'other reasons' for @Peter's comment, it is a common > ComSc worked-problem or assignment. (in which case, we'd appreciate > being told that you/OP is asking for help with "homework") > > > >> I am trying to extract all strings in nested parentheses (along with the > >> parentheses itself) in my .txt file. Please see the sample .txt file that > >> I have used in this example here: > >> (https://drive.google.com/open?id=1UKc0ZgY9Fsz5O1rSeBCLqt5dwZkMaQgr). > >> > >> I have tried and done up three different codes but none of them seems to > >> be able to extract all the nested parentheses. They can only extract a > >> portion of the nested parentheses. Any advice on what I've done wrong > >> could really help! > > One approach is to research in the hope that there are already existing > tools or techniques which may help/save you from 'reinventing the wheel' > - when you think about it, a re-statement of open-source objectives. > > How does the Python interpreter break-down Python (text) code into its > constituent parts ("tokens") *including* parentheses? Such are made > available in (perhaps) a lesser-known corner of the PSL (Python Standard > Library). Might you be able to use one such tool? > > The ComSc technique which sprang to mind involves "stacks" (a LIFO data > structure) and "RPN" (Reverse Polish Notation). Whereas we like people > to take their turn when it comes to limited resources, eg to form a > "queue" to purchase/pay for goods at the local store, which is "FIFO" > (first-in, first-out); a "stack"/LIFO (last-in, first-out) can be > problematic to put into practical application. There are plenty of > Python implementations or you can 'roll your own' with a list. Again, > I'd likely employ a "deque" from the PSL's Collections library (although > as a "stack" rather than as a "double-ended queue"), because the > optimisation comes "free". (to my laziness, but after some kind soul > sweated-bullets to make it fast (in both senses) for 'the rest of us'!) > > > > It's probably easier to understand and implement when you process the > > complete text at once. Then arbitrary splits don't get in the way of your > > quest for ( and ). You just have to remember the position of the first > > opening ( and number of opening parens that have to be closed before you > > take the complete expression: > > +1 > but as a 'silver surfer', I don't like to be asked to "remember" anything! > > > > level: 00011112222100 > > we need^ > > > Consider: > original_text (the contents of the .txt file - add buffering if volumes > are huge) > current_text (the characters we have processed/"recognised" so-far) > stack (what an original name for such a data-structure! Which will > contain each of the partial parenthetical expressions found - but yet to > be proven/made complete) > > set current_text to NULSTRING > for each current_character in original_text: > if current_character is LEFT_PARENTHESIS: > push current_text to stack > set current_text to LEFT_PARENTHESIS > concatenate current_character with current_text > if current_character is RIGHT_PARENTHESIS: > # current_text is a parenthetical expression > # do with it what you will > pop the stack > set current_text to the ex-stack string \ > concat current_text's p-expn > > Once working: cover 'special cases' (after above loop), eg original_text > which doesn't begin and/or end with parentheses; and error cases, eg > pop-ping a NULSTRING, or thinking things are finished but the stack is > not yet empty - likely events from unbalanced parentheses! > > original text = "abc(def(gh))ij" > > event 1: in-turn, concatenate characters "abc" as current_text > event 2: locate (first) left-parenthesis, push current_text to stack(&) > event 3: concatenate "(def" > event 4: push, likewise > event 5: concatenate "(gh" > event 6: locate (first) right-parenthesis (matches to left-parenthesis > begining the current_string!) > result?: ?print current_text? > event 7: pop stack and redefine current_text as "(def(gh)" > event 8: repeat, per event 6 > event 9: current_text will now become "(def(gh))" > event 10: (special case) run-out of input at "(def(gh)ij" > event 11: (special case) pop (!any) stack and report "abc(def(gh)" > > > NB not sure of need for a "level" number; but if required, you can infer > that at any time, from the depth/len() of the stack! > > NBB being a simple-boy, my preference is for a 'single layer' of code, > cf @Peter's generator. Regardless the processes are "tokenisation" and > "recognition". > > At the back of my mind, was the notion that you may (eventually) be > required to work with more than parentheses, eg pair-wise > square-brackets and/or quotation-marks. In which case, you will need to > also 'push' the token and check for token-pairs when 'pop-ping', as well > as (perhaps) recognising lists of tokens to tokenise instead of the two > parenthesis characters alone. In which case, I'd take a serious look at > the Python Language Services rather than taking a 'roll your own' approach! > > Contrarily, if this spec is 'it', then you might consider optimising the > search processes which 'trigger' the two stack operations, by re-working > the for-loop and utilising string.find() - prioritising whichever > parenthesis is left-most/comes-first - assuming LtR text. (apologies if > you have already tried this in one of your previous approaches) > Unfortunately, such likely results in 'layers' of code, and a generator > might well become the tool-of-choice (I say this before @Peter comes > back and (quite deservedly) flays me alive!). > > > WebRefs: > Python Language Services: https://docs.python.org/3/library/language.html > collections ? Container datatypes: > https://docs.python.org/3/library/collections.html > > See also your ComSc text/reference materials. > -- > Regards =dn Hey DL Neil, this is rather sophisticated for me as I am still learning the basics of Python...But I truly appreciate your help and effort! I did try to read through what you said, but some parts I could not register! From rob at despammer.com Thu Dec 5 13:49:40 2019 From: rob at despammer.com (RobH) Date: Thu, 5 Dec 2019 18:49:40 +0000 Subject: ImportError: No module named Adafruit_SSD1306 Update In-Reply-To: References:

<5de83dea$0$31397$426a74cc@news.free.fr> Message-ID: On 05/12/2019 10:07, RobH wrote: > On 04/12/2019 23:15, Python wrote: >> Le 05/12/2019 ? 00:06, RobH a ?crit?: >>> On 04/12/2019 22:33, Wildman wrote: >>>> On Wed, 04 Dec 2019 20:25:33 +0000, RobH wrote: >>>> >>>>> I am trying to do this project on a pi zero: >>>>> >>>>> http://frederickvandenbosch.be/?p=1365 >>>>> >>>>> I copied the code to the pi zero Download folder and when I run it >>>>> I get >>>>> the above error at line 4 >>>>> Import Adafruit_SSD1306 >>>>> >>>>> I am using python version 2.7.16, if that makes any difference >>>>> I have the same module as the authors' link goes to : >>>>> >>>>> Monochrome 1.3" 128x64 OLED graphic display - STEMMA QT / Qwiic >>>>> >>>>> Have I missed something. >>>> >>>> The error indicates that Adafruit_SSD1306 in not installed. >>>> >>>> https://github.com/adafruit/Adafruit_Python_SSD1306 >>>> >>> >>> I have the library in the same Downloads folder, but I don't know how >>> to actually install it as it doesn't have an .sh file included >> >> What cannot you understand in the Installing section of README.md? >> >> ?? sudo python -m pip install --upgrade pip setuptools wheel >> ?? sudo pip install Adafruit-SSD1306 >> >> ?? Or alternatively: >> >> ?? sudo python -m pip install --upgrade pip setuptools wheel >> ?? git clone https://github.com/adafruit/Adafruit_Python_SSD1306.git >> ?? cd Adafruit_Python_SSD1306 >> ?? sudo python setup.py install >> >> even WORSE, what cannot you undertand at the top of same file? >> >> ?? This library has been deprecated! We are leaving this up for >> ?? historical and research purposes but archiving the repository. >> >> > > I was looking at the wrong file previously, and got mixed up, doh! > I have installed the Adafruit_Python_SSD1306 library now. > > (There is no mention that I can see about installing other libraries etc > to get the project to work, by the author) > Update: I did python3 Internet.py and now only get this error: pi at raspberrypi:~/Downloads $ python3 Internet.py File "Internet.py", line 24 font = ImageFont.truetype( 'Minecraftia.ttf', 35) ^ TabError: inconsistent use of tabs and spaces in indentation I cannot see what is wrong, as the text is all lined up with that above and below: def display_time(): # Collect current time and date if(time_format): current_time = time.strftime("%I:%M") else: current_time = time.strftime("%H:%M") current_date = time.strftime("%d/%m/%Y") # Clear image buffer by drawing a black filled box draw.rectangle((0,0,width,height), outline=0, fill=0) # Set font type and size font = ImageFont.truetype ('Minecraftia.ttf', 35) << error here # Position time x_pos = (disp.width/2)-(string_width(font,current_time)/2) y_pos = 2 + (disp.height-4-8)/2 - (35/2) Thanks From python at mrabarnett.plus.com Thu Dec 5 14:28:43 2019 From: python at mrabarnett.plus.com (MRAB) Date: Thu, 5 Dec 2019 19:28:43 +0000 Subject: ImportError: No module named Adafruit_SSD1306 Update In-Reply-To: References:

<5de83dea$0$31397$426a74cc@news.free.fr> Message-ID: On 2019-12-05 18:49, RobH wrote: > On 05/12/2019 10:07, RobH wrote: >> On 04/12/2019 23:15, Python wrote: >>> Le 05/12/2019 ? 00:06, RobH a ?crit?: >>>> On 04/12/2019 22:33, Wildman wrote: >>>>> On Wed, 04 Dec 2019 20:25:33 +0000, RobH wrote: >>>>> >>>>>> I am trying to do this project on a pi zero: >>>>>> >>>>>> http://frederickvandenbosch.be/?p=1365 >>>>>> >>>>>> I copied the code to the pi zero Download folder and when I run it >>>>>> I get >>>>>> the above error at line 4 >>>>>> Import Adafruit_SSD1306 >>>>>> >>>>>> I am using python version 2.7.16, if that makes any difference >>>>>> I have the same module as the authors' link goes to : >>>>>> >>>>>> Monochrome 1.3" 128x64 OLED graphic display - STEMMA QT / Qwiic >>>>>> >>>>>> Have I missed something. >>>>> >>>>> The error indicates that Adafruit_SSD1306 in not installed. >>>>> >>>>> https://github.com/adafruit/Adafruit_Python_SSD1306 >>>>> >>>> >>>> I have the library in the same Downloads folder, but I don't know how >>>> to actually install it as it doesn't have an .sh file included >>> >>> What cannot you understand in the Installing section of README.md? >>> >>> ?? sudo python -m pip install --upgrade pip setuptools wheel >>> ?? sudo pip install Adafruit-SSD1306 >>> >>> ?? Or alternatively: >>> >>> ?? sudo python -m pip install --upgrade pip setuptools wheel >>> ?? git clone https://github.com/adafruit/Adafruit_Python_SSD1306.git >>> ?? cd Adafruit_Python_SSD1306 >>> ?? sudo python setup.py install >>> >>> even WORSE, what cannot you undertand at the top of same file? >>> >>> ?? This library has been deprecated! We are leaving this up for >>> ?? historical and research purposes but archiving the repository. >>> >>> >> >> I was looking at the wrong file previously, and got mixed up, doh! >> I have installed the Adafruit_Python_SSD1306 library now. >> >> (There is no mention that I can see about installing other libraries etc >> to get the project to work, by the author) >> > > Update: > I did python3 Internet.py > and now only get this error: > > pi at raspberrypi:~/Downloads $ python3 Internet.py > File "Internet.py", line 24 > font = ImageFont.truetype( 'Minecraftia.ttf', 35) > ^ > TabError: inconsistent use of tabs and spaces in indentation > > I cannot see what is wrong, as the text is all lined up with that above > and below: > Are you sure that the indentation is the same? Do all of the lines use all spaces or all tabs or indentation? Just because they look lined-up doesn't mean that they're doing it with the sequence of characters. > def display_time(): > # Collect current time and date > if(time_format): > current_time = time.strftime("%I:%M") > else: > current_time = time.strftime("%H:%M") > > current_date = time.strftime("%d/%m/%Y") > > # Clear image buffer by drawing a black filled box > draw.rectangle((0,0,width,height), outline=0, fill=0) > > # Set font type and size > font = ImageFont.truetype ('Minecraftia.ttf', 35) << error here > > # Position time > x_pos = (disp.width/2)-(string_width(font,current_time)/2) > y_pos = 2 + (disp.height-4-8)/2 - (35/2) > > From rhodri at kynesim.co.uk Thu Dec 5 14:30:31 2019 From: rhodri at kynesim.co.uk (Rhodri James) Date: Thu, 5 Dec 2019 19:30:31 +0000 Subject: ImportError: No module named Adafruit_SSD1306 Update In-Reply-To: References:

<5de83dea$0$31397$426a74cc@news.free.fr> Message-ID: <1c044f94-c429-b7d8-150c-7068148a841e@kynesim.co.uk> On 05/12/2019 18:49, RobH wrote: > Update: > I did python3 Internet.py > and now only get this error: > > pi at raspberrypi:~/Downloads $ python3 Internet.py > ? File "Internet.py", line 24 > ??? font = ImageFont.truetype( 'Minecraftia.ttf', 35) > ??????????????????????????????????????????????????? ^ > TabError: inconsistent use of tabs and spaces in indentation > > I cannot see what is wrong, as the text is all lined up with that above > and below: The problem will be that you have a mix of tabs and spaces in your indentation. This causes problems because some people don't think that the One True Tab Width is 8 characters ;-) so to them the indentation looks ragged. Worse, when they mix tabs and spaces, code that looks to be at the same indentation level to them looks different to the interpreter. The decision was taken a while ago that Python should put its foot down about this, and demand that we use either all tabs or all spaces for our indentation. That's what you've fallen foul off; there must be a mix of tabs and spaces in that line! -- Rhodri James *-* Kynesim Ltd From rhodri at kynesim.co.uk Thu Dec 5 14:40:16 2019 From: rhodri at kynesim.co.uk (Rhodri James) Date: Thu, 5 Dec 2019 19:40:16 +0000 Subject: ImportError: No module named Adafruit_SSD1306 Update In-Reply-To: <1c044f94-c429-b7d8-150c-7068148a841e@kynesim.co.uk> References:

<5de83dea$0$31397$426a74cc@news.free.fr> <1c044f94-c429-b7d8-150c-7068148a841e@kynesim.co.uk> Message-ID: <6d5f872f-5f96-8b00-d6a9-ffab3d177de7@kynesim.co.uk> On 05/12/2019 19:30, Rhodri James wrote: > On 05/12/2019 18:49, RobH wrote: >> Update: >> I did python3 Internet.py >> and now only get this error: >> >> pi at raspberrypi:~/Downloads $ python3 Internet.py >> ?? File "Internet.py", line 24 >> ???? font = ImageFont.truetype( 'Minecraftia.ttf', 35) >> ???????????????????????????????????????????????????? ^ >> TabError: inconsistent use of tabs and spaces in indentation >> >> I cannot see what is wrong, as the text is all lined up with that >> above and below: > > The problem will be that you have a mix of tabs and spaces in your > indentation.? This causes problems because some people don't think that > the One True Tab Width is 8 characters ;-) so to them the indentation > looks ragged.? Worse, when they mix tabs and spaces, code that looks to > be at the same indentation level to them looks different to the > interpreter.? The decision was taken a while ago that Python should put > its foot down about this, and demand that we use either all tabs or all > spaces for our indentation.? That's what you've fallen foul off; there > must be a mix of tabs and spaces in that line! Or more likely you've used tabs on that line and spaces elsewhere, or vice versa. I should have remember to say that, sorry. -- Rhodri James *-* Kynesim Ltd From rob at despammer.com Thu Dec 5 15:28:52 2019 From: rob at despammer.com (RobH) Date: Thu, 5 Dec 2019 20:28:52 +0000 Subject: ImportError: No module named Adafruit_SSD1306 Update In-Reply-To: References:

<5de83dea$0$31397$426a74cc@news.free.fr> <1c044f94-c429-b7d8-150c-7068148a841e@kynesim.co.uk> <6d5f872f-5f96-8b00-d6a9-ffab3d177de7@kynesim.co.uk> Message-ID: On 05/12/2019 19:40, Rhodri James wrote: > On 05/12/2019 19:30, Rhodri James wrote: >> On 05/12/2019 18:49, RobH wrote: >>> Update: >>> I did python3 Internet.py >>> and now only get this error: >>> >>> pi at raspberrypi:~/Downloads $ python3 Internet.py >>> ?? File "Internet.py", line 24 >>> ???? font = ImageFont.truetype( 'Minecraftia.ttf', 35) >>> ???????????????????????????????????????????????????? ^ >>> TabError: inconsistent use of tabs and spaces in indentation >>> >>> I cannot see what is wrong, as the text is all lined up with that >>> above and below: >> >> The problem will be that you have a mix of tabs and spaces in your >> indentation.? This causes problems because some people don't think >> that the One True Tab Width is 8 characters ;-) so to them the >> indentation looks ragged.? Worse, when they mix tabs and spaces, code >> that looks to be at the same indentation level to them looks different >> to the interpreter.? The decision was taken a while ago that Python >> should put its foot down about this, and demand that we use either all >> tabs or all spaces for our indentation.? That's what you've fallen >> foul off; there must be a mix of tabs and spaces in that line! > > Or more likely you've used tabs on that line and spaces elsewhere, or > vice versa.? I should have remember to say that, sorry. > Ok thanks for the explanation there, and I have placed the cursor at the beginning of the first indented line. Moving down 1 line at a time , each line is at the same position upto line 157 in the authors code . Then it is closer in to the edge upto line 190, where it goes back out again. What is my best course of action here now. From joel.goldstick at gmail.com Thu Dec 5 15:39:45 2019 From: joel.goldstick at gmail.com (Joel Goldstick) Date: Thu, 5 Dec 2019 15:39:45 -0500 Subject: ImportError: No module named Adafruit_SSD1306 Update In-Reply-To: References:

<5de83dea$0$31397$426a74cc@news.free.fr> <1c044f94-c429-b7d8-150c-7068148a841e@kynesim.co.uk> <6d5f872f-5f96-8b00-d6a9-ffab3d177de7@kynesim.co.uk> Message-ID: On Thu, Dec 5, 2019 at 3:31 PM RobH wrote: > > On 05/12/2019 19:40, Rhodri James wrote: > > On 05/12/2019 19:30, Rhodri James wrote: > >> On 05/12/2019 18:49, RobH wrote: > >>> Update: > >>> I did python3 Internet.py > >>> and now only get this error: > >>> > >>> pi at raspberrypi:~/Downloads $ python3 Internet.py > >>> File "Internet.py", line 24 > >>> font = ImageFont.truetype( 'Minecraftia.ttf', 35) > >>> ^ > >>> TabError: inconsistent use of tabs and spaces in indentation > >>> > >>> I cannot see what is wrong, as the text is all lined up with that > >>> above and below: > >> > >> The problem will be that you have a mix of tabs and spaces in your > >> indentation. This causes problems because some people don't think > >> that the One True Tab Width is 8 characters ;-) so to them the > >> indentation looks ragged. Worse, when they mix tabs and spaces, code > >> that looks to be at the same indentation level to them looks different > >> to the interpreter. The decision was taken a while ago that Python > >> should put its foot down about this, and demand that we use either all > >> tabs or all spaces for our indentation. That's what you've fallen > >> foul off; there must be a mix of tabs and spaces in that line! > > > > Or more likely you've used tabs on that line and spaces elsewhere, or > > vice versa. I should have remember to say that, sorry. > > > > Ok thanks for the explanation there, and I have placed the cursor at the > beginning of the first indented line. Moving down 1 line at a time , > each line is at the same position upto line 157 in the authors code . > Then it is closer in to the edge upto line 190, where it goes back out > again. > > What is my best course of action here now. > > -- > https://mail.python.org/mailman/listinfo/python-list google or duckduckgo or whatever your text editor and tabs to spaces.. there is probably an easy way to convert the file -- Joel Goldstick http://joelgoldstick.com/blog http://cc-baseballstats.info/stats/birthdays From python at mrabarnett.plus.com Thu Dec 5 15:55:42 2019 From: python at mrabarnett.plus.com (MRAB) Date: Thu, 5 Dec 2019 20:55:42 +0000 Subject: ImportError: No module named Adafruit_SSD1306 Update In-Reply-To: References:

<5de83dea$0$31397$426a74cc@news.free.fr> <1c044f94-c429-b7d8-150c-7068148a841e@kynesim.co.uk> <6d5f872f-5f96-8b00-d6a9-ffab3d177de7@kynesim.co.uk>

Message-ID: On 2019-12-05 20:39, Joel Goldstick wrote: > On Thu, Dec 5, 2019 at 3:31 PM RobH wrote: >> >> On 05/12/2019 19:40, Rhodri James wrote: >> > On 05/12/2019 19:30, Rhodri James wrote: >> >> On 05/12/2019 18:49, RobH wrote: >> >>> Update: >> >>> I did python3 Internet.py >> >>> and now only get this error: >> >>> >> >>> pi at raspberrypi:~/Downloads $ python3 Internet.py >> >>> File "Internet.py", line 24 >> >>> font = ImageFont.truetype( 'Minecraftia.ttf', 35) >> >>> ^ >> >>> TabError: inconsistent use of tabs and spaces in indentation >> >>> >> >>> I cannot see what is wrong, as the text is all lined up with that >> >>> above and below: >> >> >> >> The problem will be that you have a mix of tabs and spaces in your >> >> indentation. This causes problems because some people don't think >> >> that the One True Tab Width is 8 characters ;-) so to them the >> >> indentation looks ragged. Worse, when they mix tabs and spaces, code >> >> that looks to be at the same indentation level to them looks different >> >> to the interpreter. The decision was taken a while ago that Python >> >> should put its foot down about this, and demand that we use either all >> >> tabs or all spaces for our indentation. That's what you've fallen >> >> foul off; there must be a mix of tabs and spaces in that line! >> > >> > Or more likely you've used tabs on that line and spaces elsewhere, or >> > vice versa. I should have remember to say that, sorry. >> > >> >> Ok thanks for the explanation there, and I have placed the cursor at the >> beginning of the first indented line. Moving down 1 line at a time , >> each line is at the same position upto line 157 in the authors code . >> Then it is closer in to the edge upto line 190, where it goes back out >> again. >> >> What is my best course of action here now. >> > google or duckduckgo or whatever your text editor and tabs to spaces.. > there is probably an easy way to convert the file > Or expand the tabs using Python: at the Python prompt, read it in, use the expandtabs method of str, write it back out. From tjreedy at udel.edu Thu Dec 5 18:11:30 2019 From: tjreedy at udel.edu (Terry Reedy) Date: Thu, 5 Dec 2019 18:11:30 -0500 Subject: ImportError: No module named Adafruit_SSD1306 Update In-Reply-To: References:

<5de83dea$0$31397$426a74cc@news.free.fr> <1c044f94-c429-b7d8-150c-7068148a841e@kynesim.co.uk> <6d5f872f-5f96-8b00-d6a9-ffab3d177de7@kynesim.co.uk>

Message-ID: On 12/5/2019 3:55 PM, MRAB wrote: >>> Ok thanks for the explanation there, and I have placed the cursor at the >>> beginning of the first indented line. Moving down 1 line at a time , >>> each line is at the same position upto line 157 in the authors code . >>> Then it is closer in to the edge upto line 190, where it goes back out >>> again. >>> >>> What is my best course of action here now. >>> >> google or duckduckgo or whatever your text editor and tabs to spaces.. >> there is probably an easy way to convert the file >> > Or expand the tabs using Python: at the Python prompt, read it in, use > the expandtabs method of str, write it back out. IDLE's Format menu has an option to do this in memory, so you can check before writing back to disk. -- Terry Jan Reedy From skip.montanaro at gmail.com Fri Dec 6 10:55:20 2019 From: skip.montanaro at gmail.com (Skip Montanaro) Date: Fri, 6 Dec 2019 09:55:20 -0600 Subject: IPC10 archive anywhere? Message-ID: I've poked around a bit and have been unable to come up with an archive of the papers delivered at IPC10 (2001, I believe - pre-PyCon days). Might anyone have a link? Thanks, Skip From skip.montanaro at gmail.com Fri Dec 6 11:08:39 2019 From: skip.montanaro at gmail.com (Skip Montanaro) Date: Fri, 6 Dec 2019 10:08:39 -0600 Subject: IPC10 archive anywhere? In-Reply-To: References: Message-ID: > I've poked around a bit and have been unable to come up with an > archive of the papers delivered at IPC10 (2001, I believe - pre-PyCon > days). Might anyone have a link? Found it a few minutes later: https://legacy.python.org/workshops/ Skip From arj.python at gmail.com Fri Dec 6 12:49:20 2019 From: arj.python at gmail.com (Abdur-Rahmaan Janhangeer) Date: Fri, 6 Dec 2019 21:49:20 +0400 Subject: Zipapp can't find sqlite db Message-ID: Greetings, I'm using zipapp to include a gui + db __main__.py dbs/ file.db When packaging, the db is there. When querying through sqlalchemy, it says can't open db file. Help appreciated! Yours, Abdur-Rahmaan Janhangeer pythonmembers.club | github Mauritius From bob at mellowood.ca Fri Dec 6 13:17:48 2019 From: bob at mellowood.ca (Bob van der Poel) Date: Fri, 6 Dec 2019 11:17:48 -0700 Subject: Unicode filenames Message-ID: I have some files which came off the net with, I'm assuming, unicode characters in the names. I have a very short program which takes the filename and puts into an emacs buffer, and then lets me add information to that new file (it's a poor man's DB). Next, I can look up text in the file and open the saved filename. Everything works great until I hit those darn unicode filenames. Just to confuse me even more, the error seems to be coming from a bit of tkinter code: if sresults.has_key(textAtCursor): bookname = os.path.expanduser(sresults[textAtCursor].strip()) which generates UnicodeWarning: Unicode equal comparison failed to convert both arguments to Unicode - interpreting them as being unequal if sresults.has_key(textAtCursor): I really don't understand the business about "both arguments". Not sure how to proceed here. Hoping for a guideline! Thanks. -- **** Listen to my FREE CD at http://www.mellowood.ca/music/cedars **** Bob van der Poel ** Wynndel, British Columbia, CANADA ** EMAIL: bob at mellowood.ca WWW: http://www.mellowood.ca From PythonList at danceswithmice.info Fri Dec 6 14:40:47 2019 From: PythonList at danceswithmice.info (DL Neil) Date: Sat, 7 Dec 2019 08:40:47 +1300 Subject: Unicode filenames In-Reply-To: References: Message-ID: <2bb4a9d3-4941-1ab5-5acc-acbde6b57e34@DancesWithMice.info> On 7/12/19 7:17 AM, Bob van der Poel wrote: > I have some files which came off the net with, I'm assuming, unicode > characters in the names. I have a very short program which takes the > filename and puts into an emacs buffer, and then lets me add information to > that new file (it's a poor man's DB). > > Next, I can look up text in the file and open the saved filename. > Everything works great until I hit those darn unicode filenames. > > Just to confuse me even more, the error seems to be coming from a bit of > tkinter code: > if sresults.has_key(textAtCursor): > bookname = os.path.expanduser(sresults[textAtCursor].strip()) > > which generates > > UnicodeWarning: Unicode equal comparison failed to convert both arguments > to Unicode - interpreting them as being unequal if > sresults.has_key(textAtCursor): > > I really don't understand the business about "both arguments". Not sure how > to proceed here. Hoping for a guideline! (I'm guessing that) the "both arguments" relates to expanduser() because this is the first time that the fileNM has been identified to Python as anything more than a string of characters. [a fileNM will be a string of characters, but a string of characters is not necessarily a (legal) fileNM!] Further suggesting, that if you are using Python3 (cf 2), your analysis may be the wrong-way-around. Python3 treats strings as Unicode. However, there is, and certainly in the past, was, no requirement for OpSys and IOCS to encode in Unicode. The problem (for me) came from MSFT's (for example) many variations of ISO-8859-n and that there are no clues as to which of these was used in naming the file, and thus many possibly 'translations' into Unicode. You can start to address the issue by using Python's bytes (instead of strings), however that cold reality still intrudes. Do you know the provenance of these files, eg they are in French and from an MS-Win machine? If so, you may be able to use decode() and encode(), but... Still looking for trouble? Knowing a fileNM was in Spanish/Portuguese I was able to take the fileNM's individual Unicode characters/surrogates and subtract an applicable constant, so that accented letters fell 'back' into the correct Unicode range. (this is extremely risky, and could quite easily make matters worse/more confusing). I warn you that pursuing this matter involves disappearing down into a very deep 'rabbit hole', but YMMV! WebRefs: https://docs.python.org/3/howto/unicode.html https://www.dictionary.com/e/slang/rabbit-hole/ -- Regards =dn From ijbrewster at alaska.edu Fri Dec 6 15:58:36 2019 From: ijbrewster at alaska.edu (Israel Brewster) Date: Fri, 6 Dec 2019 11:58:36 -0900 Subject: Make warning an exception? Message-ID: <1EED26AD-2A95-42B0-8D31-58D9854D185B@alaska.edu> I was running some code and I saw this pop up in the console: 2019-12-06 11:53:54.087 Python[85524:39651849] WARNING: nextEventMatchingMask should only be called from the Main Thread! This will throw an exception in the future. The only problem is, I have no idea what is generating that warning - I never call nextEventMatchingMask directly, so it must be getting called from one of the libraries I?m calling. Is there some way I can force python to throw an exception now, so my debugger can catch it and let me know where in my code the originating call is? I?ve tried stepping through the obvious options, with no luck so far. --- Israel Brewster Software Engineer Alaska Volcano Observatory Geophysical Institute - UAF 2156 Koyukuk Drive Fairbanks AK 99775-7320 Work: 907-474-5172 cell: 907-328-9145 From rgaddi at highlandtechnology.invalid Fri Dec 6 16:11:27 2019 From: rgaddi at highlandtechnology.invalid (Rob Gaddi) Date: Fri, 6 Dec 2019 13:11:27 -0800 Subject: Make warning an exception? In-Reply-To: References: <1EED26AD-2A95-42B0-8D31-58D9854D185B@alaska.edu> Message-ID: On 12/6/19 12:58 PM, Israel Brewster wrote: > I was running some code and I saw this pop up in the console: > > 2019-12-06 11:53:54.087 Python[85524:39651849] WARNING: nextEventMatchingMask should only be called from the Main Thread! This will throw an exception in the future. > > The only problem is, I have no idea what is generating that warning - I never call nextEventMatchingMask directly, so it must be getting called from one of the libraries I?m calling. Is there some way I can force python to throw an exception now, so my debugger can catch it and let me know where in my code the originating call is? I?ve tried stepping through the obvious options, with no luck so far. > > --- > Israel Brewster > Software Engineer > Alaska Volcano Observatory > Geophysical Institute - UAF > 2156 Koyukuk Drive > Fairbanks AK 99775-7320 > Work: 907-474-5172 > cell: 907-328-9145 > You need to set the warning filter to "error", which you can do either with warnings.simplefilter at the start of your program or by setting the PYTHONWARNINGS environment variable. https://docs.python.org/3/library/warnings.html#the-warnings-filter This the same project you're having PySide/threading problems on? From PythonList at danceswithmice.info Fri Dec 6 16:16:13 2019 From: PythonList at danceswithmice.info (DL Neil) Date: Sat, 7 Dec 2019 10:16:13 +1300 Subject: Make warning an exception? In-Reply-To: <1EED26AD-2A95-42B0-8D31-58D9854D185B@alaska.edu> References: <1EED26AD-2A95-42B0-8D31-58D9854D185B@alaska.edu> Message-ID: <9a617d11-acc1-f810-2c7c-72d7b8b067b6@DancesWithMice.info> On 7/12/19 9:58 AM, Israel Brewster wrote: > I was running some code and I saw this pop up in the console: > > 2019-12-06 11:53:54.087 Python[85524:39651849] WARNING: nextEventMatchingMask should only be called from the Main Thread! This will throw an exception in the future. > > The only problem is, I have no idea what is generating that warning - I never call nextEventMatchingMask directly, so it must be getting called from one of the libraries I?m calling. Is there some way I can force python to throw an exception now, so my debugger can catch it and let me know where in my code the originating call is? I?ve tried stepping through the obvious options, with no luck so far. We are able to "filter" errors, including turning warnings into full-bore errors. Of possible use: https://docs.python.org/3/library/warnings.html -- Regards =dn From tjreedy at udel.edu Fri Dec 6 18:20:26 2019 From: tjreedy at udel.edu (Terry Reedy) Date: Fri, 6 Dec 2019 18:20:26 -0500 Subject: Unicode filenames In-Reply-To: References: Message-ID: On 12/6/2019 1:17 PM, Bob van der Poel wrote: > I have some files which came off the net with, I'm assuming, unicode > characters in the names. I have a very short program which takes the > filename and puts into an emacs buffer, and then lets me add information to > that new file (it's a poor man's DB). > > Next, I can look up text in the file and open the saved filename. > Everything works great until I hit those darn unicode filenames. > > Just to confuse me even more, the error seems to be coming from a bit of > tkinter code: > if sresults.has_key(textAtCursor): > bookname = os.path.expanduser(sresults[textAtCursor].strip()) 'textAtCursor' does not appear in any 3.9 tkinter/*.py file > which generates > > UnicodeWarning: Unicode equal comparison failed to convert both arguments > to Unicode - interpreting them as being unequal if > sresults.has_key(textAtCursor): > > I really don't understand the business about "both arguments". 'sresults.has_key(textAtCursor)' will see if the hash value of textAtCursor matches the hash value of any key and then compare the strings. 'failed to convert' suggests to me that you are running 2.x and that one of the strings is bytes and the other unicode. Not sure how > to proceed here. Hoping for a guideline! > > Thanks. > > -- Terry Jan Reedy From spayth77 at gmail.com Fri Dec 6 18:53:11 2019 From: spayth77 at gmail.com (Sam Paython) Date: Fri, 6 Dec 2019 15:53:11 -0800 (PST) Subject: Error getting data from website Message-ID: <789b0fa9-e2ef-4daa-9e85-d3c8b34223cd@googlegroups.com> Hi all, This is the code I am writing: import requests from bs4 import BeautifulSoup request = requests.get("https://www.amazon.ca/dp/B07RZFQ6HC") content = request.content soup = BeautifulSoup(content, "html.parser") element = soup.find("span",{"id":"priceblock_dealprice"}) print(element.text.strip()) and this is the error I am getting: C:\Users\Sam\PycharmProjects\untitled2\venv\Scripts\python.exe C:/Users/Sam/PycharmProjects/untitled2/src/app.py Traceback (most recent call last): File "C:/Users/Sam/PycharmProjects/untitled2/src/app.py", line 9, in print(element.text.strip()) AttributeError: 'NoneType' object has no attribute 'text' Could someone please help? From PythonList at DancesWithMice.info Fri Dec 6 19:31:05 2019 From: PythonList at DancesWithMice.info (DL Neil) Date: Sat, 7 Dec 2019 13:31:05 +1300 Subject: Error getting data from website In-Reply-To: <789b0fa9-e2ef-4daa-9e85-d3c8b34223cd@googlegroups.com> References: <789b0fa9-e2ef-4daa-9e85-d3c8b34223cd@googlegroups.com> Message-ID: <0dfb757a-0549-4c15-0e67-48511e10f671@DancesWithMice.info> On 7/12/19 12:53 PM, Sam Paython wrote: > This is the code I am writing: > import requests > from bs4 import BeautifulSoup > request = requests.get("https://www.amazon.ca/dp/B07RZFQ6HC") > content = request.content > soup = BeautifulSoup(content, "html.parser") > element = soup.find("span",{"id":"priceblock_dealprice"}) > print(element.text.strip()) > > and this is the error I am getting: > C:\Users\Sam\PycharmProjects\untitled2\venv\Scripts\python.exe C:/Users/Sam/PycharmProjects/untitled2/src/app.py > Traceback (most recent call last): > File "C:/Users/Sam/PycharmProjects/untitled2/src/app.py", line 9, in > print(element.text.strip()) > AttributeError: 'NoneType' object has no attribute 'text' > > Could someone please help? The err.msg/stack-trace is your friend! The comment about "NoneType" means 'there's nothing there' (roughly!) to print(). The question then becomes: "why?" or "why not?"... With a short piece of code like this, and (I am assuming) trying-out a library for the first time, may I recommend that you use the Python REPL, because it allows you to 'see' what's going-on behind the scenes/underneath the hood - and ultimately, reveals the problem. From a Python terminal (cmd is appropriate to your PC's OpSys): [dn at JrBrown ~]$ python3 Python 3.7.4 (default, Jul 9 2019, 16:48:28) [GCC 8.3.1 20190223 (Red Hat 8.3.1-2)] on linux Type "help", "copyright", "credits" or "license" for more information. >>> import requests >>> from bs4 import BeautifulSoup >>> request = requests.get("https://www.amazon.ca/dp/B07RZFQ6HC") >>> request # notice how I'm asking to 'see' what happened >>> content = request.content >>> content # there is no need to enclose in print()! b'\n 1 np.dtype([('Q', 'I', 'Q')]) > > ValueError: mismatch in size of old and new data-descriptor > > In [3]: np.dtype([('field1', 'Q'), ('field2', 'I'), ('field3', > 'Q')])??????????????????????????????????????????????????????????????? > Out[3]: dtype([('field1', ' > In [4]:??? > > ... and now let's put it all together! s1 = struct.Struct("@QIQ") ss1 = s1.pack(1,11,111) struct_dtype = np.dtype([('field1', 'Q'), ('field2', 'I'), ('field3', 'Q')]) a = np.frombuffer(ss1, dtype=struct_dtype) I'm using the frombuffer() function deliberately so I don't have to figure out the shape of the final array (which is (1,), not (3,), by the way). And hey presto: it raises an exception! > ValueError: buffer size must be a multiple of element size Your example shows a difference between the default behaviour of numpy's structured dtype and the struct module: packing! By default, numpy structured dtypes are closely packed, i.e. nothing is aligned to useful memory boundaries. struct_type.itemsize == 20 The struct module, on the other hand, tries to guess where the C compiler would put its padding. len(ss1) == 24 We can tell numpy to do the same: struct_dtype = np.dtype([('field1', 'Q'), ('field2', 'I'), ('field3', 'Q')], align=True) and then a = np.frombuffer(ss1, dtype=struct_dtype) works and produces array([(1, 11, 111)], ????? dtype={'names':['field1','field2','field3'], 'formats':[' References: <303a8c7b-1c1b-12c0-bb0c-4cd70be8611a@kynesim.co.uk> Message-ID: <6ff313c1-d38b-5c5b-99cc-743413416f2f@kynesim.co.uk> On 19/12/2019 11:23, tommy yama wrote: > Thanks for your kind response. > The error was simply "module Hexdump was not found". Several things: a) Did it really say "module Hexdump was not found"? "hexdump" and "Hexdump" are not the same things; module names are case-sensitive. b) There will have been a whole load more stack trace that might be useful to us. I should have been clearer when I asked you to copy and paste the error that I really meant the whole of the complaint that Python made to you, not just the final error message! Apologies for that. c) I would much prefer it if you didn't top-post, but interleaved your replies like I've done here. I find it hard to follow top-posted messages because they reverse the normal flow of conversation. > > > On Wed, Dec 18, 2019 at 11:39 PM Rhodri James wrote: > >> On 18/12/2019 02:23, tommy yama wrote: >>> Hi, >>> >>> This sounds familiar to somebody? >>> After upgrading my mac OS to Catalina, this persists even after pip3 >>> install hexdump. >>> >>> [image: image.png] >> >> I'm afraid this is a text-only mailing list. Your screenshot has been >> stripped out before any of us saw it. Could you copy and paste (DON'T >> retype!) the error instead, so we can all read it? >> >> -- >> Rhodri James *-* Kynesim Ltd >> -- >> https://mail.python.org/mailman/listinfo/python-list >> > -- Rhodri James *-* Kynesim Ltd From bluebox03 at gmail.com Thu Dec 19 07:43:11 2019 From: bluebox03 at gmail.com (tommy yama) Date: Thu, 19 Dec 2019 12:43:11 +0000 Subject: hexdump module installation error In-Reply-To: <6ff313c1-d38b-5c5b-99cc-743413416f2f@kynesim.co.uk> References: <303a8c7b-1c1b-12c0-bb0c-4cd70be8611a@kynesim.co.uk> <6ff313c1-d38b-5c5b-99cc-743413416f2f@kynesim.co.uk> Message-ID: Hi Rhodri, Thanks for your quick response i did not expect. I hope you see the error below in my response as i just copy and paste it. "no module named 'hexdump'." In addition, i tried to execute python3 hexdump.py. However, no such file directory. from hexdump import hexdump ModuleNotFoundError: No module named 'hexdump' user at USERnoMacBook-Air LibraBrowser % python3 hexdump.py /usr/local/Cellar/python/3.7.5/Frameworks/Python.framework/Versions/3.7/Resources/Python.app/Contents/MacOS/Python: can't open file 'hexdump.py': [Errno 2] No such file or directory user at USERnoMacBook-Air LibraBrowser % Cheers, On Thu, Dec 19, 2019 at 12:21 PM Rhodri James wrote: > On 19/12/2019 11:23, tommy yama wrote: > > Thanks for your kind response. > > The error was simply "module Hexdump was not found". > > Several things: > > a) Did it really say "module Hexdump was not found"? "hexdump" and > "Hexdump" are not the same things; module names are case-sensitive. > > b) There will have been a whole load more stack trace that might be > useful to us. I should have been clearer when I asked you to copy and > paste the error that I really meant the whole of the complaint that > Python made to you, not just the final error message! Apologies for that. > > c) I would much prefer it if you didn't top-post, but interleaved your > replies like I've done here. I find it hard to follow top-posted > messages because they reverse the normal flow of conversation. > > > > > > > On Wed, Dec 18, 2019 at 11:39 PM Rhodri James > wrote: > > > >> On 18/12/2019 02:23, tommy yama wrote: > >>> Hi, > >>> > >>> This sounds familiar to somebody? > >>> After upgrading my mac OS to Catalina, this persists even after pip3 > >>> install hexdump. > >>> > >>> [image: image.png] > >> > >> I'm afraid this is a text-only mailing list. Your screenshot has been > >> stripped out before any of us saw it. Could you copy and paste (DON'T > >> retype!) the error instead, so we can all read it? > >> > >> -- > >> Rhodri James *-* Kynesim Ltd > >> -- > >> https://mail.python.org/mailman/listinfo/python-list > >> > > > > > -- > Rhodri James *-* Kynesim Ltd > From rhodri at kynesim.co.uk Thu Dec 19 07:46:01 2019 From: rhodri at kynesim.co.uk (Rhodri James) Date: Thu, 19 Dec 2019 12:46:01 +0000 Subject: hexdump module installation error In-Reply-To: References: <303a8c7b-1c1b-12c0-bb0c-4cd70be8611a@kynesim.co.uk> <6ff313c1-d38b-5c5b-99cc-743413416f2f@kynesim.co.uk> Message-ID: <218a46b0-520b-aa2e-b02f-9621605b9a52@kynesim.co.uk> On 19/12/2019 12:43, tommy yama wrote: > Thanks for your quick response i did not expect. > I hope you see the error below in my response as i just copy and paste it. > > "no module named 'hexdump'." > > In addition, i tried to execute python3 hexdump.py. However, no such file > directory. > > from hexdump import hexdump > > ModuleNotFoundError: No module named 'hexdump' > > user at USERnoMacBook-Air LibraBrowser % python3 hexdump.py > > /usr/local/Cellar/python/3.7.5/Frameworks/Python.framework/Versions/3.7/Resources/Python.app/Contents/MacOS/Python: > can't open file 'hexdump.py': [Errno 2] No such file or directory > > user at USERnoMacBook-Air LibraBrowser % Huh. You've done a "pip3 install hexdump" so I don't know what might be happening here. Sorry. -- Rhodri James *-* Kynesim Ltd From lele at metapensiero.it Thu Dec 19 08:16:48 2019 From: lele at metapensiero.it (Lele Gaifax) Date: Thu, 19 Dec 2019 14:16:48 +0100 Subject: Cython, producing different modules from the same .pyx Message-ID: <87a77osben.fsf@metapensiero.it> Hi all, in my package, I would like to compile and distribute two different extension modules starting from the same .pyx file, just with different compilation flags and libraries. My first approach has been duplicating the Extension() entry in the setup.py(*), changing the first argument (that is, the name of the module). Although that did produce the alternative binary module, it could not be loaded because it contains the wrong PyInit_XXX(), where XXX is computed from the .pyx name, in my case "parser". I tried looking at Cython documentation, and even its sources, but couldn't find a way to alter that name. Did I miss something, or is the only way to duplicate the source .pyx file to a different name? Thanks in advance, ciao, lele. (*) https://github.com/lelit/pglast/blob/master/setup.py#L76 -- nickname: Lele Gaifax | Quando vivr? di quello che ho pensato ieri real: Emanuele Gaifas | comincer? ad aver paura di chi mi copia. lele at metapensiero.it | -- Fortunato Depero, 1929. From rosuav at gmail.com Thu Dec 19 09:08:00 2019 From: rosuav at gmail.com (Chris Angelico) Date: Fri, 20 Dec 2019 01:08:00 +1100 Subject: hexdump module installation error In-Reply-To: References: <303a8c7b-1c1b-12c0-bb0c-4cd70be8611a@kynesim.co.uk> <6ff313c1-d38b-5c5b-99cc-743413416f2f@kynesim.co.uk> Message-ID: On Thu, Dec 19, 2019 at 11:44 PM tommy yama wrote: > > Hi Rhodri, > > Thanks for your quick response i did not expect. > I hope you see the error below in my response as i just copy and paste it. > > "no module named 'hexdump'." > > In addition, i tried to execute python3 hexdump.py. However, no such file > directory. > > from hexdump import hexdump > Did you, at some point, have a file called hexdump.py that you were playing with? Sometimes, even after you delete a file with a conflicting name, its .pyc file hangs around. Blow away the __pycache__ directory to get rid of it. ChrisA From bluebox03 at gmail.com Thu Dec 19 09:09:12 2019 From: bluebox03 at gmail.com (tommy yama) Date: Thu, 19 Dec 2019 14:09:12 +0000 Subject: hexdump module installation error In-Reply-To: <218a46b0-520b-aa2e-b02f-9621605b9a52@kynesim.co.uk> References: <303a8c7b-1c1b-12c0-bb0c-4cd70be8611a@kynesim.co.uk> <6ff313c1-d38b-5c5b-99cc-743413416f2f@kynesim.co.uk> <218a46b0-520b-aa2e-b02f-9621605b9a52@kynesim.co.uk> Message-ID: Yes. thanks for your enthusiasm. i may raise this in the git On Thu, Dec 19, 2019 at 12:46 PM Rhodri James wrote: > On 19/12/2019 12:43, tommy yama wrote: > > Thanks for your quick response i did not expect. > > I hope you see the error below in my response as i just copy and paste > it. > > > > "no module named 'hexdump'." > > > > In addition, i tried to execute python3 hexdump.py. However, no such file > > directory. > > > > from hexdump import hexdump > > > > ModuleNotFoundError: No module named 'hexdump' > > > > user at USERnoMacBook-Air LibraBrowser % python3 hexdump.py > > > > > /usr/local/Cellar/python/3.7.5/Frameworks/Python.framework/Versions/3.7/Resources/Python.app/Contents/MacOS/Python: > > can't open file 'hexdump.py': [Errno 2] No such file or directory > > > > user at USERnoMacBook-Air LibraBrowser % > > Huh. You've done a "pip3 install hexdump" so I don't know what might be > happening here. Sorry. > > -- > Rhodri James *-* Kynesim Ltd > From onlinejudge95 at gmail.com Thu Dec 19 09:11:40 2019 From: onlinejudge95 at gmail.com (onlinejudge95) Date: Thu, 19 Dec 2019 19:41:40 +0530 Subject: Understanding of GIL Message-ID: Hi Devs, I am currently writing some custom Django commands for data updation, my workflow is like Fetch data from *PostgreSQL*. Call *Elasticsearch* for searching based on the data fetched from *PostgreSQL*. Query *PostgreSQL* and do an upsert behavior. I am using pandas data frame to hold my data during processing. The host we are using to run this jobs has a *CPython* interpreter as given by `platform.python_implementation()` I want to confirm whether *multithreading* would be a better choice here, given the fact that GIL is the biggest blocker(I agree it has to be there) for the same in CPython interpreters. In case further information is required do let me know. Thanks onlinejudge95 From pieter-l at vanoostrum.org Thu Dec 19 09:22:39 2019 From: pieter-l at vanoostrum.org (Pieter van Oostrum) Date: Thu, 19 Dec 2019 15:22:39 +0100 Subject: INHERITANCE in python3 References: <1681866660.1226419.1576728600632.ref@mail.yahoo.com> <1681866660.1226419.1576728600632@mail.yahoo.com> <250232d0-d5ce-4007-89a0-32e0fc23590a@www.fastmail.com> Message-ID: Random832 writes: > On Wed, Dec 18, 2019, at 23:10, vahid asadi via Python-list wrote: >> HI guys this is my first post on python mailing lists ever and i glad >> to do this. >> my problem here is why this attribute is not recognize by python and it >> raise an traceback error that said 'there is no such p.family >> attribute'. although i use multiple ??inheritance with 'super ' it not >> works. thanks for your help. >> >> ```?class Username:? ? def __init__(self,name,*args):? ? ? ? self.name >> = name >> class Userfamily:? ? def __init__(self,family,*args):? ? ? ? >> self.family = family >> class Person(Username,Userfamily):? ? def __init__(self,*args):? ? ? ? >> super().__init__(*args) >> >> >> p = Person("v","aaa")print(p.name)print(p.family)``` Please next time, supply a properly indented Python source, with only normal ASCII spaces, not no-break spaces, i.e. exactly like in your Python source code. > > The Username class also needs to call super(). In general, super() is > intended to be used with all classes that might be part of a multiple > inheritance hierarchy, not just the derived one. Just for safety, also add it to the Userfamily class. -- Pieter van Oostrum www: http://pieter.vanoostrum.org/ PGP key: [8DAE142BE17999C4] From pieter-l at vanoostrum.org Thu Dec 19 09:30:52 2019 From: pieter-l at vanoostrum.org (Pieter van Oostrum) Date: Thu, 19 Dec 2019 15:30:52 +0100 Subject: hexdump module installation error References: <303a8c7b-1c1b-12c0-bb0c-4cd70be8611a@kynesim.co.uk> <6ff313c1-d38b-5c5b-99cc-743413416f2f@kynesim.co.uk>

Message-ID: tommy yama writes: > user at USERnoMacBook-Air LibraBrowser % python3 hexdump.py > > /usr/local/Cellar/python/3.7.5/Frameworks/Python.framework/Versions/3.7/Resources/Python.app/Contents/MacOS/Python: > can't open file 'hexdump.py': [Errno 2] No such file or directory > > user at USERnoMacBook-Air LibraBrowser % Could it be that your pip3 belongs to a different Python than the one above (for example a Python 3.8 or 3.6)? What is the output of 'pip3 --version' (without quotes)? -- Pieter van Oostrum www: http://pieter.vanoostrum.org/ PGP key: [8DAE142BE17999C4] From ethan at stoneleaf.us Thu Dec 19 12:35:49 2019 From: ethan at stoneleaf.us (Ethan Furman) Date: Thu, 19 Dec 2019 09:35:49 -0800 Subject: Cython, producing different modules from the same .pyx In-Reply-To: <87a77osben.fsf@metapensiero.it> References: <87a77osben.fsf@metapensiero.it> Message-ID: <897ebc73-44f1-461f-28e1-9119c76538a4@stoneleaf.us> On 12/19/2019 05:16 AM, Lele Gaifax wrote: > in my package, I would like to compile and distribute two different extension > modules starting from the same .pyx file, just with different compilation > flags and libraries. If you don't get an answer here, you can try the Cython Users group: https://groups.google.com/forum/#!forum/cython-users -- ~Ethan~ From ethan at stoneleaf.us Thu Dec 19 12:28:21 2019 From: ethan at stoneleaf.us (Ethan Furman) Date: Thu, 19 Dec 2019 09:28:21 -0800 Subject: INHERITANCE in python3 In-Reply-To: References: <1681866660.1226419.1576728600632.ref@mail.yahoo.com> <1681866660.1226419.1576728600632@mail.yahoo.com> <250232d0-d5ce-4007-89a0-32e0fc23590a@www.fastmail.com> Message-ID: <3e2e66f4-f5e4-0bf2-b9fc-93e71cbc3755@stoneleaf.us> On 12/19/2019 06:22 AM, Pieter van Oostrum wrote: > Random832 writes: >> On Wed, Dec 18, 2019, at 23:10, wrote: [vahid asadi] >>> my problem here is why this attribute is not recognize by python and it >>> raise an traceback error that said 'there is no such p.family >>> attribute'. although i use multiple inheritance with 'super ' it not >>> works. thanks for your help. >>> >>> class Username: >>> def __init__(self, name, *args): >>> self.name= name >>> >>> class Userfamily: >>> def __init__(self, family, *args): >>> self.family = family >>> >>> class Person(Username, Userfamily): >>> def __init__(self, *args): >>> super().__init__(*args) >>> >>> p = Person("v", "aaa") >>> print(p.name) >>> print(p.family) [Random32] >> The Username class also needs to call super(). In general, super() is >> intended to be used with all classes that might be part of a multiple >> inheritance hierarchy, not just the derived one. [Pieter van Oostrum] > Just for safety, also add it* to the Userfamily class. * a call to super() (in case anybody else misreads that like I did) -- ~Ethan~ From * at eli.users.panix.com Thu Dec 19 16:04:57 2019 From: * at eli.users.panix.com (Eli the Bearded) Date: Thu, 19 Dec 2019 21:04:57 +0000 (UTC) Subject: on sorting things References:

Message-ID: In comp.lang.python, Peter Otten <__peter__ at web.de> wrote: > Eli the Bearded wrote: >> But what caught my eye most, as someone relatively new to Python but >> with long experience in C in Perl, is sorting doesn't take a s/C in /C and/ Ugh. >> *comparison* function, it takes a *key generator* function, and that >> function is supposed to transform the thing into something that the >> native comparison knows how to compare. >> >> This seems a strange choice, and I'm wondering if someone can explain >> the benefits of doing it that way to me. > > Python 2 started with a comparison function and then grew a key function. > With a key function you still have to compare items, you are just breaking > the comparison into two steps: [snip] Thanks for that good explanation. The benchmark comparison makes it very thorough. In my mind I gravitate towards the complicated sorts of sort that can be quickly compared for some sorts of keys and not as quickly for others. Consider a sort that first compares file size and if the same number of bytes, then compares file checksum. Any decently scaled real world implementation would memoize the checksum for speed, but only work it out for files that do not have a unique file size. The key method requires it worked out in advance for everything. But I see the key method handles the memoization under the hood for you, so those simpler, more common sorts of sort get an easy to see benefit. Elijah ------ even memoizing the stat() calls would help for large lists From rosuav at gmail.com Thu Dec 19 16:23:44 2019 From: rosuav at gmail.com (Chris Angelico) Date: Fri, 20 Dec 2019 08:23:44 +1100 Subject: on sorting things In-Reply-To: References:

Message-ID: On Fri, Dec 20, 2019 at 8:06 AM Eli the Bearded <*@eli.users.panix.com> wrote: > > In comp.lang.python, Peter Otten <__peter__ at web.de> wrote: > > Eli the Bearded wrote: > >> But what caught my eye most, as someone relatively new to Python but > >> with long experience in C in Perl, is sorting doesn't take a > > s/C in /C and/ > > Ugh. > > >> *comparison* function, it takes a *key generator* function, and that > >> function is supposed to transform the thing into something that the > >> native comparison knows how to compare. > >> > >> This seems a strange choice, and I'm wondering if someone can explain > >> the benefits of doing it that way to me. > > > > Python 2 started with a comparison function and then grew a key function. > > With a key function you still have to compare items, you are just breaking > > the comparison into two steps: > > [snip] > > Thanks for that good explanation. The benchmark comparison makes it > very thorough. > > In my mind I gravitate towards the complicated sorts of sort that can be > quickly compared for some sorts of keys and not as quickly for others. > > Consider a sort that first compares file size and if the same number of > bytes, then compares file checksum. Any decently scaled real world > implementation would memoize the checksum for speed, but only work it out > for files that do not have a unique file size. The key method requires > it worked out in advance for everything. > > But I see the key method handles the memoization under the hood for you, > so those simpler, more common sorts of sort get an easy to see benefit. > I guess that's a strange situation that might actually need this kind of optimization, but if you really do have that situation, you can make a magical key that behaves the way you want. class SizeChecksum: def __init__(self, fn): self.size = os.stat(fn).st_size self._checksum = None @property def checksum(self): if self._checksum is not None: return self._checksum ... def __lt__(self, other): if self.size != other.size: return self.size < other.size return self.checksum < other.checksum I can't remember exactly which comparison operators list.sort() uses, so you'd want to add another function here for an equality check, or maybe <=, or whichever is used. In any case, what matters is that the weird situations can still be handled, albeit by largely falling back on comparison semantics. But for the vast majority of situations, all you need is a key function that returns a number, a string, or a tuple of numbers and strings. ChrisA From pahome.chen at mirlab.org Thu Dec 19 23:33:25 2019 From: pahome.chen at mirlab.org (lampahome) Date: Fri, 20 Dec 2019 12:33:25 +0800 Subject: How to improve epoll speed when recv from kernel via netlink? Message-ID: I tried to receive msg from kernel via netlink of socket. And I use epoll to receive netlink events whenever it comes from kernel to user space. But I found the performance is poor e.g. epoll costs 90% time of execution time after I profile it by cProfile module. Are there any tips to improve this? From greg.ewing at canterbury.ac.nz Fri Dec 20 00:58:55 2019 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Fri, 20 Dec 2019 18:58:55 +1300 Subject: Cython, producing different modules from the same .pyx In-Reply-To: References: <87a77osben.fsf@metapensiero.it> Message-ID: On 20/12/19 2:16 am, Lele Gaifax wrote: > My first approach has been duplicating the Extension() entry in the > setup.py(*), changing the first argument (that is, the name of the module). > Although that did produce the alternative binary module, it could not be > loaded because it contains the wrong PyInit_XXX(), where XXX is computed from > the .pyx name, You could try creating a set of top-level .pyx stubs, each of which just 'include' the real code. -- Greg From p4j at j4d.net Fri Dec 20 07:12:24 2019 From: p4j at j4d.net (Pankaj Jangid) Date: Fri, 20 Dec 2019 17:42:24 +0530 Subject: [HN] How to Make Python Wait Message-ID: https://news.ycombinator.com/item?id=21834408 From rosuav at gmail.com Fri Dec 20 08:58:22 2019 From: rosuav at gmail.com (Chris Angelico) Date: Sat, 21 Dec 2019 00:58:22 +1100 Subject: [HN] How to Make Python Wait In-Reply-To: References: Message-ID: On Fri, Dec 20, 2019 at 11:16 PM Pankaj Jangid wrote: > > https://news.ycombinator.com/item?id=21834408 > > Did you just post a blog article, then spam everywhere to try to get traffic, where your entire blog post is telling people worse ways to do a time.sleep()? ChrisA From arj.python at gmail.com Fri Dec 20 09:22:26 2019 From: arj.python at gmail.com (Abdur-Rahmaan Janhangeer) Date: Fri, 20 Dec 2019 18:22:26 +0400 Subject: [HN] How to Make Python Wait In-Reply-To: References:

Message-ID: Original from miguel grinberg From nt_mahmood at yahoo.com Fri Dec 20 02:22:00 2019 From: nt_mahmood at yahoo.com (Mahmood Naderan) Date: Fri, 20 Dec 2019 07:22:00 +0000 (UTC) Subject: Unable to install "collect" via pip3 References: <414413727.1095151.1576826520979.ref@mail.yahoo.com> Message-ID: <414413727.1095151.1576826520979@mail.yahoo.com> Hi I can install collect with pip for python2.7 $ pip install --user collect Collecting collect Using cached https://files.pythonhosted.org/packages/cf/5e/c0f0f51d081665374a2c219ea4ba23fb1e179b70dded96dc16606786d828/collect-0.1.1.tar.gz Collecting couchdbkit>=0.5.7 (from collect) Using cached https://files.pythonhosted.org/packages/a1/13/9e9ff695a385c44f62b4766341b97f2bd8b596962df2a0beabf358468b70/couchdbkit-0.6.5.tar.gz Collecting restkit>=4.2.2 (from couchdbkit>=0.5.7->collect) Downloading https://files.pythonhosted.org/packages/76/b9/d90120add1be718f853c53008cf5b62d74abad1d32bd1e7097dd913ae053/restkit-4.2.2.tar.gz (1.3MB) 100% |????????????????????????????????| 1.3MB 633kB/s Collecting http-parser>=0.8.3 (from restkit>=4.2.2->couchdbkit>=0.5.7->collect) Downloading https://files.pythonhosted.org/packages/07/c4/22e3c76c2313c26dd5f84f1205b916ff38ea951aab0c4544b6e2f5920d64/http-parser-0.8.3.tar.gz (83kB) 100% |????????????????????????????????| 92kB 2.4MB/s Collecting socketpool>=0.5.3 (from restkit>=4.2.2->couchdbkit>=0.5.7->collect) Downloading https://files.pythonhosted.org/packages/d1/39/fae99a735227234ffec389b252c6de2bc7816bf627f56b4c558dc46c85aa/socketpool-0.5.3.tar.gz Building wheels for collected packages: collect, couchdbkit, restkit, http-parser, socketpool Running setup.py bdist_wheel for collect ... done Stored in directory: /home/mnaderan/.cache/pip/wheels/b9/7c/7c/b09b334cc0e27b4f63ee9f6f19ca1f3db8672666a7e0f3d9cd Running setup.py bdist_wheel for couchdbkit ... done Stored in directory: /home/mnaderan/.cache/pip/wheels/f6/05/1b/f8f576ef18564bc68ab6e64f405e1263448036208cafb221e0 Running setup.py bdist_wheel for restkit ... done Stored in directory: /home/mnaderan/.cache/pip/wheels/48/c5/32/d0d25fb272791a68c49c26150f332d9b9492d0bc9ea0cdd2c7 Running setup.py bdist_wheel for http-parser ... done Stored in directory: /home/mnaderan/.cache/pip/wheels/22/db/06/cb609a3345e7aa87206de160f00cc6af364650d1139d904a25 Running setup.py bdist_wheel for socketpool ... done Stored in directory: /home/mnaderan/.cache/pip/wheels/93/f6/8c/65924848766618647078cb66b1d964e8b80876536e84517469 Successfully built collect couchdbkit restkit http-parser socketpool Installing collected packages: http-parser, socketpool, restkit, couchdbkit, collect Successfully installed collect-0.1.1 couchdbkit-0.6.5 http-parser-0.8.3 restkit-4.2.2 socketpool-0.5.3 However, pip3 fails with this error $ pip3 install --user collect Collecting collect Using cached https://files.pythonhosted.org/packages/cf/5e/c0f0f51d081665374a2c219ea4ba23fb1e179b70dded96dc16606786d828/collect-0.1.1.tar.gz Collecting couchdbkit>=0.5.7 (from collect) Using cached https://files.pythonhosted.org/packages/a1/13/9e9ff695a385c44f62b4766341b97f2bd8b596962df2a0beabf358468b70/couchdbkit-0.6.5.tar.gz Complete output from command python setup.py egg_info: Traceback (most recent call last): File "", line 1, in File "/tmp/pip-build-qf95n0tt/couchdbkit/setup.py", line 25, in long_description = file( NameError: name 'file' is not defined ---------------------------------------- Command "python setup.py egg_info" failed with error code 1 in /tmp/pip-build-qf95n0tt/couchdbkit/ I can not figure out what is the problem. Any way to fix that? More info: $ which python /usr/bin/python $ ls -l /usr/bin/python lrwxrwxrwx 1 root root 9 Apr 16 2018 /usr/bin/python -> python2.7 $ which python3 /usr/bin/python3 $ ls -l /usr/bin/python3 lrwxrwxrwx 1 root root 9 Jun 21 2018 /usr/bin/python3 -> python3.6 Regards, Mahmood From rainer.woitok at gmail.com Fri Dec 20 07:19:26 2019 From: rainer.woitok at gmail.com (Dr Rainer Woitok) Date: Fri, 20 Dec 2019 13:19:26 +0100 Subject: Problems with "Tarfile.close()" Message-ID: <24060.48206.298944.886236@woitok.gmail.com> Greetings, One of my Python scripts basically does the following: source = tarfile.open(name=tar_archive , mode='r|*') dest = tarfile.open(fileobj=sys.stdout, mode='w|', format=fmt) . . . source.close() dest.close() In an attempt to move my Python scripts from Python 2.7 to Python 3.6 I ran into the problem that under Python 3.6 the call to "dest.close()" fails: Traceback (most recent call last): File ".../tar_archive.copy", line 137, in dest.close() File "/usr/lib64/python3.6/tarfile.py", line 1742, in close self.fileobj.close() File "/usr/lib64/python3.6/tarfile.py", line 467, in close self.fileobj.write(self.buf) TypeError: write() argument must be str, not bytes What am I doing wrong? By the way: since on some hosts this script is running on the transition from Python 2.7 to Python 3.x will not happen immediately, I need a solution which works with both versions. Sincerely, Rainer From rosuav at gmail.com Fri Dec 20 10:30:39 2019 From: rosuav at gmail.com (Chris Angelico) Date: Sat, 21 Dec 2019 02:30:39 +1100 Subject: Unable to install "collect" via pip3 In-Reply-To: <414413727.1095151.1576826520979@mail.yahoo.com> References: <414413727.1095151.1576826520979.ref@mail.yahoo.com> <414413727.1095151.1576826520979@mail.yahoo.com> Message-ID: On Sat, Dec 21, 2019 at 2:25 AM Mahmood Naderan via Python-list wrote: > > Hi > > I can install collect with pip for python2.7 > $ pip install --user collect > However, pip3 fails with this error > $ pip3 install --user collect > NameError: name 'file' is not defined > > I can not figure out what is the problem. Any way to fix that? > Are you trying to install this package? https://pypi.org/project/collect/ It's alpha software that does not claim to support Python 3. The last release was in 2011. The web site it links to is defunct. Maybe that wasn't what you intended to install? Check the spelling of the package you're trying to install - maybe it's named slightly differently. ChrisA From rosuav at gmail.com Fri Dec 20 10:34:28 2019 From: rosuav at gmail.com (Chris Angelico) Date: Sat, 21 Dec 2019 02:34:28 +1100 Subject: Problems with "Tarfile.close()" In-Reply-To: <24060.48206.298944.886236@woitok.gmail.com> References: <24060.48206.298944.886236@woitok.gmail.com> Message-ID: On Sat, Dec 21, 2019 at 2:29 AM Dr Rainer Woitok wrote: > > Greetings, > > One of my Python scripts basically does the following: > > source = tarfile.open(name=tar_archive , mode='r|*') > dest = tarfile.open(fileobj=sys.stdout, mode='w|', format=fmt) > > . > . > . > > source.close() > dest.close() > > In an attempt to move my Python scripts from Python 2.7 to Python 3.6 I > ran into the problem that under Python 3.6 the call to "dest.close()" > fails: (I think the fact that it fails when you close the file is a red herring; it would fail at some point, and it happens to hold things over until it closes.) > Traceback (most recent call last): > File ".../tar_archive.copy", line 137, in > dest.close() > File "/usr/lib64/python3.6/tarfile.py", line 1742, in close > self.fileobj.close() > File "/usr/lib64/python3.6/tarfile.py", line 467, in close > self.fileobj.write(self.buf) > TypeError: write() argument must be str, not bytes > > What am I doing wrong? By the way: since on some hosts this script is > running on the transition from Python 2.7 to Python 3.x will not happen > immediately, I need a solution which works with both versions. > Possibly the easiest fix would be to open all files in binary mode. I think your content is binary anyway by the look of it. Just add the letter "b" to the end of your open file modes, and on Py2, it'll ensure that no newline conversion happens (mainly an issue on Windows, which is why you probably don't feel the need to do this), but on Py3, it means that it expects bytestrings everywhere. If that *doesn't* work, then you may need to mark some of your strings as binary, if you have unadorned strings containing ASCII data. ChrisA From ethan at stoneleaf.us Fri Dec 20 10:41:51 2019 From: ethan at stoneleaf.us (Ethan Furman) Date: Fri, 20 Dec 2019 07:41:51 -0800 Subject: Problems with "Tarfile.close()" In-Reply-To: <24060.48206.298944.886236@woitok.gmail.com> References: <24060.48206.298944.886236@woitok.gmail.com> Message-ID: <213f5084-5007-8a87-5ac3-83cbd734e585@stoneleaf.us> On 12/20/2019 04:19 AM, Dr Rainer Woitok wrote: > One of my Python scripts basically does the following: > > source = tarfile.open(name=tar_archive , mode='r|*') > dest = tarfile.open(fileobj=sys.stdout, mode='w|', format=fmt) > > . > . > . > > source.close() > dest.close() > > In an attempt to move my Python scripts from Python 2.7 to Python 3.6 I > ran into the problem that under Python 3.6 the call to "dest.close()" > fails: > > Traceback (most recent call last): > File ".../tar_archive.copy", line 137, in > dest.close() > File "/usr/lib64/python3.6/tarfile.py", line 1742, in close > self.fileobj.close() > File "/usr/lib64/python3.6/tarfile.py", line 467, in close > self.fileobj.write(self.buf) > TypeError: write() argument must be str, not bytes In Python 3 `sys.stdout` is a character interface, not bytes. There are a couple solutions to the Python 3 aspect of the problem here: https://stackoverflow.com/q/908331/208880 If those answers do not work on Python 2 you'll need to detect which version you are on and act appropriately, perhaps hiding that bit of complexity in a function or class. -- ~Ethan~ From barry at barrys-emacs.org Fri Dec 20 10:01:41 2019 From: barry at barrys-emacs.org (Barry Scott) Date: Fri, 20 Dec 2019 15:01:41 +0000 Subject: How to improve epoll speed when recv from kernel via netlink? In-Reply-To: References: Message-ID: > On 20 Dec 2019, at 04:33, lampahome wrote: > > I tried to receive msg from kernel via netlink of socket. > > And I use epoll to receive netlink events whenever it comes from kernel to > user space. > > But I found the performance is poor e.g. epoll costs 90% time of execution > time after I profile it by cProfile module. > > Are there any tips to improve this? cProfile is telling you how long the code is waiting for epoll to return. It is not telling that epoll is a problem. Barry > -- > https://mail.python.org/mailman/listinfo/python-list > From p4j at j4d.net Fri Dec 20 12:16:38 2019 From: p4j at j4d.net (p4j at j4d.net) Date: Fri, 20 Dec 2019 22:46:38 +0530 Subject: [HN] How to Make Python Wait References:

Message-ID: >> https://news.ycombinator.com/item?id=21834408 > Did you just post a blog article, then spam everywhere to try to get > traffic, where your entire blog post is telling people worse ways to > do a time.sleep()? Blog post is not mine. I have a habit of posting good things to HN. When I got notification that it is getting good traction, people are discussing, then I posted the discussion link here. If that is not liked here then I won't post again. From rainer.woitok at gmail.com Fri Dec 20 12:20:11 2019 From: rainer.woitok at gmail.com (Dr Rainer Woitok) Date: Fri, 20 Dec 2019 18:20:11 +0100 Subject: Problems with "Tarfile.close()" In-Reply-To: Msg <213f5084-5007-8a87-5ac3-83cbd734e585@stoneleaf.us> of 2019-12-20 07:41:51 -0800 from ethan@stoneleaf.us References: <24060.48206.298944.886236@woitok.gmail.com> <213f5084-5007-8a87-5ac3-83cbd734e585@stoneleaf.us> Message-ID: <24061.715.861556.185941@woitok.gmail.com> Ethan, On Friday, 2019-12-20 07:41:51 -0800, you wrote: > ... > In Python 3 `sys.stdout` is a character interface, not bytes. Does that mean that with Python 3 "Tarfile" is no longer able to write the "tar" file to a pipe? Or is there now another way to write to a pipe? And if that new way also worked with Python 2, it would be even better ... :-) > There are a couple solutions to the Python 3 aspect of the problem here: > > https://stackoverflow.com/q/908331/208880 Using "sys.stdout.buffer" seems to work in Python 3 (at least with my current rather trivial test case) but does not work in Python 2. Quest- ion: what is the cheapest way to retrieve the Python version the script is executing in? Sincerely, Rainer From __peter__ at web.de Fri Dec 20 13:01:34 2019 From: __peter__ at web.de (Peter Otten) Date: Fri, 20 Dec 2019 19:01:34 +0100 Subject: on sorting things References:

Message-ID: Eli the Bearded wrote: > In comp.lang.python, Peter Otten <__peter__ at web.de> wrote: >> Eli the Bearded wrote: >>> But what caught my eye most, as someone relatively new to Python but >>> with long experience in C in Perl, is sorting doesn't take a > > s/C in /C and/ > > Ugh. > >>> *comparison* function, it takes a *key generator* function, and that >>> function is supposed to transform the thing into something that the >>> native comparison knows how to compare. >>> >>> This seems a strange choice, and I'm wondering if someone can explain >>> the benefits of doing it that way to me. >> >> Python 2 started with a comparison function and then grew a key function. >> With a key function you still have to compare items, you are just >> breaking the comparison into two steps: > > [snip] > > Thanks for that good explanation. The benchmark comparison makes it > very thorough. > > In my mind I gravitate towards the complicated sorts of sort that can be > quickly compared for some sorts of keys and not as quickly for others. > > Consider a sort that first compares file size and if the same number of > bytes, then compares file checksum. Any decently scaled real world > implementation would memoize the checksum for speed, but only work it out > for files that do not have a unique file size. The key method requires > it worked out in advance for everything. Oscar already mentioned the functools.cmp_to_key decorator which makes this a non-issue: def mycmp(a, b): ... files.sort(key=cmp_to_key(mycmp)) Applied to your example, with memoization: # untested def cmp(a, b): return (a > b)-(a < b) def make_file_key(): size = functools.lru_cache(None)(getsize) checksum = functools.lru_cache(None)(getchecksum) @functools.cmp_to_key def file_key(a, b): return cmp(size(a), size(b)) or cmp(checksum(a), checksum(b)) return file_key files.sort(key=make_file_key()) > But I see the key method handles the memoization under the hood for you, > so those simpler, more common sorts of sort get an easy to see benefit. > > Elijah > ------ > even memoizing the stat() calls would help for large lists PS: If you are sorting files by size and checksum as part of a deduplication effort consider using dict-s instead: def grouped(items, key): result = defaultdict(list) for item in items: result[key(item)].append(item) return result for same_size in grouped(files, key=getsize).values(): if len(same_size) > 1: for same_checksum in grouped(same_size, key=getchecksum).values(): if len(same_checksum) > 1: print(same_checksum) From rosuav at gmail.com Fri Dec 20 13:11:21 2019 From: rosuav at gmail.com (Chris Angelico) Date: Sat, 21 Dec 2019 05:11:21 +1100 Subject: on sorting things In-Reply-To: References:

Message-ID: On Sat, Dec 21, 2019 at 5:03 AM Peter Otten <__peter__ at web.de> wrote: > PS: If you are sorting files by size and checksum as part of a deduplication > effort consider using dict-s instead: Yeah, I'd agree if that's the purpose. But let's say the point is to have a guaranteed-stable ordering of files that are primarily to be sorted by file size - in order to ensure that two files are in the same order every time you refresh the view, they get sorted by their checksums. There ARE good reasons to do weird things with sorting, and a custom key object (either with cmp_to_key or directly implemented) can do that. ChrisA From lele at metapensiero.it Fri Dec 20 13:33:47 2019 From: lele at metapensiero.it (Lele Gaifax) Date: Fri, 20 Dec 2019 19:33:47 +0100 Subject: Cython, producing different modules from the same .pyx References: <87a77osben.fsf@metapensiero.it> Message-ID: <87h81uyhh0.fsf@metapensiero.it> Greg Ewing writes: > You could try creating a set of top-level .pyx stubs, each of > which just 'include' the real code. Thank you, will try this approach! ciao, lele. -- nickname: Lele Gaifax | Quando vivr? di quello che ho pensato ieri real: Emanuele Gaifas | comincer? ad aver paura di chi mi copia. lele at metapensiero.it | -- Fortunato Depero, 1929. From lele at metapensiero.it Fri Dec 20 13:35:04 2019 From: lele at metapensiero.it (Lele Gaifax) Date: Fri, 20 Dec 2019 19:35:04 +0100 Subject: Cython, producing different modules from the same .pyx References: <87a77osben.fsf@metapensiero.it> <897ebc73-44f1-461f-28e1-9119c76538a4@stoneleaf.us> Message-ID: <87a77myhev.fsf@metapensiero.it> Ethan Furman writes: > If you don't get an answer here, you can try the Cython Users group: Thanks, reposted the same question there. ciao, lele. -- nickname: Lele Gaifax | Quando vivr? di quello che ho pensato ieri real: Emanuele Gaifas | comincer? ad aver paura di chi mi copia. lele at metapensiero.it | -- Fortunato Depero, 1929. From __peter__ at web.de Fri Dec 20 13:59:53 2019 From: __peter__ at web.de (Peter Otten) Date: Fri, 20 Dec 2019 19:59:53 +0100 Subject: on sorting things References:

Message-ID: Chris Angelico wrote: > On Sat, Dec 21, 2019 at 5:03 AM Peter Otten <__peter__ at web.de> wrote: >> PS: If you are sorting files by size and checksum as part of a >> deduplication effort consider using dict-s instead: > > Yeah, I'd agree if that's the purpose. But let's say the point is to > have a guaranteed-stable ordering of files that are primarily to be > sorted by file size - in order to ensure that two files are in the > same order every time you refresh the view, they get sorted by their > checksums. One thing that struck me about Eli's example is that it features two key functions rather than a complex comparison. If sort() would accept a sequence of key functions each function could be used to sort slices that compare equal when using the previous key. > There ARE good reasons to do weird things with sorting, and a custom > key object (either with cmp_to_key or directly implemented) can do > that. Indeed. From __peter__ at web.de Fri Dec 20 14:12:11 2019 From: __peter__ at web.de (Peter Otten) Date: Fri, 20 Dec 2019 20:12:11 +0100 Subject: Problems with "Tarfile.close()" References: <24060.48206.298944.886236@woitok.gmail.com> <213f5084-5007-8a87-5ac3-83cbd734e585@stoneleaf.us> <24061.715.861556.185941@woitok.gmail.com> Message-ID: Dr Rainer Woitok wrote: > Ethan, > > On Friday, 2019-12-20 07:41:51 -0800, you wrote: > >> ... >> In Python 3 `sys.stdout` is a character interface, not bytes. > > Does that mean that with Python 3 "Tarfile" is no longer able to write > the "tar" file to a pipe? Or is there now another way to write to a > pipe? And if that new way also worked with Python 2, it would be even > better ... :-) > >> There are a couple solutions to the Python 3 aspect of the problem here: >> >> https://stackoverflow.com/q/908331/208880 > > Using "sys.stdout.buffer" seems to work in Python 3 (at least with my > current rather trivial test case) but does not work in Python 2. Quest- > ion: what is the cheapest way to retrieve the Python version the script > is executing in? While I didn't look into the stackoverflow page an easy way to get something that accepts bytes may be # untested stdout = sys.stdout try: stdout = stdout.buffer except AttributeError: pass tarfile.open(fileobj=stdout, ...) From rosuav at gmail.com Fri Dec 20 14:44:03 2019 From: rosuav at gmail.com (Chris Angelico) Date: Sat, 21 Dec 2019 06:44:03 +1100 Subject: on sorting things In-Reply-To: References:

Message-ID: On Sat, Dec 21, 2019 at 6:01 AM Peter Otten <__peter__ at web.de> wrote: > > Chris Angelico wrote: > > > On Sat, Dec 21, 2019 at 5:03 AM Peter Otten <__peter__ at web.de> wrote: > >> PS: If you are sorting files by size and checksum as part of a > >> deduplication effort consider using dict-s instead: > > > > Yeah, I'd agree if that's the purpose. But let's say the point is to > > have a guaranteed-stable ordering of files that are primarily to be > > sorted by file size - in order to ensure that two files are in the > > same order every time you refresh the view, they get sorted by their > > checksums. > > One thing that struck me about Eli's example is that it features two key > functions rather than a complex comparison. > > If sort() would accept a sequence of key functions each function could be > used to sort slices that compare equal when using the previous key. > That would basically make it a comparison function, not a key function :) The value of the key function is that it's called exactly once per element, and the result is what you sort by. There's no correlation to any other element. It's effectively this: # sortme.sort(key=keyfunc) keys = sortme.map(keyfunc) keys.sort(keep_in_parallel=sortme) Which is why cmp_to_key is the correct solution to that problem. (For cases where you don't mind pre-calling both key functions, of course, it's equivalent to a single key function that returns a tuple.) ChrisA From rosuav at gmail.com Fri Dec 20 14:45:24 2019 From: rosuav at gmail.com (Chris Angelico) Date: Sat, 21 Dec 2019 06:45:24 +1100 Subject: Problems with "Tarfile.close()" In-Reply-To: References: <24060.48206.298944.886236@woitok.gmail.com> <213f5084-5007-8a87-5ac3-83cbd734e585@stoneleaf.us> <24061.715.861556.185941@woitok.gmail.com> Message-ID: On Sat, Dec 21, 2019 at 6:13 AM Peter Otten <__peter__ at web.de> wrote: > > Dr Rainer Woitok wrote: > > > Ethan, > > > > On Friday, 2019-12-20 07:41:51 -0800, you wrote: > > > >> ... > >> In Python 3 `sys.stdout` is a character interface, not bytes. > > > > Does that mean that with Python 3 "Tarfile" is no longer able to write > > the "tar" file to a pipe? Or is there now another way to write to a > > pipe? And if that new way also worked with Python 2, it would be even > > better ... :-) > > > >> There are a couple solutions to the Python 3 aspect of the problem here: > >> > >> https://stackoverflow.com/q/908331/208880 > > > > Using "sys.stdout.buffer" seems to work in Python 3 (at least with my > > current rather trivial test case) but does not work in Python 2. Quest- > > ion: what is the cheapest way to retrieve the Python version the script > > is executing in? > > While I didn't look into the stackoverflow page an easy way to get something > that accepts bytes may be > > # untested > stdout = sys.stdout > try: > stdout = stdout.buffer > except AttributeError: > pass This construct can be simplified down to: stdout = getattr(sys.stdout, "buffer", sys.stdout) ChrisA From tjreedy at udel.edu Fri Dec 20 14:54:49 2019 From: tjreedy at udel.edu (Terry Reedy) Date: Fri, 20 Dec 2019 14:54:49 -0500 Subject: Unable to install "collect" via pip3 In-Reply-To: <414413727.1095151.1576826520979@mail.yahoo.com> References: <414413727.1095151.1576826520979.ref@mail.yahoo.com> <414413727.1095151.1576826520979@mail.yahoo.com> Message-ID: On 12/20/2019 2:22 AM, Mahmood Naderan via Python-list wrote: > However, pip3 fails with this error > $ pip3 install --user collect > Collecting collect > Using cached https://files.pythonhosted.org/packages/cf/5e/c0f0f51d081665374a2c219ea4ba23fb1e179b70dded96dc16606786d828/collect-0.1.1.tar.gz > Collecting couchdbkit>=0.5.7 (from collect) > Using cached https://files.pythonhosted.org/packages/a1/13/9e9ff695a385c44f62b4766341b97f2bd8b596962df2a0beabf358468b70/couchdbkit-0.6.5.tar.gz > Complete output from command python setup.py egg_info: > Traceback (most recent call last): > File "", line 1, in > File "/tmp/pip-build-qf95n0tt/couchdbkit/setup.py", line 25, in > long_description = file( > NameError: name 'file' is not defined The builtin function 'file' does not exist in 3.x. pip3 is trying to install 2.x code. I suspect this is because the package is not properly labelled as 2.x only. (Some 2.x code will run unaltered on 3.x, so pip should try even if 3.x is not specified.) -- Terry Jan Reedy From barry at barrys-emacs.org Fri Dec 20 17:10:24 2019 From: barry at barrys-emacs.org (Barry) Date: Fri, 20 Dec 2019 22:10:24 +0000 Subject: Unable to install "collect" via pip3 In-Reply-To: <414413727.1095151.1576826520979@mail.yahoo.com> References: <414413727.1095151.1576826520979@mail.yahoo.com> Message-ID: > On 20 Dec 2019, at 15:27, Mahmood Naderan via Python-list wrote: > > ?Hi > > I can install collect with pip for python2.7 > $ pip install --user collect > Collecting collect > Using cached https://files.pythonhosted.org/packages/cf/5e/c0f0f51d081665374a2c219ea4ba23fb1e179b70dded96dc16606786d828/collect-0.1.1.tar.gz > Collecting couchdbkit>=0.5.7 (from collect) > Using cached https://files.pythonhosted.org/packages/a1/13/9e9ff695a385c44f62b4766341b97f2bd8b596962df2a0beabf358468b70/couchdbkit-0.6.5.tar.gz > Collecting restkit>=4.2.2 (from couchdbkit>=0.5.7->collect) > Downloading https://files.pythonhosted.org/packages/76/b9/d90120add1be718f853c53008cf5b62d74abad1d32bd1e7097dd913ae053/restkit-4.2.2.tar.gz (1.3MB) > 100% |????????????????????????????????| 1.3MB 633kB/s > Collecting http-parser>=0.8.3 (from restkit>=4.2.2->couchdbkit>=0.5.7->collect) > Downloading https://files.pythonhosted.org/packages/07/c4/22e3c76c2313c26dd5f84f1205b916ff38ea951aab0c4544b6e2f5920d64/http-parser-0.8.3.tar.gz (83kB) > 100% |????????????????????????????????| 92kB 2.4MB/s > Collecting socketpool>=0.5.3 (from restkit>=4.2.2->couchdbkit>=0.5.7->collect) > Downloading https://files.pythonhosted.org/packages/d1/39/fae99a735227234ffec389b252c6de2bc7816bf627f56b4c558dc46c85aa/socketpool-0.5.3.tar.gz > Building wheels for collected packages: collect, couchdbkit, restkit, http-parser, socketpool > Running setup.py bdist_wheel for collect ... done > Stored in directory: /home/mnaderan/.cache/pip/wheels/b9/7c/7c/b09b334cc0e27b4f63ee9f6f19ca1f3db8672666a7e0f3d9cd > Running setup.py bdist_wheel for couchdbkit ... done > Stored in directory: /home/mnaderan/.cache/pip/wheels/f6/05/1b/f8f576ef18564bc68ab6e64f405e1263448036208cafb221e0 > Running setup.py bdist_wheel for restkit ... done > Stored in directory: /home/mnaderan/.cache/pip/wheels/48/c5/32/d0d25fb272791a68c49c26150f332d9b9492d0bc9ea0cdd2c7 > Running setup.py bdist_wheel for http-parser ... done > Stored in directory: /home/mnaderan/.cache/pip/wheels/22/db/06/cb609a3345e7aa87206de160f00cc6af364650d1139d904a25 > Running setup.py bdist_wheel for socketpool ... done > Stored in directory: /home/mnaderan/.cache/pip/wheels/93/f6/8c/65924848766618647078cb66b1d964e8b80876536e84517469 > Successfully built collect couchdbkit restkit http-parser socketpool > Installing collected packages: http-parser, socketpool, restkit, couchdbkit, collect > Successfully installed collect-0.1.1 couchdbkit-0.6.5 http-parser-0.8.3 restkit-4.2.2 socketpool-0.5.3 > However, pip3 fails with this error > $ pip3 install --user collect > Collecting collect > Using cached https://files.pythonhosted.org/packages/cf/5e/c0f0f51d081665374a2c219ea4ba23fb1e179b70dded96dc16606786d828/collect-0.1.1.tar.gz > Collecting couchdbkit>=0.5.7 (from collect) > Using cached https://files.pythonhosted.org/packages/a1/13/9e9ff695a385c44f62b4766341b97f2bd8b596962df2a0beabf358468b70/couchdbkit-0.6.5.tar.gz > Complete output from command python setup.py egg_info: > Traceback (most recent call last): > File "", line 1, in > File "/tmp/pip-build-qf95n0tt/couchdbkit/setup.py", line 25, in > long_description = file( > NameError: name 'file' is not defined My guess is that file is python 2 only. Couchdbkit needs porting to python 3. Barry > > ---------------------------------------- > Command "python setup.py egg_info" failed with error code 1 in /tmp/pip-build-qf95n0tt/couchdbkit/ > I can not figure out what is the problem. Any way to fix that? > > More info: > $ which python > /usr/bin/python > $ ls -l /usr/bin/python > lrwxrwxrwx 1 root root 9 Apr 16 2018 /usr/bin/python -> python2.7 > $ which python3 > /usr/bin/python3 > $ ls -l /usr/bin/python3 > lrwxrwxrwx 1 root root 9 Jun 21 2018 /usr/bin/python3 -> python3.6 > > > > Regards, > Mahmood > -- > https://mail.python.org/mailman/listinfo/python-list From greg.ewing at canterbury.ac.nz Fri Dec 20 20:50:45 2019 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sat, 21 Dec 2019 14:50:45 +1300 Subject: How to extend an object? In-Reply-To: References:

content of e-mail