[Python-checkins] bpo-44335: Fix a regression when identifying invalid characters in syntax errors (GH-26589)

miss-islington webhook-mailer at python.org
Tue Jun 8 07:47:05 EDT 2021


https://github.com/python/cpython/commit/933b5b63598968c1ab4976f92570696a33c72cc4
commit: 933b5b63598968c1ab4976f92570696a33c72cc4
branch: 3.10
author: Miss Islington (bot) <31488909+miss-islington at users.noreply.github.com>
committer: miss-islington <31488909+miss-islington at users.noreply.github.com>
date: 2021-06-08T04:46:56-07:00
summary:

bpo-44335: Fix a regression when identifying invalid characters in syntax errors (GH-26589)

(cherry picked from commit d334c73b56756e90c33ce06e3a6ec23271aa099d)

Co-authored-by: Pablo Galindo <Pablogsal at gmail.com>

files:
A Misc/NEWS.d/next/Core and Builtins/2021-06-08-01-13-47.bpo-44335.GQTTkl.rst
M Lib/test/test_exceptions.py
M Parser/pegen.c

diff --git a/Lib/test/test_exceptions.py b/Lib/test/test_exceptions.py
index 5fb651f4c22e5..df5778d7e5f6a 100644
--- a/Lib/test/test_exceptions.py
+++ b/Lib/test/test_exceptions.py
@@ -211,6 +211,7 @@ def testSyntaxErrorOffset(self):
         check('lambda x: x = 2', 1, 1)
         check('f{a + b + c}', 1, 2)
         check('[file for str(file) in []\n])', 2, 2)
+        check('a = « hello » « world »', 1, 5)
         check('[\nfile\nfor str(file)\nin\n[]\n]', 3, 5)
         check('[file for\n str(file) in []]', 2, 2)
         check("ages = {'Alice'=22, 'Bob'=23}", 1, 16)
diff --git a/Misc/NEWS.d/next/Core and Builtins/2021-06-08-01-13-47.bpo-44335.GQTTkl.rst b/Misc/NEWS.d/next/Core and Builtins/2021-06-08-01-13-47.bpo-44335.GQTTkl.rst
new file mode 100644
index 0000000000000..b57904e5da607
--- /dev/null
+++ b/Misc/NEWS.d/next/Core and Builtins/2021-06-08-01-13-47.bpo-44335.GQTTkl.rst	
@@ -0,0 +1,2 @@
+Fix a regression when identifying incorrect characters in syntax errors.
+Patch by Pablo Galindo
diff --git a/Parser/pegen.c b/Parser/pegen.c
index aac7e368a799f..c69a042f8de12 100644
--- a/Parser/pegen.c
+++ b/Parser/pegen.c
@@ -1288,7 +1288,9 @@ _PyPegen_run_parser(Parser *p)
         reset_parser_state(p);
         _PyPegen_parse(p);
         if (PyErr_Occurred()) {
-            if (PyErr_ExceptionMatches(PyExc_SyntaxError)) {
+            // Prioritize tokenizer errors to custom syntax errors raised
+            // on the second phase only if the errors come from the parser.
+            if (p->tok->done != E_ERROR && PyErr_ExceptionMatches(PyExc_SyntaxError)) {
                 _PyPegen_check_tokenizer_errors(p);
             }
             return NULL;
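
A minimal sketch of how the behaviour covered by the new test case could be observed from Python code. The helper name `syntax_error_location` and the `<example>` filename are illustrative assumptions, not part of the commit. The test added above expects the error for this input to be reported at line 1, offset 5, which is the position of the first invalid character.

```python
def syntax_error_location(source):
    """Compile `source`; return (lineno, offset, msg) of the SyntaxError, or None.

    Hypothetical helper for illustration only; it is not part of CPython.
    """
    try:
        compile(source, "<example>", "exec")
    except SyntaxError as exc:
        return exc.lineno, exc.offset, exc.msg
    return None


# Same input as the check() call added to testSyntaxErrorOffset.
location = syntax_error_location("a = « hello » « world »")
print(location)
```

On an interpreter that includes this fix, the reported location should point at the first `«` instead of at a position chosen by the parser's second-phase syntax error rules.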


