Relaxed, or best-efforts JSON parser for Python?
MRAB
python at mrabarnett.plus.com
Sun Oct 11 22:43:57 EDT 2015
On 2015-10-12 01:56, Victor Hooi wrote:
> Hi,
>
> I'm attempting to parse MongoDB loglines.
>
> The formatting of these loglines could best be described as JSON-like...
>
> For example - arrays
>
> Anyhow, say I had the following logline snippet:
>
> { Global: { acquireCount: { r: 2, w: 2 } }, Database: { acquireCount: { w: 2 } }, Collection: { acquireCount: { w: 1 } }, oplog: { acquireCount: { w: 1 } } }
>
> This won't parse with json.loads() - the main issues is the missing quotation marks (") around the strings.
>
> My question, is there a more lenient, or relaxed JSON parser available for Python, that will try to do a best-efforts parsing of non-spec JSON?
>
Have you tried first adding the quotes using the re module?
>>> import json, re
>>> line = '{ Global: { acquireCount: { r: 2, w: 2 } }, Database: {
acquireCount: { w: 2 } }, Collection: { acquireCount: { w: 1 } }, oplog:
{ acquireCount: { w: 1 } } }'
>>> json.loads(re.sub(r'(\w+)', r'"\1"', line))
More information about the Python-list
mailing list