[New-bugs-announce] [issue44289] tarfile.is_tarfile() modifies file object's current position

Andrzej Mateja report at bugs.python.org
Wed Jun 2 11:09:05 EDT 2021


New submission from Andrzej Mateja <mateja.and at gmail.com>:

Since Python 3.9 tarfile.is_tarfile accepts not only paths but also files and file-like objects (bpo-29435).

Verification if a file or file-like object is a tar file modifies file object's current position.

Imagine a function listing names of all tar archive members but checking first if this is a valid tar archive. When its argument is a str or pathlib.Path this is quite straightforward. If the argument is a file of file-like object then current position must be reset or TarFile.getmembers() returns empty list.

import tarfile


def list_tar(archive):
    if tarfile.is_tarfile(archive):
        kwargs = {'fileobj' if hasattr(archive, 'read') else 'name': archive}
        t = tarfile.open(**kwargs)
        return [member.name for member in t.getmembers()]
    return []


if __name__ == '__main__':
    path = 'archive.tar.gz'
    print(list_tar(path))
    print(list_tar(open(path, 'rb')))


['spam.py', 'ham.py', 'bacon.py', 'eggs.py']
[]

----------
components: Library (Lib)
messages: 394922
nosy: mateja.and
priority: normal
severity: normal
status: open
title: tarfile.is_tarfile() modifies file object's current position
type: behavior
versions: Python 3.10, Python 3.11, Python 3.9

_______________________________________
Python tracker <report at bugs.python.org>
<https://bugs.python.org/issue44289>
_______________________________________


More information about the New-bugs-announce mailing list