QXmlStreamReader: Raise error on unexpected tokens
authorDebian Qt/KDE Maintainers <debian-qt-kde@lists.debian.org>
Thu, 27 Jul 2023 20:01:32 +0000 (21:01 +0100)
committerDmitry Shachnev <mitya57@debian.org>
Thu, 27 Jul 2023 20:01:32 +0000 (21:01 +0100)
commit680023e3c6500644c6b49b1760dcdece7ff9c9ca
treeef2ce2a5edb036ece6232d37eef0e36726d86741
parent2dc7e6192d117fa2a1992a31165327fe937c60b2
QXmlStreamReader: Raise error on unexpected tokens

Origin: upstream, https://download.qt.io/official_releases/qt/5.15/CVE-2023-38197-qtbase-5.15.diff
Last-Update: 2023-07-15

QXmlStreamReader accepted multiple DOCTYPE elements, containing DTD
fragments in the XML prolog, and in the XML body.
Well-formed but invalid XML files - with multiple DTD fragments in
prolog and body, combined with recursive entity expansions - have
caused infinite loops in QXmlStreamReader.

This patch implements a token check in QXmlStreamReader.
A stream is allowed to start with an XML prolog. StartDocument
and DOCTYPE elements are only allowed in this prolog, which
may also contain ProcessingInstruction and Comment elements.
As soon as anything else is seen, the prolog ends.
After that, the prolog-specific elements are treated as unexpected.
Furthermore, the prolog can contain at most one DOCTYPE element.

Update the documentation to reflect the new behavior.
Add an autotest that checks the new error cases are correctly detected,
and no error is raised for legitimate input.

The original OSS-Fuzz files (see bug reports) are not included in this
patch for file size reasons. They have been tested manually. Each of
them has more than one DOCTYPE element, causing infinite loops in
recursive entity expansions. The newly implemented functionality
detects those invalid DTD fragments. By raising an error, it aborts
stream reading before an infinite loop occurs.

Thanks to OSS-Fuzz for finding this.

Gbp-Pq: Name CVE-2023-38197.diff
src/corelib/serialization/qxmlstream.cpp
src/corelib/serialization/qxmlstream_p.h
tests/auto/corelib/serialization/qxmlstream/tokenError/dtdInBody.xml [new file with mode: 0644]
tests/auto/corelib/serialization/qxmlstream/tokenError/multipleDtd.xml [new file with mode: 0644]
tests/auto/corelib/serialization/qxmlstream/tokenError/wellFormed.xml [new file with mode: 0644]
tests/auto/corelib/serialization/qxmlstream/tst_qxmlstream.cpp