html5-parser 0.4.11-ok1.1 source package in openKylin
Changelog
html5-parser (0.4.11-ok1.1) nile; urgency=medium

  * Rebuild for python 3.12.

 -- rtlhq <nobelxyz@163.com>  Sat, 25 May 2024 10:07:16 +0000
File | Size | SHA-256 Checksum |
---|---|---|
html5-parser_0.4.11.orig.tar.gz | 264.4 KiB | 85bb16faa3cdf3c6c6842e0cb2ce02807cbaefc6e3bb0f3b8e18dabe51070763 |
html5-parser_0.4.11-ok1.1.debian.tar.xz | 2.4 KiB | cfda7afc3324d7a624fa35a1da8841465500d6fe5e61461cb35b6af4ae8f9a74 |
html5-parser_0.4.11-ok1.1.dsc | 1.8 KiB | 94ec71b38294d1e3ae3269bb72925a3caaa8eb5126b56b2df00cd23b05458f37 |
A fast implementation of the HTML 5 parsing spec for Python. Parsing is
done in C using a variant of the gumbo parser. The gumbo parse tree is
then transformed into an lxml tree, also in C, yielding parse times that
can be a thirtieth of those of html5lib, a speedup of up to 30x.
This differs, for instance, from the gumbo Python bindings, where the
initial parsing is done in C but the transformation into the final
tree is done in Python.
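
As a rough illustration of the description above, the sketch below parses a small document and reads a value from the resulting lxml tree. It assumes the `html5_parser` module provided by this package is installed and uses its `parse` function with default arguments; `extract_title` is a hypothetical helper written for this example, and it returns `None` when the module is unavailable.

```python
try:
    from html5_parser import parse
except ImportError:
    # The package is not installed in this environment.
    parse = None


def extract_title(html):
    """Return the text of the <title> element, or None if absent or unparsable here."""
    if parse is None:
        return None
    # With default arguments, parse() returns the root <html> element of an lxml tree.
    root = parse(html)
    title = root.find(".//title")
    return title.text if title is not None else None


if __name__ == "__main__":
    print(extract_title("<html><head><title>Hello</title></head><body></body></html>"))
```

Because the result is an ordinary lxml element, the usual lxml calls such as `find`, `iter`, and `xpath` apply to it without further conversion.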