HTML parser/tokeniser based for HTML5
A python based HTML parser/tokenizer based on the WHATWG HTML5 specification for maximum compatibility with major desktop web browsers.
Homepage: https://github.com/html5lib/html5lib-python/
Categories: devel, www, textproc