Changelog picked from https://github.com/tdewolff/parse/releases
v2.4.2
Add many hashes for CSS
Add amp-boilerplate hash for HTML
NewError now has fmt-style function signature
Append 0x00 to signal EOF even if source already ends in 0x00 (which may be valid)
v2.4.1
CSS: add Invert and Solid hashes
v2.4.0
XML and HTML: Text() []byte returns nil for start tag closers
XML and HTML: Text() []byte returns textual content for TextToken, CommentToken, ...
Added Offset() int for all lexers and parsers that returns the current character offset in the input stream
CSS: EOF after \ now properly handled as DelimToken and not EscapeToken in some cases
v2.3.15
Bugfix: bad URL encoding resulted in no decoding at all, the new EncodeURL and DecodeURL are faster and never fail
Bugfix: get correct Position if it is in the middle of a unicode codepoint
v2.3.14
Re-parse input immediately when encountering parse error. Previously this was only done if the error was actually read which would save us from re-parsing the file (i.e. better performance). However, an error is (a) rare and (b) happens only once per file. Re-parsing on errors does not impact the performance of well formatted files, and a reliable error message is valued more.
Bugfix: prevent infinite loop on error on input file that contains unicode code points (i.e. any character bigger than 0xC0 usually followed by more bytes).
Improve error messages for parsers to include parser name and print offending byte(s)
v2.3.13
Improve performance of ReplaceMultipleWhitespace by 20%--25%
Add ReplaceEntities and ReplaceMultipleWhitespaceAndEntities to replace QuoteEntity. These allow to do replacements such as: " => ", " => ", " => ", ’ => ’, ' => '.
Update list of HTML entity rewrites
v2.3.12
Revert v2.3.11 and readd html.Keygen hash.
v2.3.11
Add ReplaceEntities and remove entity replacement from EscapeAttrVal
v2.3.10
Add and remove hashes for CSS and HTML
v2.3.9
CSS: keep whitespace in unknown at rules
v2.3.8
Remove import comments
Fix bug in CSS parsing when encountering } in CSS inline
v2.3.7
Fix position information on errors
CSS: fix error position
v2.3.6
Prevent panic when returning Bytes() on empty buffer
Set proper context in errors
Limit the length in context in errors
In EqualFold only match upper/lower case on alphabet characters
Add option to EscapeAttrVal to handle attribute values for XML (use double quotes)
HTML: add hashes for RDFa attribute names
JS: accept NULL characters as code
JSON: report errors NULL characters
The buffer package specifies a canonical import path, but elsewhere in
the code of the package, it is imported with a different path (containing
/v2/). This has been broken since Go 1.12.
This package contains several lexers and parsers written in Go.
All subpackages are built to be streaming, high performance and to
be in accordance with the official (latest) specifications.
The lexers are implemented using buffer.Lexer in
https://github.com/tdewolff/parse/buffer and the parsers work on
top of the lexers. Some subpackages have hashes defined (using
Hasher) that speed up common byte-slice comparisons.