py-chardet: update to 5.0.0.
This release is the first release of chardet that no longer
supports Python < 3.6.
In addition to that change, it features the following user-facing
changes:
Added a prober for Johab Korean (#207, @grizlupo)
Added a prober for UTF-16/32 BE/LE (#109, #206, @jpz)
Added test data for Croatian, Czech, Hungarian, Polish, Slovak,
Slovene, Greek, and Turkish, which should help prevent future
errors with those languages
Improved XML tag filtering, which should improve accuracy for
XML files (#208)
Tweaked SingleByteCharSetProber confidence to match latest
uchardet (#209)
Made detect_all return child prober confidences (#210)
Updated examples in docs (#223, @domdfcoding)
Documentation fixes (#212, #224, #225, #226, #220, #221, #244
from too many to mention)
Minor performance improvements (#252, @deedy5)
Add support for Python 3.10 when testing (#232, @jdufresne)
Lots of little development cycle improvements, mostly thanks
to @jdufresne
2022-07-03 14:45:17 +02:00
|
|
|
@comment $NetBSD: PLIST,v 1.11 2022/07/03 12:45:17 wiz Exp $
|
2016-02-05 13:40:56 +01:00
|
|
|
bin/chardetect-${PYVERSSUFFIX}
|
2012-10-20 15:14:48 +02:00
|
|
|
${PYSITELIB}/${EGG_INFODIR}/PKG-INFO
|
|
|
|
${PYSITELIB}/${EGG_INFODIR}/SOURCES.txt
|
|
|
|
${PYSITELIB}/${EGG_INFODIR}/dependency_links.txt
|
2014-01-02 18:55:40 +01:00
|
|
|
${PYSITELIB}/${EGG_INFODIR}/entry_points.txt
|
2012-10-20 15:14:48 +02:00
|
|
|
${PYSITELIB}/${EGG_INFODIR}/top_level.txt
|
2009-11-17 15:29:32 +01:00
|
|
|
${PYSITELIB}/chardet/__init__.py
|
|
|
|
${PYSITELIB}/chardet/__init__.pyc
|
|
|
|
${PYSITELIB}/chardet/__init__.pyo
|
2008-10-08 11:38:46 +02:00
|
|
|
${PYSITELIB}/chardet/big5freq.py
|
|
|
|
${PYSITELIB}/chardet/big5freq.pyc
|
|
|
|
${PYSITELIB}/chardet/big5freq.pyo
|
|
|
|
${PYSITELIB}/chardet/big5prober.py
|
|
|
|
${PYSITELIB}/chardet/big5prober.pyc
|
|
|
|
${PYSITELIB}/chardet/big5prober.pyo
|
|
|
|
${PYSITELIB}/chardet/chardistribution.py
|
|
|
|
${PYSITELIB}/chardet/chardistribution.pyc
|
|
|
|
${PYSITELIB}/chardet/chardistribution.pyo
|
|
|
|
${PYSITELIB}/chardet/charsetgroupprober.py
|
|
|
|
${PYSITELIB}/chardet/charsetgroupprober.pyc
|
|
|
|
${PYSITELIB}/chardet/charsetgroupprober.pyo
|
|
|
|
${PYSITELIB}/chardet/charsetprober.py
|
|
|
|
${PYSITELIB}/chardet/charsetprober.pyc
|
|
|
|
${PYSITELIB}/chardet/charsetprober.pyo
|
Updated py-chardet to 3.0.2.
chardet 3.0.2
Fixes an issue where detect would sometimes return None instead of a dict with the keys encoding, language, and confidence (Issue #113, PR #114).
chardet 3.0.1
This bugfix release fixes a crash in the EUC-TW prober when it encountered certain strings (Issue #67).
chardet 3.0.0
This release is long overdue, but still mostly serves as a placeholder
for the impending 4.0.0 release, which will have retrained models
for better accuracy. For now, this release will get the following
improvements up on PyPI:
Added support for Turkish ISO-8859-9 detection (PR #41, thanks @queeup)
Commented out large unused sections of Big5 and EUC-KR tables to save memory (8bc4b89)
Removed Python 3.2 from testing, but add 3.4 - 3.6
Ensure that stdin is open with mode 'rb' for chardetect CLI. (PR #38, thanks @lpsinger)
Fixed chardetect crash with non-ascii file names (PR #39, thanks @nkanaev)
Made naming conventions more Pythonic throughout (no more mTypicalPositiveRatio, and instead typical_positive_ratio)
Modernized test scripts and infrastructure so we've got Travis testing and all that stuff
Rename filter_without_english_words to filter_international_words and make it match current Mozilla implementation (PR #44, thanks @rsnair2)
Updated filter_english_letters to match C implementation (c665459)
Temporarily disabled Hungarian ISO-8859-2 and Windows-1250 detection because it is very inaccurate (da6c0a0)
Allow CLI sub-package to be importable (PR #55)
Add a hypotheis-based test (PR #66, thanks @DRMacIver)
Strip endianness from UTF with BOM predictions so that the encoding can be passed directly to bytes.decode() (PR #73, thanks @snoack)
Fixed broken links in docs (PR #90, thanks @roskakori)
Added early exit to chardetect when encoding is detected instead of looping through entire file (PR #103, thanks @jpz)
Use bytearray objects internally instead of wrap_ord calls, which provides a nice performance boost across the board (PR #106)
Add language property to probers and UniversalDetector results (PR #180)
Mark the 5 known test failures as such so we can have more useful Travis build results in the meantime (d588407)
2017-04-19 19:24:16 +02:00
|
|
|
${PYSITELIB}/chardet/cli/__init__.py
|
|
|
|
${PYSITELIB}/chardet/cli/__init__.pyc
|
|
|
|
${PYSITELIB}/chardet/cli/__init__.pyo
|
|
|
|
${PYSITELIB}/chardet/cli/chardetect.py
|
|
|
|
${PYSITELIB}/chardet/cli/chardetect.pyc
|
|
|
|
${PYSITELIB}/chardet/cli/chardetect.pyo
|
2008-10-08 11:38:46 +02:00
|
|
|
${PYSITELIB}/chardet/codingstatemachine.py
|
|
|
|
${PYSITELIB}/chardet/codingstatemachine.pyc
|
|
|
|
${PYSITELIB}/chardet/codingstatemachine.pyo
|
2014-01-02 18:55:40 +01:00
|
|
|
${PYSITELIB}/chardet/cp949prober.py
|
|
|
|
${PYSITELIB}/chardet/cp949prober.pyc
|
|
|
|
${PYSITELIB}/chardet/cp949prober.pyo
|
Updated py-chardet to 3.0.2.
chardet 3.0.2
Fixes an issue where detect would sometimes return None instead of a dict with the keys encoding, language, and confidence (Issue #113, PR #114).
chardet 3.0.1
This bugfix release fixes a crash in the EUC-TW prober when it encountered certain strings (Issue #67).
chardet 3.0.0
This release is long overdue, but still mostly serves as a placeholder
for the impending 4.0.0 release, which will have retrained models
for better accuracy. For now, this release will get the following
improvements up on PyPI:
Added support for Turkish ISO-8859-9 detection (PR #41, thanks @queeup)
Commented out large unused sections of Big5 and EUC-KR tables to save memory (8bc4b89)
Removed Python 3.2 from testing, but add 3.4 - 3.6
Ensure that stdin is open with mode 'rb' for chardetect CLI. (PR #38, thanks @lpsinger)
Fixed chardetect crash with non-ascii file names (PR #39, thanks @nkanaev)
Made naming conventions more Pythonic throughout (no more mTypicalPositiveRatio, and instead typical_positive_ratio)
Modernized test scripts and infrastructure so we've got Travis testing and all that stuff
Rename filter_without_english_words to filter_international_words and make it match current Mozilla implementation (PR #44, thanks @rsnair2)
Updated filter_english_letters to match C implementation (c665459)
Temporarily disabled Hungarian ISO-8859-2 and Windows-1250 detection because it is very inaccurate (da6c0a0)
Allow CLI sub-package to be importable (PR #55)
Add a hypotheis-based test (PR #66, thanks @DRMacIver)
Strip endianness from UTF with BOM predictions so that the encoding can be passed directly to bytes.decode() (PR #73, thanks @snoack)
Fixed broken links in docs (PR #90, thanks @roskakori)
Added early exit to chardetect when encoding is detected instead of looping through entire file (PR #103, thanks @jpz)
Use bytearray objects internally instead of wrap_ord calls, which provides a nice performance boost across the board (PR #106)
Add language property to probers and UniversalDetector results (PR #180)
Mark the 5 known test failures as such so we can have more useful Travis build results in the meantime (d588407)
2017-04-19 19:24:16 +02:00
|
|
|
${PYSITELIB}/chardet/enums.py
|
|
|
|
${PYSITELIB}/chardet/enums.pyc
|
|
|
|
${PYSITELIB}/chardet/enums.pyo
|
2008-10-08 11:38:46 +02:00
|
|
|
${PYSITELIB}/chardet/escprober.py
|
|
|
|
${PYSITELIB}/chardet/escprober.pyc
|
|
|
|
${PYSITELIB}/chardet/escprober.pyo
|
|
|
|
${PYSITELIB}/chardet/escsm.py
|
|
|
|
${PYSITELIB}/chardet/escsm.pyc
|
|
|
|
${PYSITELIB}/chardet/escsm.pyo
|
|
|
|
${PYSITELIB}/chardet/eucjpprober.py
|
|
|
|
${PYSITELIB}/chardet/eucjpprober.pyc
|
|
|
|
${PYSITELIB}/chardet/eucjpprober.pyo
|
|
|
|
${PYSITELIB}/chardet/euckrfreq.py
|
|
|
|
${PYSITELIB}/chardet/euckrfreq.pyc
|
|
|
|
${PYSITELIB}/chardet/euckrfreq.pyo
|
|
|
|
${PYSITELIB}/chardet/euckrprober.py
|
|
|
|
${PYSITELIB}/chardet/euckrprober.pyc
|
|
|
|
${PYSITELIB}/chardet/euckrprober.pyo
|
|
|
|
${PYSITELIB}/chardet/euctwfreq.py
|
|
|
|
${PYSITELIB}/chardet/euctwfreq.pyc
|
|
|
|
${PYSITELIB}/chardet/euctwfreq.pyo
|
|
|
|
${PYSITELIB}/chardet/euctwprober.py
|
|
|
|
${PYSITELIB}/chardet/euctwprober.pyc
|
|
|
|
${PYSITELIB}/chardet/euctwprober.pyo
|
|
|
|
${PYSITELIB}/chardet/gb2312freq.py
|
|
|
|
${PYSITELIB}/chardet/gb2312freq.pyc
|
|
|
|
${PYSITELIB}/chardet/gb2312freq.pyo
|
|
|
|
${PYSITELIB}/chardet/gb2312prober.py
|
|
|
|
${PYSITELIB}/chardet/gb2312prober.pyc
|
|
|
|
${PYSITELIB}/chardet/gb2312prober.pyo
|
|
|
|
${PYSITELIB}/chardet/hebrewprober.py
|
|
|
|
${PYSITELIB}/chardet/hebrewprober.pyc
|
|
|
|
${PYSITELIB}/chardet/hebrewprober.pyo
|
|
|
|
${PYSITELIB}/chardet/jisfreq.py
|
|
|
|
${PYSITELIB}/chardet/jisfreq.pyc
|
|
|
|
${PYSITELIB}/chardet/jisfreq.pyo
|
py-chardet: update to 5.0.0.
This release is the first release of chardet that no longer
supports Python < 3.6.
In addition to that change, it features the following user-facing
changes:
Added a prober for Johab Korean (#207, @grizlupo)
Added a prober for UTF-16/32 BE/LE (#109, #206, @jpz)
Added test data for Croatian, Czech, Hungarian, Polish, Slovak,
Slovene, Greek, and Turkish, which should help prevent future
errors with those languages
Improved XML tag filtering, which should improve accuracy for
XML files (#208)
Tweaked SingleByteCharSetProber confidence to match latest
uchardet (#209)
Made detect_all return child prober confidences (#210)
Updated examples in docs (#223, @domdfcoding)
Documentation fixes (#212, #224, #225, #226, #220, #221, #244
from too many to mention)
Minor performance improvements (#252, @deedy5)
Add support for Python 3.10 when testing (#232, @jdufresne)
Lots of little development cycle improvements, mostly thanks
to @jdufresne
2022-07-03 14:45:17 +02:00
|
|
|
${PYSITELIB}/chardet/johabfreq.py
|
|
|
|
${PYSITELIB}/chardet/johabfreq.pyc
|
|
|
|
${PYSITELIB}/chardet/johabfreq.pyo
|
|
|
|
${PYSITELIB}/chardet/johabprober.py
|
|
|
|
${PYSITELIB}/chardet/johabprober.pyc
|
|
|
|
${PYSITELIB}/chardet/johabprober.pyo
|
2008-10-08 11:38:46 +02:00
|
|
|
${PYSITELIB}/chardet/jpcntx.py
|
|
|
|
${PYSITELIB}/chardet/jpcntx.pyc
|
|
|
|
${PYSITELIB}/chardet/jpcntx.pyo
|
|
|
|
${PYSITELIB}/chardet/langbulgarianmodel.py
|
|
|
|
${PYSITELIB}/chardet/langbulgarianmodel.pyc
|
|
|
|
${PYSITELIB}/chardet/langbulgarianmodel.pyo
|
|
|
|
${PYSITELIB}/chardet/langgreekmodel.py
|
|
|
|
${PYSITELIB}/chardet/langgreekmodel.pyc
|
|
|
|
${PYSITELIB}/chardet/langgreekmodel.pyo
|
|
|
|
${PYSITELIB}/chardet/langhebrewmodel.py
|
|
|
|
${PYSITELIB}/chardet/langhebrewmodel.pyc
|
|
|
|
${PYSITELIB}/chardet/langhebrewmodel.pyo
|
|
|
|
${PYSITELIB}/chardet/langhungarianmodel.py
|
|
|
|
${PYSITELIB}/chardet/langhungarianmodel.pyc
|
|
|
|
${PYSITELIB}/chardet/langhungarianmodel.pyo
|
2021-01-04 12:53:14 +01:00
|
|
|
${PYSITELIB}/chardet/langrussianmodel.py
|
|
|
|
${PYSITELIB}/chardet/langrussianmodel.pyc
|
|
|
|
${PYSITELIB}/chardet/langrussianmodel.pyo
|
2008-10-08 11:38:46 +02:00
|
|
|
${PYSITELIB}/chardet/langthaimodel.py
|
|
|
|
${PYSITELIB}/chardet/langthaimodel.pyc
|
|
|
|
${PYSITELIB}/chardet/langthaimodel.pyo
|
Updated py-chardet to 3.0.2.
chardet 3.0.2
Fixes an issue where detect would sometimes return None instead of a dict with the keys encoding, language, and confidence (Issue #113, PR #114).
chardet 3.0.1
This bugfix release fixes a crash in the EUC-TW prober when it encountered certain strings (Issue #67).
chardet 3.0.0
This release is long overdue, but still mostly serves as a placeholder
for the impending 4.0.0 release, which will have retrained models
for better accuracy. For now, this release will get the following
improvements up on PyPI:
Added support for Turkish ISO-8859-9 detection (PR #41, thanks @queeup)
Commented out large unused sections of Big5 and EUC-KR tables to save memory (8bc4b89)
Removed Python 3.2 from testing, but add 3.4 - 3.6
Ensure that stdin is open with mode 'rb' for chardetect CLI. (PR #38, thanks @lpsinger)
Fixed chardetect crash with non-ascii file names (PR #39, thanks @nkanaev)
Made naming conventions more Pythonic throughout (no more mTypicalPositiveRatio, and instead typical_positive_ratio)
Modernized test scripts and infrastructure so we've got Travis testing and all that stuff
Rename filter_without_english_words to filter_international_words and make it match current Mozilla implementation (PR #44, thanks @rsnair2)
Updated filter_english_letters to match C implementation (c665459)
Temporarily disabled Hungarian ISO-8859-2 and Windows-1250 detection because it is very inaccurate (da6c0a0)
Allow CLI sub-package to be importable (PR #55)
Add a hypotheis-based test (PR #66, thanks @DRMacIver)
Strip endianness from UTF with BOM predictions so that the encoding can be passed directly to bytes.decode() (PR #73, thanks @snoack)
Fixed broken links in docs (PR #90, thanks @roskakori)
Added early exit to chardetect when encoding is detected instead of looping through entire file (PR #103, thanks @jpz)
Use bytearray objects internally instead of wrap_ord calls, which provides a nice performance boost across the board (PR #106)
Add language property to probers and UniversalDetector results (PR #180)
Mark the 5 known test failures as such so we can have more useful Travis build results in the meantime (d588407)
2017-04-19 19:24:16 +02:00
|
|
|
${PYSITELIB}/chardet/langturkishmodel.py
|
|
|
|
${PYSITELIB}/chardet/langturkishmodel.pyc
|
|
|
|
${PYSITELIB}/chardet/langturkishmodel.pyo
|
2008-10-08 11:38:46 +02:00
|
|
|
${PYSITELIB}/chardet/latin1prober.py
|
|
|
|
${PYSITELIB}/chardet/latin1prober.pyc
|
|
|
|
${PYSITELIB}/chardet/latin1prober.pyo
|
|
|
|
${PYSITELIB}/chardet/mbcharsetprober.py
|
|
|
|
${PYSITELIB}/chardet/mbcharsetprober.pyc
|
|
|
|
${PYSITELIB}/chardet/mbcharsetprober.pyo
|
|
|
|
${PYSITELIB}/chardet/mbcsgroupprober.py
|
|
|
|
${PYSITELIB}/chardet/mbcsgroupprober.pyc
|
|
|
|
${PYSITELIB}/chardet/mbcsgroupprober.pyo
|
|
|
|
${PYSITELIB}/chardet/mbcssm.py
|
|
|
|
${PYSITELIB}/chardet/mbcssm.pyc
|
|
|
|
${PYSITELIB}/chardet/mbcssm.pyo
|
2021-01-04 12:53:14 +01:00
|
|
|
${PYSITELIB}/chardet/metadata/__init__.py
|
|
|
|
${PYSITELIB}/chardet/metadata/__init__.pyc
|
|
|
|
${PYSITELIB}/chardet/metadata/__init__.pyo
|
|
|
|
${PYSITELIB}/chardet/metadata/languages.py
|
|
|
|
${PYSITELIB}/chardet/metadata/languages.pyc
|
|
|
|
${PYSITELIB}/chardet/metadata/languages.pyo
|
2008-10-08 11:38:46 +02:00
|
|
|
${PYSITELIB}/chardet/sbcharsetprober.py
|
|
|
|
${PYSITELIB}/chardet/sbcharsetprober.pyc
|
|
|
|
${PYSITELIB}/chardet/sbcharsetprober.pyo
|
|
|
|
${PYSITELIB}/chardet/sbcsgroupprober.py
|
|
|
|
${PYSITELIB}/chardet/sbcsgroupprober.pyc
|
|
|
|
${PYSITELIB}/chardet/sbcsgroupprober.pyo
|
|
|
|
${PYSITELIB}/chardet/sjisprober.py
|
|
|
|
${PYSITELIB}/chardet/sjisprober.pyc
|
|
|
|
${PYSITELIB}/chardet/sjisprober.pyo
|
|
|
|
${PYSITELIB}/chardet/universaldetector.py
|
|
|
|
${PYSITELIB}/chardet/universaldetector.pyc
|
|
|
|
${PYSITELIB}/chardet/universaldetector.pyo
|
py-chardet: update to 5.0.0.
This release is the first release of chardet that no longer
supports Python < 3.6.
In addition to that change, it features the following user-facing
changes:
Added a prober for Johab Korean (#207, @grizlupo)
Added a prober for UTF-16/32 BE/LE (#109, #206, @jpz)
Added test data for Croatian, Czech, Hungarian, Polish, Slovak,
Slovene, Greek, and Turkish, which should help prevent future
errors with those languages
Improved XML tag filtering, which should improve accuracy for
XML files (#208)
Tweaked SingleByteCharSetProber confidence to match latest
uchardet (#209)
Made detect_all return child prober confidences (#210)
Updated examples in docs (#223, @domdfcoding)
Documentation fixes (#212, #224, #225, #226, #220, #221, #244
from too many to mention)
Minor performance improvements (#252, @deedy5)
Add support for Python 3.10 when testing (#232, @jdufresne)
Lots of little development cycle improvements, mostly thanks
to @jdufresne
2022-07-03 14:45:17 +02:00
|
|
|
${PYSITELIB}/chardet/utf1632prober.py
|
|
|
|
${PYSITELIB}/chardet/utf1632prober.pyc
|
|
|
|
${PYSITELIB}/chardet/utf1632prober.pyo
|
2008-10-08 11:38:46 +02:00
|
|
|
${PYSITELIB}/chardet/utf8prober.py
|
|
|
|
${PYSITELIB}/chardet/utf8prober.pyc
|
|
|
|
${PYSITELIB}/chardet/utf8prober.pyo
|
Updated py-chardet to 3.0.2.
chardet 3.0.2
Fixes an issue where detect would sometimes return None instead of a dict with the keys encoding, language, and confidence (Issue #113, PR #114).
chardet 3.0.1
This bugfix release fixes a crash in the EUC-TW prober when it encountered certain strings (Issue #67).
chardet 3.0.0
This release is long overdue, but still mostly serves as a placeholder
for the impending 4.0.0 release, which will have retrained models
for better accuracy. For now, this release will get the following
improvements up on PyPI:
Added support for Turkish ISO-8859-9 detection (PR #41, thanks @queeup)
Commented out large unused sections of Big5 and EUC-KR tables to save memory (8bc4b89)
Removed Python 3.2 from testing, but add 3.4 - 3.6
Ensure that stdin is open with mode 'rb' for chardetect CLI. (PR #38, thanks @lpsinger)
Fixed chardetect crash with non-ascii file names (PR #39, thanks @nkanaev)
Made naming conventions more Pythonic throughout (no more mTypicalPositiveRatio, and instead typical_positive_ratio)
Modernized test scripts and infrastructure so we've got Travis testing and all that stuff
Rename filter_without_english_words to filter_international_words and make it match current Mozilla implementation (PR #44, thanks @rsnair2)
Updated filter_english_letters to match C implementation (c665459)
Temporarily disabled Hungarian ISO-8859-2 and Windows-1250 detection because it is very inaccurate (da6c0a0)
Allow CLI sub-package to be importable (PR #55)
Add a hypotheis-based test (PR #66, thanks @DRMacIver)
Strip endianness from UTF with BOM predictions so that the encoding can be passed directly to bytes.decode() (PR #73, thanks @snoack)
Fixed broken links in docs (PR #90, thanks @roskakori)
Added early exit to chardetect when encoding is detected instead of looping through entire file (PR #103, thanks @jpz)
Use bytearray objects internally instead of wrap_ord calls, which provides a nice performance boost across the board (PR #106)
Add language property to probers and UniversalDetector results (PR #180)
Mark the 5 known test failures as such so we can have more useful Travis build results in the meantime (d588407)
2017-04-19 19:24:16 +02:00
|
|
|
${PYSITELIB}/chardet/version.py
|
|
|
|
${PYSITELIB}/chardet/version.pyc
|
|
|
|
${PYSITELIB}/chardet/version.pyo
|