Chardet: The Universal Character Encoding Detector
--------------------------------------------------

.. image:: https://img.shields.io/travis/chardet/chardet/stable.svg
   :alt: Build status
   :target: https://travis-ci.org/chardet/chardet

.. image:: https://img.shields.io/coveralls/chardet/chardet/stable.svg
   :target: https://coveralls.io/r/chardet/chardet

.. image:: https://img.shields.io/pypi/v/chardet.svg
   :target: https://warehouse.python.org/project/chardet/
   :alt: Latest version on PyPI

.. image:: https://img.shields.io/pypi/l/chardet.svg
   :alt: License


Detects
 - ASCII, UTF-8, UTF-16 (2 variants), UTF-32 (4 variants)
 - Big5, GB2312, EUC-TW, HZ-GB-2312, ISO-2022-CN (Traditional and Simplified Chinese)
 - EUC-JP, SHIFT_JIS, CP932, ISO-2022-JP (Japanese)
 - EUC-KR, ISO-2022-KR, Johab (Korean)
 - KOI8-R, MacCyrillic, IBM855, IBM866, ISO-8859-5, windows-1251 (Cyrillic)
 - ISO-8859-5, windows-1251 (Bulgarian)
 - ISO-8859-1, windows-1252, MacRoman (Western European languages)
 - ISO-8859-7, windows-1253 (Greek)
 - ISO-8859-8, windows-1255 (Visual and Logical Hebrew)
 - TIS-620 (Thai)

.. note::
   Our ISO-8859-2 and windows-1250 (Hungarian) probers have been temporarily
   disabled until we can retrain the models.

Requires Python 3.7+.

Installation
------------

Install from `PyPI <https://pypi.org/project/chardet/>`_::

    pip install chardet

Documentation
-------------

For users, docs are now available at https://chardet.readthedocs.io/.

Command-line Tool
-----------------

chardet comes with a command-line script which reports on the encodings of one
or more files::

    % chardetect somefile someotherfile
    somefile: windows-1252 with confidence 0.5
    someotherfile: ascii with confidence 1.0

About
-----

This is a continuation of Mark Pilgrim's excellent original chardet port from C,
and `Ian Cordasco <https://github.com/sigmavirus24>`_'s
`charade <https://github.com/sigmavirus24/charade>`_ Python 3-compatible fork.

:maintainer: Dan Blanchard
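
Parsing chardetect Output
-------------------------

If a script needs to consume the report printed by the command-line tool, the
lines can be split into fields. The following is a minimal sketch, not part of
chardet itself: it assumes every result line has the
``<path>: <encoding> with confidence <number>`` shape shown in the
Command-line Tool section, and the names ``Detection`` and
``parse_report_line`` are hypothetical helpers introduced only for this
illustration. Lines that do not match that shape are returned as ``None``
rather than guessed at.

```python
from typing import NamedTuple, Optional


class Detection(NamedTuple):
    """One parsed report line (hypothetical helper type, not a chardet API)."""
    path: str
    encoding: str
    confidence: float


def parse_report_line(line: str) -> Optional[Detection]:
    """Parse '<path>: <encoding> with confidence <number>'.

    Returns None when the line does not have that shape.
    """
    # Split from the right so that a path containing ': ' is left intact.
    head, sep, conf_text = line.strip().rpartition(" with confidence ")
    if not sep:
        return None
    path, sep, encoding = head.rpartition(": ")
    if not sep:
        return None
    try:
        return Detection(path, encoding, float(conf_text))
    except ValueError:
        # The confidence field was not a number.
        return None


if __name__ == "__main__":
    # Sample text copied from the report shown in the Command-line Tool section.
    report = (
        "somefile: windows-1252 with confidence 0.5\n"
        "someotherfile: ascii with confidence 1.0\n"
    )
    for raw in report.splitlines():
        print(parse_report_line(raw))
```

Splitting from the right keeps a path that itself contains a colon in one
piece, at the cost of assuming that encoding names never contain ``": "``.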