a/src/filters/rcllatinclass.py | b/src/filters/rcllatinclass.py | ||
---|---|---|---|
1 | #!/usr/bin/env python2
|
1 | #!/usr/bin/env python3
|
2 | """Try to guess a text's language and character set by checking how it matches lists of |
2 | """Try to guess a text's language and character set by checking how it matches lists of |
3 | common words. This is not a primary method of detection because it's slow and unreliable, but it |
3 | common words. This is not a primary method of detection because it's slow and unreliable, but it |
4 | may be a help in discriminating, for example, between European languages using relatively close |
4 | may be a help in discriminating, for example, between European languages using relatively close |
5 | variations of iso-8859. |
5 | variations of iso-8859. |
6 | This is used in association with a zip file containing a number of stopword lists: rcllatinstops.zip |
6 | This is used in association with a zip file containing a number of stopword lists: rcllatinstops.zip |