IPB
>  Man Pages > Linux > openSUSE 10.2 > Section 3 > Mail::SpamAssassin::Plugin::TextCat man page

Mail::SpamAssassin::Plugin::TextCat man page

Section 3 - openSUSE 10.2 Man Pages

Other operating system man pages available here


Advanced Search

Hopefully, this page is exactly what you are looking for, but if not, you can always find further assistance on Unix/Linux Forum!


Mail::SpamAssassin::PlUser:Contributed PMail::SpamAssassin::Plugin::TextCat(3)



NAME
       Mail::SpamAssassin::Plugin::TextCat - TextCat language guesser

SYNOPSIS
         loadplugin     Mail::SpamAssassin::Plugin::TextCat

DESCRIPTION
       This plugin will try to guess the language used in the message text.

       You can then specify which languages are considered okay for incoming
       mail and if the guessed language is not okay, "UNWANTED_LANGUAGE_BODY"
       is triggered

       It will always add the results to a "X-Language" name-value pair in the
       message metadata data structure.  This may be useful as Bayes tokens
       and can be added to marked-up messages using "add_header".

       Note: the language cannot always be recognized with sufficient confi-
       dence.  In that case, "UNWANTED_LANGUAGE_BODY" will not trigger.

USER OPTIONS
       ok_languages xx [ yy zz ... ]      (default: all)
           This option is used to specify which languages are considered okay
           for incoming mail.  SpamAssassin will try to detect the language
           used in the message text.

           Note that the language cannot always be recognized with sufficient
           confidence.  In that case, no points will be assigned.

           The rule "UNWANTED_LANGUAGE_BODY" is triggered based on how this is
           set.

           In your configuration, you must use the two or three letter lan-
           guage specifier in lowercase, not the English name for the lan-
           guage.  You may also specify "all" if a desired language is not
           listed, or if you want to allow any language.  The default setting
           is "all".

           Examples:

             ok_languages all         (allow all languages)
             ok_languages en          (only allow English)
             ok_languages en ja zh    (allow English, Japanese, and Chinese)

           Note: if there are multiple ok_languages lines, only the last one
           is used.

           Select the languages to allow from the list below:

           af   - Afrikaans
           am   - Amharic
           ar   - Arabic
           be   - Byelorussian
           bg   - Bulgarian
           bs   - Bosnian
           ca   - Catalan
           cs   - Czech
           cy   - Welsh
           da   - Danish
           de   - German
           el   - Greek
           en   - English
           eo   - Esperanto
           es   - Spanish
           et   - Estonian
           eu   - Basque
           fa   - Persian
           fi   - Finnish
           fr   - French
           fy   - Frisian
           ga   - Irish Gaelic
           gd   - Scottish Gaelic
           he   - Hebrew
           hi   - Hindi
           hr   - Croatian
           hu   - Hungarian
           hy   - Armenian
           id   - Indonesian
           is   - Icelandic
           it   - Italian
           ja   - Japanese
           ka   - Georgian
           ko   - Korean
           la   - Latin
           lt   - Lithuanian
           lv   - Latvian
           mr   - Marathi
           ms   - Malay
           ne   - Nepali
           nl   - Dutch
           no   - Norwegian
           pl   - Polish
           pt   - Portuguese
           qu   - Quechua
           rm   - Rhaeto-Romance
           ro   - Romanian
           ru   - Russian
           sa   - Sanskrit
           sco  - Scots
           sk   - Slovak
           sl   - Slovenian
           sq   - Albanian
           sr   - Serbian
           sv   - Swedish
           sw   - Swahili
           ta   - Tamil
           th   - Thai
           tl   - Tagalog
           tr   - Turkish
           uk   - Ukrainian
           vi   - Vietnamese
           yi   - Yiddish
           zh   - Chinese (both Traditional and Simplified)
           zh.big5   - Chinese (Traditional only)
           zh.gb2312 - Chinese (Simplified only)



       inactive_languages xx [ yy zz ... ]          (default: see below)
           This option is used to specify which languages will not be consid-
           ered when trying to guess the language.  For performance reasons,
           supported languages that have fewer than about 5 million speakers
           are disabled by default.  Note that listing a language in "ok_lan-
           guages" automatically enables it for that user.

           The default setting is:

           bs cy eo et eu fy ga gd is la lt lv rm sa sco sl yi

           That list is Bosnian, Welsh, Esperanto, Estonian, Basque, Frisian,
           Irish Gaelic, Scottish Gaelic, Icelandic, Latin, Lithuanian, Lat-
           vian, Rhaeto-Romance, Sanskrit, Scots, Slovenian, and Yiddish.

       textcat_max_languages N (default: 5)
           The maximum number of languages before the classification is con-
           sidered unknown.

       textcat_optimal_ngrams N (default: 0)
           If the number of ngrams is lower than this number then they will be
           removed.  This can be used to speed up the program for longer
           inputs.  For shorter inputs, this should be set to 0.

       textcat_max_ngrams N (default: 400)
           The maximum number of ngrams that should be compared with each of
           the languages models (note that each of those models is used com-
           pletely).

       textcat_acceptable_score N (default: 1.05)
           Include any language that scores at least "textcat_accept-
           able_score" in the returned list of languages



perl v5.8.8                       2006-0Mail::SpamAssassin::Plugin::TextCat(3)


Man(1) output converted with man2html and wrapped by fishsponge

This page was generated on Sat Sep 8 16:37:24 GMT 2007

Your favourite pages:

No pages logged yet.
Trying to save cookie...

Top 10 most popular pages:

svn man page (6161 hits)
(FreeBSD 6.2)

sqlite3 man page (5598 hits)
(openSUSE 10.2)

adv_cap_autoneg man page (5045 hits)
(Solaris 10 11_06)

CPAN man page (4791 hits)
(Suse Linux 10.1)

ssh man page (4439 hits)
(Suse Linux 10.1)

ssh-socks5-proxy-connect man page (3525 hits)
(Solaris 10 11_06)

signal man page (3394 hits)
(Suse Linux 10.1)

netcat man page (3373 hits)
(Suse Linux 10.1)

pprosetup man page (2886 hits)
(Solaris 10 11_06)

startproc man page (2738 hits)
(Suse Linux 10.1)

Useful Links

Go Back

Visitor Statistics


Valid XHTML 1.0 Transitional     Valid CSS!

Partners: Cambridge Plus :: Pyrenees Food :: Robust Foot Switch :: <Link Available>
Unix Man Pages / Linux Man Pages :: HiFi Forum :: SIP VoIP Phone & Provider Reviews :: UNIX/Linux Forum Archives

More info on advertising on Unix/Linux Forum