
Differences

This page shows the differences between the selected revision and the current version of the page.

seznamy:tagery [2015/02/03 20:22] Alexandr Rosen seznamy:tagery [2022/09/29 14:17] (current) Jan Křivan
Line 1: Line 1:
 ====== Taggers & lemmatizers ====== ====== Taggers & lemmatizers ======
 +
 +Notice: The list is not kept up to date (last update 2/2015).
  
 ^ ^[[ http://staffwww.dcs.shef.ac.uk/people/A.Aker/activityNLPProjects.html|Ahmet Akker]] (tool)^[[ https://svn.code.sf.net/p/apertium/svn/languages/|Apertium]] (tool)^[[ http://clcl.unige.ch/btag/|BTTtagger]] (tool)^[[ http://ufal.mff.cuni.cz/compost/|COMPOST]] (tool)^[[http://nlp.lsi.upc.edu/freeling/|Freeling]] (tool)^[[ https://tech.yandex.ru/mystem/|MYSTEM]] (tool)^[[ http://www.nltk.org|NLTK]] (tool)^[[ http://rdrpostagger.sourceforge.net|RDRPOSTagger]] (tool)^[[ http://www.cis.uni-muenchen.de/~schmid/tools/RFTagger/|RFTagger]] (tool)^[[ http://nlp.stanford.edu/software/tagger.shtml|Stanford]] (tool)^[[ http://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/|Treetagger]] (tool)^Other ^ ^ ^[[ http://staffwww.dcs.shef.ac.uk/people/A.Aker/activityNLPProjects.html|Ahmet Akker]] (tool)^[[ https://svn.code.sf.net/p/apertium/svn/languages/|Apertium]] (tool)^[[ http://clcl.unige.ch/btag/|BTTtagger]] (tool)^[[ http://ufal.mff.cuni.cz/compost/|COMPOST]] (tool)^[[http://nlp.lsi.upc.edu/freeling/|Freeling]] (tool)^[[ https://tech.yandex.ru/mystem/|MYSTEM]] (tool)^[[ http://www.nltk.org|NLTK]] (tool)^[[ http://rdrpostagger.sourceforge.net|RDRPOSTagger]] (tool)^[[ http://www.cis.uni-muenchen.de/~schmid/tools/RFTagger/|RFTagger]] (tool)^[[ http://nlp.stanford.edu/software/tagger.shtml|Stanford]] (tool)^[[ http://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/|Treetagger]] (tool)^Other ^
 ^Arabic| |  x  | | | | | | | |  x  | |[[ http://nlp.ldeo.columbia.edu/madamira/|Madamira]] (web, tool)| ^Arabic| |  x  | | | | | | | |  x  | |[[ http://nlp.ldeo.columbia.edu/madamira/|Madamira]] (web, tool)|
-^Asturian| | | | |  x  | | | | | | | |+^Asturian| |  x  | | |  x  | | | | | | | |
 ^Belarusian| | | | | |  x  | | | | | | | ^Belarusian| | | | | |  x  | | | | | | |
 ^Bengali| | | | | | |  x  | | | | | | ^Bengali| | | | | | |  x  | | | | | |
Line 11: Line 13:
 ^Croatian| |  x  |  x  | | | | | | | | |[[ http://nlp.ffzg.hr/resources/models/tagging/|Nikola Ljubešić]] (tool)| ^Croatian| |  x  |  x  | | | | | | | | |[[ http://nlp.ffzg.hr/resources/models/tagging/|Nikola Ljubešić]] (tool)|
 ^Czech| | |  x  |  x  | | | |  x  |  x  | | |[[ http://ufal.mff.cuni.cz/morphodita|MorphoDiTa]] (tool)| ^Czech| | |  x  |  x  | | | |  x  |  x  | | |[[ http://ufal.mff.cuni.cz/morphodita|MorphoDiTa]] (tool)|
-^Danish| | | | | | | |  x  | | | |[[ https://mlnl.net/jg/software/bnl/|CST]] (web, tool)|+^Danish| |  x  | | | | | |  x  | | | |[[ https://mlnl.net/jg/software/bnl/|CST]] (web, tool)|
 ^Dutch|  x  |  x  | |  x  | | |  x  |  x  | | |  __x__  |[[ https://mlnl.net/jg/software/bnl/|Brill-NL]] (web, tool), [[http://ilk.uvt.nl/frog/|Frog]] (tool)| ^Dutch|  x  |  x  | |  x  | | |  x  |  x  | | |  __x__  |[[ https://mlnl.net/jg/software/bnl/|Brill-NL]] (web, tool), [[http://ilk.uvt.nl/frog/|Frog]] (tool)|
 ^English|  x  | |  x  |  x  |  x  | | | | |  x  |  __x__  |[[ http://ufal.mff.cuni.cz/morphodita|MorphoDiTa]] (tool)| ^English|  x  | |  x  |  x  |  x  | | | | |  x  |  __x__  |[[ http://ufal.mff.cuni.cz/morphodita|MorphoDiTa]] (tool)|
 ^Estonian| | |  x  | | | | | | | |  __x__  | | ^Estonian| | |  x  | | | | | | | |  __x__  | |
 ^Finnish| | | | | | | | | | |  x  |[[ https://github.com/TurkuNLP/Finnish-dep-parser|OMorFi]] (tool)| ^Finnish| | | | | | | | | | |  x  |[[ https://github.com/TurkuNLP/Finnish-dep-parser|OMorFi]] (tool)|
-^French|  x  | |  x  | |  x  | | |  x  | |  x  |  __x__  | | +^French|  x  |  x  |  x  | |  x  | | |  x  | |  x  |  __x__  | | 
-^Galician| | | | |  x  | | | | | |  x  | |+^Galician| |  x  | | |  x  | | | | | |  x  | |
 ^German|  x  | | | | | | |  x  |  __x__  |  x  |  x  | | ^German|  x  | | | | | | |  x  |  __x__  |  x  |  x  | |
 ^Greek| | | | | | | | | | | |[[ http://nlp.ilsp.gr/ws/|ILSP]] (web)| ^Greek| | | | | | | | | | | |[[ http://nlp.ilsp.gr/ws/|ILSP]] (web)|
-^Hebrew| | | | | | | | | | | |[[ http://www.mila.cs.technion.ac.il|MILA]] (tool)|+^Hebrew| |  x  | | | | | | | | | |[[ http://www.mila.cs.technion.ac.il|MILA]] (tool)|
 ^Hindi| |  x  | | | | |  x  |  x  | | | |[[ http://sivareddy.in/downloads#indian_language_tools|Siva Reddy]] (tool), [[ http://ltrc.iiit.ac.in/analyzer/hindi/|Hindi Shallow Parser]] (web)| ^Hindi| |  x  | | | | |  x  |  x  | | | |[[ http://sivareddy.in/downloads#indian_language_tools|Siva Reddy]] (tool), [[ http://ltrc.iiit.ac.in/analyzer/hindi/|Hindi Shallow Parser]] (web)|
 ^Hungarian| | |  x  | | | | | |  x  | | |__[[ http://code.google.com/p/hunpos/|hunpos]]__ (tool)| ^Hungarian| | |  x  | | | | | |  x  | | |__[[ http://code.google.com/p/hunpos/|hunpos]]__ (tool)|
-^Icelandic| | | |  x  | | | | | | | |__[[ http://www.ling.su.se/english/nlp/tools/stagger/stagger-the-stockholm-tagger-1.98986|IceStagger]]__ (tool)|+^Icelandic| |  x  | |  x  | | | | | | | |__[[ http://www.ling.su.se/english/nlp/tools/stagger/stagger-the-stockholm-tagger-1.98986|IceStagger]]__ (tool)|
 ^Indonesian| | | | | | |  x  | | | | | | ^Indonesian| | | | | | |  x  | | | | | |
-^Italian|  x  | |  x  | |  x  | | |  x  | | |  __x__  | |+^Italian|  x  |  x  |  x  | |  x  | | |  x  | | |  __x__  | |
 ^Japanese| | | | | | |  x  | | | | |[[ https://code.google.com/p/mecab/|mecab]] (tool)| ^Japanese| | | | | | |  x  | | | | |[[ https://code.google.com/p/mecab/|mecab]] (tool)|
 ^Lao| | | | | | | |  x  | | | | | ^Lao| | | | | | | |  x  | | | | |
Line 33: Line 35:
 ^Marathi| | | | | | |  x  | | | | | | ^Marathi| | | | | | |  x  | | | | | |
 ^Mongolian| | | | | | | | | | |  x  | | ^Mongolian| | | | | | | | | | |  x  | |
-^Norwegian| | | | | | | | | | | |__[[ http://tekstlab.uio.no/obt-ny/index.html|obt]]__ (tool)|+^Norwegian| |  x  | | | | | | | | | |__[[ http://tekstlab.uio.no/obt-ny/index.html|obt]]__ (tool)|
 ^Persian| | | | | | | | | | | |[[ https://mlnl.net/jg/software/bnl/|hazm]] (tool)| ^Persian| | | | | | | | | | | |[[ https://mlnl.net/jg/software/bnl/|hazm]] (tool)|
 ^Polish| | |  x  | | | |  x  | | | |  x  |__[[ http://nlp.pwr.wroc.pl/takipi/|TaKIPI]]__, [[ http://zil.ipipan.waw.pl/PANTERA|Pantera]], [[http://zil.ipipan.waw.pl/Concraft|Concraft]], [[http://nlp.pwr.wroc.pl/redmine/projects/wcrft/wiki|WCRFT]] (tools)((For more tools see [[http://clip.ipipan.waw.pl/LRT|Language Tools and Resources for Polish]].)) | ^Polish| | |  x  | | | |  x  | | | |  x  |__[[ http://nlp.pwr.wroc.pl/takipi/|TaKIPI]]__, [[ http://zil.ipipan.waw.pl/PANTERA|Pantera]], [[http://zil.ipipan.waw.pl/Concraft|Concraft]], [[http://nlp.pwr.wroc.pl/redmine/projects/wcrft/wiki|WCRFT]] (tools)((For more tools see [[http://clip.ipipan.waw.pl/LRT|Language Tools and Resources for Polish]].)) |
-^Portuguese| | | | |  x  | |  x  |  x  | | |  __x__  | |+^Portuguese| |  x  | | |  x  | |  x  |  x  | | |  __x__  | |
 ^Romanian| |  x  |  x  | | | | | | | | |[[ http://www.racai.ro/tools/text/|RACAI]] (web)| ^Romanian| |  x  |  x  | | | | | | | | |[[ http://www.racai.ro/tools/text/|RACAI]] (web)|
-^Russian| | | | |  x  |  x  | | |  x  | |  __x__  | |+^Russian| |  x  | | |  x  |  x  | | |  x  | |  __x__  | |
 ^Serbian| |  x  |  x  | | | | | | | | |[[ http://nlp.ffzg.hr/resources/models/tagging/|Nikola Ljubešić]] (tool)| ^Serbian| |  x  |  x  | | | | | | | | |[[ http://nlp.ffzg.hr/resources/models/tagging/|Nikola Ljubešić]] (tool)|
 ^Slovak| | | | | | | | |  x  | |  x  |__[[ http://ufal.mff.cuni.cz/morce/index.php|Morče]]__ (tool)| ^Slovak| | | | | | | | |  x  | |  x  |__[[ http://ufal.mff.cuni.cz/morce/index.php|Morče]]__ (tool)|
 ^Slovene| | |  x  | | | | | |  x  | | |__[[ http://nl.ijs.si/analyse/|ToTaLe]]__ (tool)| ^Slovene| | |  x  | | | | | |  x  | | |__[[ http://nl.ijs.si/analyse/|ToTaLe]]__ (tool)|
-^Spanish|  x  | | | |  x  | |  x  |  x  | | |  __x__  | |+^Spanish|  x  |  x  | | |  x  | |  x  |  x  | | |  __x__  | |
 ^Swahili| | | | | | | | | | |  x  | | ^Swahili| | | | | | | | | | |  x  | |
-^Swedish| | | |  x  | | | |  x  | | | |__[[ http://www.ling.su.se/english/nlp/tools/stagger/stagger-the-stockholm-tagger-1.98986|Stagger]]__ (tool)|+^Swedish| |  x  | |  x  | | | |  x  | | | |__[[ http://www.ling.su.se/english/nlp/tools/stagger/stagger-the-stockholm-tagger-1.98986|Stagger]]__ (tool)|
 ^Telugu| | | | | | |  x  | | | | | | ^Telugu| | | | | | |  x  | | | | | |
 ^Thai| | | | | | | |  x  | | | | | ^Thai| | | | | | | |  x  | | | | |
Line 50: Line 52:
 ^Ukrainian| |  x  | | | |  x  | | | | | |[[ http://ugtag.sourceforge.net|ugtag]] (tool)| ^Ukrainian| |  x  | | | |  x  | | | | | |[[ http://ugtag.sourceforge.net|ugtag]] (tool)|
 ^Vietnamese| | | | | | | |  x  | | | |[[ http://mim.hus.vnu.edu.vn/phuonglh/softwares|vnTagger]], [[ http://vlsp.vietlp.org:8080/demo/?&lang=en|Vietnamese Language and Speech Processing (VLSP) / VietTagger]] (tools)| ^Vietnamese| | | | | | | |  x  | | | |[[ http://mim.hus.vnu.edu.vn/phuonglh/softwares|vnTagger]], [[ http://vlsp.vietlp.org:8080/demo/?&lang=en|Vietnamese Language and Speech Processing (VLSP) / VietTagger]] (tools)|
-^Welsh| | | | |  x  | | | | | | | |+^Welsh| |  x  | | |  x  | | | | | | | |
  
  
-<WRAP round info 75%> +<fs x-small>Note: The list does not include tools without a disambiguation component, such as morphological analyzers  [[http://nlp.fi.muni.cz/projekty/ajka/ajkacz.htm|Ajka]] or [[http://nlp.fi.muni.cz/czech-morphology-analyser/|Majka]].</fs>
-For additional resources see Wiki of the Association for Computational Linguistics – [[http://www.aclweb.org/aclwiki/index.php?title=List_of_resources_by_language|List of resources by language]].+
  
-Tools of varied coverage for more languages may be found at [[https://languagetool.org]]. 
  
-The list does not include tools without a disambiguation component, such as morphological analyzers  [[http://nlp.fi.muni.cz/projekty/ajka/ajkacz.htm|Ajka]] or [[http://nlp.fi.muni.cz/czech-morphology-analyser/|Majka]].+<WRAP round info 75%> 
 +For additional resources see Wiki of the Association for Computational Linguistics – [[http://www.aclweb.org/aclwiki/index.php?title=List_of_resources_by_language|List of resources by language]] and [[https://languagetool.org|list]] of tools of varied coverage for more languages.
  
 Tools currently used in [[cnk:intercorp|InterCorp]], the parallel section of the Czech National Corpus, are underlined. Tools currently used in [[cnk:intercorp|InterCorp]], the parallel section of the Czech National Corpus, are underlined.
 </WRAP> </WRAP>
- 
  
  --- //Alexandr Rosen & corpora@uib.no subscribers//  --- //Alexandr Rosen & corpora@uib.no subscribers//