Home

Results, data and software

1) Corpora annotated with senses in NAF format

Sense annotated corpora converted into NAF format:

  • SemCor
  • SensEval2: traditional all-words task
  • SenseEval3: traditional all-words task
  • SemEval-2010 task 17: WSD on a specific domain
  • SemEval-2007 task 17 all words
  • SemEval-2013 Task 12: Multilingual Word Sense Disambiguation (langs en,es,fr,it,de)

Find this data on GitHub: http://github.com/rubenIzquierdo/wsd_corpora

2) Output for participants in SensEval/SemEval WSD tasks in XML

You will find for the last senseval/semeval WSD tasks all the outputs from the participants systems in an homogeneous XML format, which includes also the gold keys. The tasks covered are:

  • SensEval2: traditional all-words task
  • SenseEval3: traditional all-words task
  • SemEval-2010 task 17: WSD on a specific domain
  • SemEval-2007 task 17 all words
  • SemEval-2013 Task 12: Multilingual Word Sense Disambiguation

All the XML files and data can be found at: https://github.com/rubenIzquierdo/sval_systems

3) Semantic Class Manager

Code and Python API to access and query different sets of semantic classes:

It allows you to:

  • Get the semantic class associated to a given synset offset in WordNet
  • Get the semantic class associated to a given lexical key in WordNet
  • Get all the semantic classes for all the senses of a certain lemma and pos-tag.

Find this repository and code on GitHub: https://github.com/rubenIzquierdo/semantic_class_manager

Leave a Reply

Your email address will not be published.