Source: This example was kindly contributed by Pedro Henrique Luz de Araujo, R&D Center for Excellence and Public Sector Transformation - NEXT, Universidade de Brasília - UnB, Brasília, Brazil.
This project constructed a dataset for named entity recognition in Brazilian Legal Text composed of legislation and legal decision texts. As named entity categories we have Person, Organization, Time, Location, Legislation and Legal Decisions. The data was imported to and exported from WebAnno using the CoNLL-2002 format.
- Pedro Henrique Luz de Araujo, Teófilo E. de Campos, Renato R. R. de Oliveira, Matheus Stauffer, Samuel Couto, Paulo Bermejo. LeNER-Br: A Dataset for Named Entity Recognition in Brazilian Legal Text. Computational Processing of the Portuguese Language, Springer International Publishing. [PDF (pre-print)] [Publisher] [BIB]