NATools is a workbench for parallel corpora processing. It includes a sentence aligner and a Probabilistic Translation Dictionary extractor, a word aligner and a set of other tools to study the aligned parallel corpora.
Follow the links below for more information.
Corpus | Languages | Full | Filtered | Description |
---|---|---|---|---|
COMPARA | PT:EN | 5.9MB | 2.6MB | Full COMPARA corpus |
Constituição | PT:EN | 288KB | 120KB | Fourth Portuguese Constitution |
Constituição | PT:ES | 212KB | 80KB | Fourth Portuguese Constitution |
Constituição | PT:FR | 268KB | 104KB | Fourth Portuguese Constitution |