NATools is a workbench for processing parallel corpora. It includes a sentence aligner, a Probabilistic Translation Dictionary extractor, a word aligner, and a set of other tools for studying aligned parallel corpora.
Follow the links below for more information.
| Corpus | Languages | Full (size) | Filtered (size) | Description |
|---|---|---|---|---|
| COMPARA | PT:EN | 5.9MB | 2.6MB | Full COMPARA corpus |
| Constituição | PT:EN | 288KB | 120KB | Fourth Portuguese Constitution |
| Constituição | PT:ES | 212KB | 80KB | Fourth Portuguese Constitution |
| Constituição | PT:FR | 268KB | 104KB | Fourth Portuguese Constitution |
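
For readers new to the term, the following is a toy sketch of what a probabilistic translation dictionary looks like: a mapping from each source word to candidate target words with estimated probabilities. It is not NATools' implementation or algorithm; the function name `toy_ptd`, the sample sentence pairs, and the plain co-occurrence counting are all assumptions made for illustration.

```python
from collections import Counter, defaultdict


def toy_ptd(aligned_pairs):
    """Build a toy probabilistic translation dictionary.

    aligned_pairs: iterable of (source_sentence, target_sentence) strings
    that are assumed to be already sentence-aligned.
    Returns {source_word: {target_word: probability}}, where each
    probability is a normalized sentence-level co-occurrence count.
    """
    cooc = defaultdict(Counter)
    for src_sentence, tgt_sentence in aligned_pairs:
        # Use sets so each word counts once per sentence pair.
        src_words = set(src_sentence.lower().split())
        tgt_words = set(tgt_sentence.lower().split())
        for s in src_words:
            cooc[s].update(tgt_words)

    ptd = {}
    for s, counts in cooc.items():
        total = sum(counts.values())
        # Normalize the counts so the candidates for each source word sum to 1.
        ptd[s] = {t: c / total for t, c in counts.items()}
    return ptd


# Hypothetical PT:EN sentence pairs, invented for this example.
pairs = [
    ("a casa", "the house"),
    ("a casa azul", "the blue house"),
    ("o livro azul", "the blue book"),
]
ptd = toy_ptd(pairs)

# Print the candidate translations of "casa", most probable first.
for target, prob in sorted(ptd["casa"].items(), key=lambda kv: -kv[1]):
    print(f"casa -> {target}: {prob:.2f}")
```

Raw co-occurrence counting favors frequent words such as "the", which is why real extractors rely on more refined statistical estimation over much larger corpora such as those listed above.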