# -*- cperl -*- =head1 NAME nat-initmat - initialize a sparse matrix with words co-occurrence. =head1 SYNOPSIS nat-initmat [ ] =head1 DESCRIPTION This tool is used internally by C and is not intended to be used independently. Basically, this tool takes two corpora files created by C and allocates a sparse matrix, where rows indexes correspond to word identifiers on the source corpus, and column indexes correspond to word identifiers on the target corpus. Cells count the words co-occurrence on the same sentence. The C file is then created with the matrix information. Optionally, you can pass to the system two exclude lists, as returned by the C tool. These words will be ignored, and counting will not be done for them. The matrix is saved and can be processed later by EM-Algorithm methods IPFP (C), Sample A (C) and Sample B (C). =head1 SEE ALSO nat-words2id, nat-pre, NATools documentation =head1 COPYRIGHT Copyright (C)2002-2009 Alberto Simoes and Jose Joao Almeida Copyright (C)1998 Djoerd Hiemstra GNU GENERAL PUBLIC LICENSE (LGPL) Version 2 (June 1991) =cut