Revision history for Perl extension Lingua::PT::PLNbase. 0.29 Sun 20 Feb 10:07:00 WET 2022 - Added missing dependency 0.28 Mon 15 Feb 15:52:24 WET 2022 - utf8 input 0.27 Sat Oct 4 16:31:05 WEST 2014 - localize $_ 0.26 Sun Mar 9 21:38:49 WET 2014 - protection of bookcleaner markup "_sec:..._" - PLNbase is now written in UTF-8 - try to choose setlocale(LC_CTYPE, "pt_PT|pt_BR|???") to be less dependent of present LC_CTYPE. - fix a bug related with xml tags tokenizing in "cqptokens" - incorporated lots of new rules for syllable division (thanks to João Machado, from P-Pal project) 0.25 Mon Jun 25 10:58:35 WEST 2012 - Pod encoding fixes 0.24 Fri Jan 21 17:52:32 WET 2011 - Added support to unicode on cqptokens function 0.23 Tue Nov 23 12:32:16 WET 2010 - Added missing dependency. 0.22 Sat Nov 20 13:44:00 2010 - Fixed tests not to use external diff command, and to create temporary files. 0.21 Sat Oct 24 21:58:00 2009 - Added 'atomos' function alias to 'atomiza' 0.20 Sat Mar 15 14:50:00 2008 - Added documentation the 'sentences' command - close output file on 'fsentences' function to flush filehandle 0.19 Mon Feb 25 12:36:00 2008 - Bug fixes - Test bug fix 0.18 Sat Jun 30 11:58:00 2007 - Added tokenization support for most EN and FR texts - Bug fixes 0.17 Tue Nov 21 12:40:00 2006 - Added some functions like xmlsentences and cqptokens - Added some regular expressions on tokenizer 0.16 Mon Nov 20 8:20:01 2006 - Upload of 0.15 failed 0.15 Mon Nov 20 8:20:00 2006 - Change behaviour on tokenizing quotes by configuration 0.14 Sat Jun 4 19:57:00 2005 - Force PT encoding for tests (natural for Lingua::PT modules) 0.13 Thu May 20 9:00:00 2005 - Corrected a bunch of bugs 0.12 Thu Mar 31 15:24:00 2005 - Added pod coverage tests missing 0.11 Wed Mar 16 21:52:00 2005 - Added a script to simplify fsentence usage 0.10 Wed Mar 16 16:00:00 2005 - fsentence supports tokenization and NATools output format; 0.09 Thu Nov 11 9:21:00 2004 - Added some features to segmenter; 0.08 Wed Nov 10 9:05:00 2004 - Corrected some more tests for bad locale systems; 0.07 Tue Nov 9 19:38:00 2004 - Rely on Lingua::PT::Abbrev for abbreviations; - Added some heuristics for tokenization; - Added some tests; 0.06 Tue Oct 5 12:00:00 2004 - finally, corrected the tests. Sorry for the inconvenience. 0.05 Mon Oct 4 09:33:33 2004 - corrected tests skip in case of weird locale. 0.04 Fri Oct 1 16:00:00 2004 - added has_accents and remove_accents functions - added tests for POD correctness and POD coverage 0.03 Thu Sep 2 10:46:10 2004 - added some more tests - added other version of the tokenizer to the documentation - removed failing tests to one of the tokenizers (bad practice) - added documentation for fsentences - corrected bug with wrong substitution 0.02 Tue May 18 21:32:33 2004 - changed module description for CPAN 0.01 Mon Mar 29 14:11:52 2004 - original version; created by h2xs 1.23 with options -X -O -b5.6.0 -n Lingua::PT::PLNbase