apertium-deshtml(1)
apertium-deshtml - This application is part of apertium
This tool is part of the apertium open-source machine translation toolbox: http://www.apertium.org.
apertium-deshtml [ -h ] [ -i ] [ -n ] [ -o ] [ <input file> [ <output file> ] ]
apertium-deshtml is an HTML format processor. HTML data should be passed through it before being piped to lt-proc. The program reads an HTML document and produces output suitable for processing with lt-proc: HTML tags and other format information are enclosed in brackets so that lt-proc treats them as whitespace between words.
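The idea of enclosing format information in brackets can be illustrated with a small Python sketch. This is not the implementation of apertium-deshtml; the function name bracket_tags and the regular expression are assumptions made for illustration only, and the real tool's output may differ in detail (for example in escaping, whitespace handling and the trailing sentence terminator).

```python
import re

# Hypothetical, simplified pattern: anything between "<" and ">" counts as a tag.
# Real HTML (comments, scripts, attributes containing ">") is more complex.
TAG = re.compile(r"<[^>]+>")


def bracket_tags(html: str) -> str:
    """Wrap every HTML tag in square brackets, leaving the text untouched."""
    return TAG.sub(lambda m: "[" + m.group(0) + "]", html)


if __name__ == "__main__":
    print(bracket_tags("<p>Hello <b>world</b></p>"))
    # prints: [<p>]Hello [<b>]world[</b>][</p>]
```

In this sketch only the words "Hello" and "world" remain outside brackets, which is the property that lets a later stage treat the bracketed material as whitespace between words.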
-i Makes the addition of a trailing sentence terminator (".") unconditional, which often leads to duplicate terminators.
-n Suppresses the addition of a trailing sentence terminator.
-o Inserts a "❡" (U+2761 CURVED STEM PARAGRAPH SIGN ORNAMENT) at the end of <h[1-6]> and <title> tags.
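The options above can be combined with the optional input and output files shown in the synopsis. The following Python sketch only assembles an argument list; the helper name deshtml_argv and its keyword parameters are hypothetical, and it assumes apertium-deshtml is available on the PATH if the list is later executed.

```python
from typing import List, Optional


def deshtml_argv(
    input_file: Optional[str] = None,
    output_file: Optional[str] = None,
    *,
    always_terminator: bool = False,  # maps to -i
    no_terminator: bool = False,      # maps to -n
    mark_headings: bool = False,      # maps to -o
) -> List[str]:
    """Build an argument list for apertium-deshtml (hypothetical helper)."""
    # The synopsis nests <output file> inside <input file>, so an output
    # file without an input file is rejected here.
    if output_file is not None and input_file is None:
        raise ValueError("an output file requires an input file")

    argv = ["apertium-deshtml"]
    if always_terminator:
        argv.append("-i")
    if no_terminator:
        argv.append("-n")
    if mark_headings:
        argv.append("-o")
    if input_file is not None:
        argv.append(input_file)
    if output_file is not None:
        argv.append(output_file)
    return argv


if __name__ == "__main__":
    # Only prints the command; running it would need, for example,
    # subprocess.run(argv, check=True) and an installed apertium-deshtml.
    print(deshtml_argv("page.html", "page.txt", no_terminator=True))
```

When no files are given, the resulting command reads standard input and writes standard output, which is how the tool is used in a pipeline ahead of lt-proc.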
apertium-destxt(1), apertium-desrtf(1), lt-proc(1), apertium(1).
Lots of...lurking in the dark and waiting for you!
Copyright (c) 2005, 2006 Universitat d'Alacant / Universidad de Alicante. This is free software. You may redistribute copies of it under the terms of the GNU General Public License <http://www.gnu.org/licenses/gpl.html>.
2006-03-21