APERTIUM-DESHTML(1) | General Commands Manual | APERTIUM-DESHTML(1) |
apertium-deshtml
—
HTML format processor for Apertium
apertium-deshtml |
[-hino ] [input_file
[output_file]] |
This tool is part of the Apertium open-source machine translation toolbox.
apertium-deshtml
is an HTML format
processor. Data should be passed through this processor before being piped
to lt-proc(1). The program takes input in the form of an
HTML document and produces output suitable for processing with
lt-proc(1). HTML tags and other format information are
enclosed in brackets so that lt-proc(1) treats them as
whitespace between words.
You could write the following to show how the word “gener” is analysed:
echo
"<b>gener</b>" | apertium-deshtml | lt-proc
ca-es.automorf.bin
apertium(1), apertium-desrtf(1), apertium-destxt(1), lt-proc(1)
Copyright © 2005, 2006 Universitat d'Alacant / Universidad de Alicante. This is free software. You may redistribute copies of it under the terms of the GNU General Public License.
Many... lurking in the dark and waiting for you!
March 21, 2006 | Apertium |