UNI2BETA(1) | General Commands Manual | UNI2BETA(1) |
uni2beta - Convert a polytonic Greek document from UTF-8 Unicode to Beta Code
uni2beta [{-l|-u}] [-i input_file.beta] [-o output_file.utf8]
uni2beta(1) reads a document encoded using Unicode and converts it to Beta Code. Beta Code is an ASCII-only encoding scheme most commonly used for digital representation of polytonic Greek.
Beta Code has become a widely-adopted standard for encoding classical Greek. It was developed by David Packard in the 1970s and adopted by the Thesaurus Linguae Graecae (TLG) Project at the University of California, Irvine shortly thereafter. This encoding was later adopted by the Perseus Project in the 1980s (originally at Harvard University, now at Tufts University) and by many other collections of classical and Koine Greek. Today, the TLG corpus alone contains over 100 million words from classical to Byzantine Greek.
The TLG uses uppercase Latin letters for encoding; the Perseus Project uses lowercase. uni2beta(1) defaults to lowercase output. This can be overridden with the `-u' (generate uppercase) option.
Many classicists who use Beta Code have been actively involved in The Unicode Standard, with evolving recommendations for mapping between Beta Code and Unicode. uni2beta(1) provides a capability for GNU/Linux users who wish to convert Unicode texts to Beta Code.
Any encountered UTF-8 Unicode Byte Order Mark (BOM), U+FFFE, is ignored.
Note: Thesaurus Linguae Graecae and TLG are registered trademarks of the University of California.
Sample usage:
Polytonic Greek files in UTF-8 Unicode.
uni2beta was written by Paul Hardy.
uni2beta is Copyright © 2018 Paul Hardy.
This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.
No known bugs exist.
2018 Jun 23 |