8.1.3.1.2.1.1. cltk.corpora.lat.phi package¶
8.1.3.1.2.1.1.1. Submodules¶
8.1.3.1.2.1.1.2. cltk.corpora.lat.phi.file_utils module¶
Higher-level (i.e., user-friendly) functions for quickly reading
PHI5 data after it has been processed by TLGU()
.
-
cltk.corpora.lat.phi.file_utils.
phi5_plaintext_cleanup
(text, rm_punctuation=False, rm_periods=False)[source]¶ Remove and substitute post-processing for Latin PHI5 text. TODO: Surely more junk to pull out. Please submit bugs! TODO: This is a rather slow now, help in speeding up welcome.
Reads PHI5 index and builds a list of absolute filepaths.
8.1.3.1.2.1.1.3. cltk.corpora.lat.phi.phi5_index module¶
Indices to the PHI5 Latin corpus.