I think there may be something that prevents ingesting raw text files... This section of generate_ngrams.py builds the data for a progress bar, but it seems to assume that a toms.db exists:
|
philo_db_path: str = os.path.abspath(os.path.join(text, os.pardir, os.pardir, "toms.db")) |
This wouldn't be true if there was no Philologic database, and the user was just reading in texts files from the file system, I think?
I think there may be something that prevents ingesting raw text files... This section of
generate_ngrams.pybuilds the data for a progress bar, but it seems to assume that a toms.db exists:text-pair/lib/textpair/sequence_alignment/generate_ngrams.py
Line 170 in 29edcb8
This wouldn't be true if there was no Philologic database, and the user was just reading in texts files from the file system, I think?