Dataset

class autotm.fitness.tm.Dataset(base_path: str, topic_count: int)
property base_path: str

return: The path to the directory that contains this dataset files

load_dataset()

Partially loads this dataset files creating supplementary data_generator structures and entities like artm.BatchVectorizer

property texts: List[str]

return: a list of PROCESSED texts of the corpus

verify_dataset_files()

Verifies that mandatory files exist in the dataset folder defined by :py:func:’.base_path’