llmcompressor.transformers.data.c4
Classes:
-
C4Dataset–Child text generation class for the C4 dataset
C4Dataset
Bases: TextGenerationDataset
Child text generation class for the C4 dataset
Parameters:
-
dataset_args(DatasetArguments) –configuration settings for dataset loading
-
split(str) –split from dataset to load, for instance
testortrain[:5%] -
processor(Processor) –processor or tokenizer to use on dataset