hezar.data.datasets package¶
Submodules¶
- hezar.data.datasets.dataset module
- hezar.data.datasets.image_captioning_dataset module
- hezar.data.datasets.ocr_dataset module
OCRDatasetOCRDatasetConfigOCRDatasetConfig.id2labelOCRDatasetConfig.images_paths_columnOCRDatasetConfig.invalid_charactersOCRDatasetConfig.max_lengthOCRDatasetConfig.nameOCRDatasetConfig.pathOCRDatasetConfig.reverse_digitsOCRDatasetConfig.reverse_textOCRDatasetConfig.taskOCRDatasetConfig.text_columnOCRDatasetConfig.text_split_type
TextSplitType
- hezar.data.datasets.sequence_labeling_dataset module
SequenceLabelingDatasetSequenceLabelingDatasetConfigSequenceLabelingDatasetConfig.ignore_indexSequenceLabelingDatasetConfig.is_iob_schemaSequenceLabelingDatasetConfig.label_all_tokensSequenceLabelingDatasetConfig.max_lengthSequenceLabelingDatasetConfig.nameSequenceLabelingDatasetConfig.pathSequenceLabelingDatasetConfig.tags_fieldSequenceLabelingDatasetConfig.taskSequenceLabelingDatasetConfig.tokens_field
- hezar.data.datasets.speech_recognition_dataset module
SpeechRecognitionDatasetSpeechRecognitionDatasetConfigSpeechRecognitionDatasetConfig.audio_array_columnSpeechRecognitionDatasetConfig.audio_array_paddingSpeechRecognitionDatasetConfig.audio_columnSpeechRecognitionDatasetConfig.audio_file_path_columnSpeechRecognitionDatasetConfig.labels_max_lengthSpeechRecognitionDatasetConfig.labels_paddingSpeechRecognitionDatasetConfig.max_audio_array_lengthSpeechRecognitionDatasetConfig.nameSpeechRecognitionDatasetConfig.pathSpeechRecognitionDatasetConfig.sampling_rateSpeechRecognitionDatasetConfig.taskSpeechRecognitionDatasetConfig.transcript_column
- hezar.data.datasets.text_classification_dataset module
- hezar.data.datasets.text_summarization_dataset module
TextSummarizationDatasetTextSummarizationDatasetConfigTextSummarizationDatasetConfig.labels_max_lengthTextSummarizationDatasetConfig.max_lengthTextSummarizationDatasetConfig.nameTextSummarizationDatasetConfig.pathTextSummarizationDatasetConfig.prefixTextSummarizationDatasetConfig.summary_fieldTextSummarizationDatasetConfig.taskTextSummarizationDatasetConfig.text_fieldTextSummarizationDatasetConfig.title_field