hezar.models.mask_filling.roberta.roberta_mask_filling module¶
A RoBERTa Language Model (HuggingFace Transformers) wrapped by a Hezar Model class
- class hezar.models.mask_filling.roberta.roberta_mask_filling.RobertaMaskFilling(config, **kwargs)[source]¶
Bases:
Model
- compute_loss(logits: Tensor, labels: Tensor) Tensor [source]¶
Compute loss on the model outputs against the given labels
- Parameters:
logits – Logits tensor to compute loss on
labels – Labels tensor
Note: Subclasses can also override this method and add other arguments besides logits and labels
- Returns:
Loss tensor
- forward(token_ids, attention_mask=None, token_type_ids=None, position_ids=None, head_mask=None, inputs_embeds=None, encoder_hidden_states=None, encoder_attention_mask=None, output_attentions=None, output_hidden_states=None, **kwargs)[source]¶
Forward inputs through the model and return logits, etc.
- Parameters:
model_inputs – The required inputs for the model forward
- Returns:
A dict of outputs like logits, loss, etc.
- post_process(model_outputs: dict, top_k=1)[source]¶
Process model outputs and return human-readable results. Called in self.predict()
- Parameters:
model_outputs – model outputs to process
**kwargs – extra arguments specific to the derived class
- Returns:
Processed model output values and converted to human-readable results
- preprocess(inputs: str | List[str], **kwargs)[source]¶
Given raw inputs, preprocess the inputs and prepare them for model’s forward().
- Parameters:
raw_inputs – Raw model inputs
**kwargs – Extra kwargs specific to the model. See the model’s specific class for more info
- Returns:
A dict of inputs for model forward
- skip_keys_on_load = ['model.embeddings.position_ids', 'roberta.embeddings.position_ids']¶