
remi-or / Datasets.py
Last active August 16, 2021 10:54
Snippet for loading datasets
# This snippet requires you to install Hugging Face's datasets module
from datasets import load_dataset
import pandas as pd

Dataframe = pd.DataFrame({})
# First 3000 training questions from SQuAD
questions = load_dataset('squad')['train']['question'][:3000]
Dataframe = pd.concat([Dataframe, pd.DataFrame({'Text': questions, 'Source': 'squad'})])
# First 3000 training questions from HotpotQA (distractor setting)
questions = load_dataset('hotpot_qa', 'distractor')['train']['question'][:3000]
Dataframe = pd.concat([Dataframe, pd.DataFrame({'Text': questions, 'Source': 'hotpot_qa'})])
import matplotlib.pyplot as plt
import seaborn as sns

def average_word_count(list_of_texts):
    """
    Returns the average word count of a list of texts.
    """
    total_count = 0
    for text in list_of_texts:
        # Treat apostrophes as spaces so contractions count as separate words
        text = text.replace("'", ' ')
        total_count += len(text.split())
    return total_count / len(list_of_texts)
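The matplotlib and seaborn imports above suggest the gist went on to plot these statistics. Here is a minimal sketch (not part of the original gist) of how the average question length per source could be compared, assuming the Dataframe built in the previous snippet:

# Hypothetical usage: compare the average question length per source
averages = {source: average_word_count(group['Text'].tolist())
            for source, group in Dataframe.groupby('Source')}
sns.barplot(x=list(averages.keys()), y=list(averages.values()))
plt.ylabel('Average word count')
plt.show()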
remi-or / roberta_peek.py
Created January 17, 2022 18:04
Roberta peek
# Load RoBERTa-large with its masked language modeling head and print its module structure
from transformers import AutoModelForMaskedLM

roberta = AutoModelForMaskedLM.from_pretrained("roberta-large")
print(roberta)
from typing import Any
from transformers import AutoModelForMaskedLM

roberta = AutoModelForMaskedLM.from_pretrained("roberta-large")

def visualize_children(
    object : Any,
    level : int = 0,
) -> None:
    """
    Prints the class name of (object), then does the same for each of its children, indented by (level).
    """
    print(f"{'   ' * level}{level}- {type(object).__name__}")
    # Recurse over the children, if the object has any (i.e. it is an nn.Module)
    try:
        for child in object.children():
            visualize_children(child, level + 1)
    except AttributeError:
        pass
from transformers.models.roberta.modeling_roberta import RobertaPreTrainedModel, RobertaConfig

def distill_roberta(
    teacher_model : RobertaPreTrainedModel,
) -> RobertaPreTrainedModel:
    """
    Distills a RoBERTa (teacher_model) the way DistilBERT distills a BERT model.
    The student model has the same configuration, except for the number of hidden layers, which is halved.
    The student layers are initialized by copying one out of two layers of the teacher, starting with layer 0.
    The head of the teacher is also copied.
    """
    # Copy the teacher's configuration and halve its number of hidden layers
    configuration = teacher_model.config.to_dict()
    configuration['num_hidden_layers'] //= 2
    configuration = RobertaConfig.from_dict(configuration)
    # Create an uninitialized student of the same class, then copy the teacher's weights into it
    student_model = type(teacher_model)(configuration)
    distill_roberta_weights(teacher=teacher_model, student=student_model)
    return student_model
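A minimal usage sketch (assuming the roberta-large checkpoint from the earlier snippets): the returned student keeps the teacher's architecture but with half as many hidden layers.

from transformers import AutoModelForMaskedLM

teacher = AutoModelForMaskedLM.from_pretrained("roberta-large")
student = distill_roberta(teacher)
print(teacher.config.num_hidden_layers, student.config.num_hidden_layers)  # 24 12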
from transformers.models.roberta.modeling_roberta import RobertaEncoder, RobertaModel
from torch.nn import Module

def distill_roberta_weights(
    teacher : Module,
    student : Module,
) -> None:
    """
    Recursively copies the weights of the (teacher) to the (student).
    This function is meant to be first called on a RobertaFor... model, and is then called recursively on all of its children.
    """
    # A full model or a RobertaFor... wrapper: recurse over the direct children
    if isinstance(teacher, RobertaModel) or type(teacher).__name__.startswith('RobertaFor'):
        for teacher_part, student_part in zip(teacher.children(), student.children()):
            distill_roberta_weights(teacher_part, student_part)
    # The encoder: copy one out of every two teacher layers into the student
    elif isinstance(teacher, RobertaEncoder):
        teacher_layers = list(next(teacher.children()))
        student_layers = list(next(student.children()))
        for i, student_layer in enumerate(student_layers):
            student_layer.load_state_dict(teacher_layers[2 * i].state_dict())
    # Any other module (embeddings, heads, ...): copy the weights directly
    else:
        student.load_state_dict(teacher.state_dict())
from torch import Tensor
from transformers.models.roberta.modeling_roberta import RobertaPreTrainedModel

def get_logits(
    model : RobertaPreTrainedModel,
    input_ids : Tensor,
    attention_mask : Tensor,
) -> Tensor:
    """
    Given a RoBERTa (model) for classification and a pair of (input_ids) and (attention_mask),
    returns the logits corresponding to the prediction.
    """
    return model(input_ids=input_ids, attention_mask=attention_mask).logits
import torch
from torch import Tensor
from torch.nn import CrossEntropyLoss, CosineEmbeddingLoss

def distillation_loss(
    teacher_logits : Tensor,
    student_logits : Tensor,
    labels : Tensor,
    temperature : float = 1.0,
) -> Tensor:
    """
    DistilBERT-style distillation loss: the mean of the cross-entropy with the hard (labels),
    the cross-entropy between the (temperature)-softened student and teacher distributions,
    and the cosine embedding loss between those two softened distributions.
    """
    # Soften the teacher and student distributions with the temperature
    soft_teacher = (teacher_logits / temperature).softmax(dim=1)
    soft_student = (student_logits / temperature).softmax(dim=1)
    # Hard-label loss, teacher-student loss and cosine alignment loss
    loss = CrossEntropyLoss()(student_logits, labels)
    loss = loss + CrossEntropyLoss()(student_logits / temperature, soft_teacher)
    loss = loss + CosineEmbeddingLoss()(soft_teacher, soft_student,
                                        torch.ones(teacher_logits.size(0), device=teacher_logits.device))
    return loss / 3
## Imports
from typing import Tuple
import torch
from torch import Tensor
from torch.nn import Module
from transformers.models.roberta.modeling_roberta import RobertaPreTrainedModel, RobertaConfig, RobertaModel, RobertaEncoder
from torch.nn import CrossEntropyLoss, CosineEmbeddingLoss
## Function