Skip to content

Instantly share code, notes, and snippets.

@pythonlessons
Created August 24, 2023 10:12
Show Gist options
  • Save pythonlessons/cb89b4f7fe07daa9c150e83a40611eb6 to your computer and use it in GitHub Desktop.
Save pythonlessons/cb89b4f7fe07daa9c150e83a40611eb6 to your computer and use it in GitHub Desktop.
transformers_nlp_data
for data_batch in train_dataProvider:
(encoder_inputs, decoder_inputs), decoder_outputs = data_batch
encoder_inputs_str = tokenizer.detokenize(encoder_inputs)
decoder_inputs_str = detokenizer.detokenize(decoder_inputs, remove_start_end=False)
decoder_outputs_str = detokenizer.detokenize(decoder_outputs, remove_start_end=False)
print(encoder_inputs_str)
print(decoder_inputs_str)
print(decoder_outputs_str)
break
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment