
Case Study: BERT

What is BERT?

After the transformer, many other advances followed. One of them, of course, is GPT, which uses a decoder-only transformer architecture to predict the next word in a sentence. GPT needs the masked multi-head attention device precisely to avoid making trivial predictions: it must not see the words it is trying to predict. Ultimately, GPT learns an embedding space that increases the likelihood of choosing meaningful words for a text continuation.

The Google team found another interesting way to obtain this type of representation. They trained an encoder-only transformer to predict words removed from the text, much like we know what is missing in "Luke, I am your ____". The idea is that this task can use information from the future, because it depends heavily on context. Simultaneously, they trained the model to classify whether two given sentences follow each other in a corpus. Thus, BERT was born.

graph LR;
  subgraph Input;
    T["Token embeddings"];
    P["Position embeddings"];
    S["Segment embeddings"];
    ADD(["SUM"]);
    T --> ADD;
    P --> ADD;
    S --> ADD;
  end;
  SEQ["Sequence Model"];
  ADD --> SEQ;
  RES["Result: 1 vector per input token"];
  SEQ --> RES;
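
The diagram above says that each token's input representation is the sum of three embeddings: one for the token itself, one for its position, and one for the segment (sentence A or B) it belongs to. Below is a minimal PyTorch sketch of that sum; the dimensions are made up for illustration and are much smaller than BERT's actual ones.

import torch
import torch.nn as nn

# Hypothetical sizes for illustration only (bert-base uses a 30522-token vocabulary and 768 dimensions)
vocab_size, max_len, n_segments, d_model = 1000, 64, 2, 16

token_emb = nn.Embedding(vocab_size, d_model)
position_emb = nn.Embedding(max_len, d_model)
segment_emb = nn.Embedding(n_segments, d_model)

# A fake batch: one sequence of 5 token ids, all belonging to segment 0
token_ids = torch.tensor([[12, 51, 7, 3, 99]])
positions = torch.arange(token_ids.shape[1]).unsqueeze(0)
segments = torch.zeros_like(token_ids)

# The input to the sequence model is simply the sum of the three embeddings
x = token_emb(token_ids) + position_emb(positions) + segment_emb(segments)
print(x.shape)  # torch.Size([1, 5, 16]): one vector per input token

In the real BERT implementation, this sum also passes through layer normalization and dropout before entering the encoder.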

BERT stands for Bidirectional Encoder Representations from Transformers, and was introduced in this paper from 2019. The greatest contribution of BERT, besides its architecture, is the idea of training the language model on different tasks at the same time.

We are definitely not going to train BERT in class, but we will use it for other tasks, through the BERT implementation from Hugging Face. All help files are here.

Task 1: Masked Language Model

The first task BERT was trained for was the Masked Language Model. It was inspired by a task called "Cloze": remove a word from a sentence and let the system predict which word should fill the gap:

graph LR;
  subgraph Inputs;
    INPUT["[CLS] remove some parts [MASK] a sentence"];
  end;
  INPUT --> BERT["BERT"];
  subgraph Outputs;
    OUTPUT["C T1 T2 T3 T4 T5 T6"];
  end;
  BERT --> OUTPUT;
  Train["Loss: T4 should be the word 'of'"];
  OUTPUT --> Train;

This task suggests that the embedding space created by BERT should allow representing words in the context of the rest of the sentence!

To play with this task using Hugging Face's library, you can use:

In [1]:
from transformers import pipeline
unmasker = pipeline('fill-mask', model='bert-base-uncased')
unmasker("Remove some parts [MASK] a sentence.")
Some weights of the model checkpoint at bert-base-uncased were not used when initializing BertForMaskedLM: ['bert.pooler.dense.bias', 'bert.pooler.dense.weight', 'cls.seq_relationship.bias', 'cls.seq_relationship.weight']
- This IS expected if you are initializing BertForMaskedLM from the checkpoint of a model trained on another task or with another architecture (e.g. initializing a BertForSequenceClassification model from a BertForPreTraining model).
- This IS NOT expected if you are initializing BertForMaskedLM from the checkpoint of a model that you expect to be exactly identical (initializing a BertForSequenceClassification model from a BertForSequenceClassification model).
Device set to use cuda:0
Out[1]:
[{'score': 0.9431136250495911,
  'token': 1997,
  'token_str': 'of',
  'sequence': 'remove some parts of a sentence.'},
 {'score': 0.04985498636960983,
  'token': 2013,
  'token_str': 'from',
  'sequence': 'remove some parts from a sentence.'},
 {'score': 0.004208952654153109,
  'token': 1999,
  'token_str': 'in',
  'sequence': 'remove some parts in a sentence.'},
 {'score': 0.000622662715613842,
  'token': 2306,
  'token_str': 'within',
  'sequence': 'remove some parts within a sentence.'},
 {'score': 0.0005233758711256087,
  'token': 2076,
  'token_str': 'during',
  'sequence': 'remove some parts during a sentence.'}]
In [2]:
unmasker("I have a student called [MASK].")
unmasker("I have a student called [MASK].")
Out[2]:
[{'score': 0.006842342671006918,
  'token': 4074,
  'token_str': 'alex',
  'sequence': 'i have a student called alex.'},
 {'score': 0.006842134054750204,
  'token': 3520,
  'token_str': 'sam',
  'sequence': 'i have a student called sam.'},
 {'score': 0.005493461154401302,
  'token': 6864,
  'token_str': 'amy',
  'sequence': 'i have a student called amy.'},
 {'score': 0.005373646505177021,
  'token': 4532,
  'token_str': 'sarah',
  'sequence': 'i have a student called sarah.'},
 {'score': 0.005297194700688124,
  'token': 3841,
  'token_str': 'ben',
  'sequence': 'i have a student called ben.'}]

Algorithmic bias and hallucinations

Note that BERT is generating words that make sense. However, these continuations do not necessarily correspond to reality. In fact, they are simply completions that maximize a probability estimated from a specific dataset!

Check, for example, the output for:

In [3]:
unmasker("Minas Gerais is famous for its [MASK].")
unmasker("Minas Gerais is famous for its [MASK].")
Out[3]:
[{'score': 0.11554374545812607,
  'token': 4511,
  'token_str': 'wine',
  'sequence': 'minas gerais is famous for its wine.'},
 {'score': 0.09914577007293701,
  'token': 14746,
  'token_str': 'wines',
  'sequence': 'minas gerais is famous for its wines.'},
 {'score': 0.09358436614274979,
  'token': 12212,
  'token_str': 'beaches',
  'sequence': 'minas gerais is famous for its beaches.'},
 {'score': 0.07331068813800812,
  'token': 6813,
  'token_str': 'tourism',
  'sequence': 'minas gerais is famous for its tourism.'},
 {'score': 0.054305534809827805,
  'token': 12846,
  'token_str': 'cuisine',
  'sequence': 'minas gerais is famous for its cuisine.'}]

Minas Gerais is a landlocked Brazilian state: it may or may not have wineries, but it definitely does not have beaches, famous or otherwise! Now, check how the output changes when you swap Minas Gerais for a state you know well, such as Kentucky in the USA.

See - there is no "brain" inside BERT. There is merely a system that finds plausible completions for a task. This is something we have been calling "hallucinations" in LLMs. In the end, the model is just as biased as the dataset used for training it.

Algorithmic prejudice

Despite the funny things that the model can output, some assertions can be dangerous, or outright sexist. Try to see the output of:

In [4]:
unmasker("That [MASK] is a doctor.")
unmasker("That [MASK] is a doctor.")
Out[4]:
[{'score': 0.17646944522857666,
  'token': 2158,
  'token_str': 'man',
  'sequence': 'that man is a doctor.'},
 {'score': 0.11029130220413208,
  'token': 3124,
  'token_str': 'guy',
  'sequence': 'that guy is a doctor.'},
 {'score': 0.08735679090023041,
  'token': 2450,
  'token_str': 'woman',
  'sequence': 'that woman is a doctor.'},
 {'score': 0.0790017694234848,
  'token': 2002,
  'token_str': 'he',
  'sequence': 'that he is a doctor.'},
 {'score': 0.061698563396930695,
  'token': 2016,
  'token_str': 'she',
  'sequence': 'that she is a doctor.'}]

Now, let's make a small change here:

In [5]:
unmasker("That [MASK] is a nurse.")
unmasker("That [MASK] is a nurse.")
Out[5]:
[{'score': 0.2685098946094513,
  'token': 2450,
  'token_str': 'woman',
  'sequence': 'that woman is a nurse.'},
 {'score': 0.22261548042297363,
  'token': 2611,
  'token_str': 'girl',
  'sequence': 'that girl is a nurse.'},
 {'score': 0.20899169147014618,
  'token': 2016,
  'token_str': 'she',
  'sequence': 'that she is a nurse.'},
 {'score': 0.0432039275765419,
  'token': 2028,
  'token_str': 'one',
  'sequence': 'that one is a nurse.'},
 {'score': 0.029987310990691185,
  'token': 7743,
  'token_str': 'bitch',
  'sequence': 'that bitch is a nurse.'}]

We could go on finding examples of other types of prejudice - there are all sorts of sexism and racism lying in BERT's learned representations.

In [6]:
sentences = [
    'That criminal is from [MASK].',
    'That CEO is from [MASK].',
    'That man works as a [MASK].',
    'That woman works as a [MASK].',
]

# Print only the top-ranked completion for each sentence
for s in sentences:
    print(unmasker(s)[0]['sequence'])
that criminal is from mexico.
that ceo is from chicago.
that man works as a lawyer.
that woman works as a prostitute.

This is bad, but remember this was 2019, and people were impressed that the system could generate coherent words at all! Nowadays, LLM outputs usually pass through a filter that flags potentially harmful phrases, so this kind of ugly output does not reach the user.
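
As a toy illustration of such post-hoc filtering (this is not how production systems actually do it, and the blocklist below is a made-up example), we could simply discard completions whose predicted token appears in a list of flagged words:

# A made-up, minimal blocklist; real moderation relies on dedicated classifiers, not word lists
BLOCKLIST = {'bitch', 'prostitute'}

def filtered_unmask(prompt):
    # Keep only the completions whose predicted token is not in the blocklist
    return [r for r in unmasker(prompt) if r['token_str'] not in BLOCKLIST]

for r in filtered_unmask("That [MASK] is a nurse."):
    print(r['token_str'], round(r['score'], 3))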


Task 2: Next Sentence Prediction

BERT was also trained for a task called Next Sentence Prediction. The idea of this task is to insert two sentences in the input of BERT, separating them with a special [SEP] token. Then, the system uses the output of the [CLS] token to classify whether these two sentences do or do not follow each other. It is something like:

graph LR; subgraph Inputs; INPUT["[CLS] Here I am [SEP] rock you like a hurricane"]; end; INPUT --> BERT["BERT"]; subgraph Outputs; OUTPUT["C T1 T2 etc"]; end; BERT --> OUTPUT; OUTPUT --> LR; Train["Loss: C should be equal to 1"] LR --- Train;
graph LR; subgraph Inputs; INPUT["[CLS] Here I am [SEP] rock your body"]; end; INPUT --> BERT["BERT"]; subgraph Outputs; OUTPUT["C T1 T2 etc"]; end; BERT --> OUTPUT; Train["Loss: C should be equal to 0"] OUTPUT --- Train;

The consequence of this training is that the embedding $C$ of the [CLS] token represents the content of the rest of the tokens. Hence, we can use it for classification. To do so, we can go straight to the Hugging Face library and use:

In [7]:
from transformers import BertTokenizer, BertModel
tokenizer = BertTokenizer.from_pretrained('bert-base-uncased')
model = BertModel.from_pretrained("bert-base-uncased")
text = "Replace me by any text you'd like."
encoded_input = tokenizer(text, return_tensors='pt')
output = model(**encoded_input)

The embedding for the [CLS] token can be accessed using:

In [8]:
import torch

# The [CLS] token is always the first token, so its embedding sits at position 0
output_cls = output.last_hidden_state[0, 0, :]
print(output_cls.shape)

# Collect the [CLS] embedding for three different sentences
text = "I like cake"
encoded_input = tokenizer(text, return_tensors='pt')
output = model(**encoded_input)
output_cls1 = output.last_hidden_state[0, 0, :]

text = "I like candy"
encoded_input = tokenizer(text, return_tensors='pt')
output = model(**encoded_input)
output_cls2 = output.last_hidden_state[0, 0, :]

text = "My computer is broken"
encoded_input = tokenizer(text, return_tensors='pt')
output = model(**encoded_input)
output_cls3 = output.last_hidden_state[0, 0, :]

# Stack the three embeddings into a (3, 768) matrix and convert it to numpy
all_outputs = torch.stack([output_cls1, output_cls2, output_cls3])
print(all_outputs.shape)

x = all_outputs.detach().cpu().numpy()
torch.Size([768])
torch.Size([3, 768])
In [9]:
from scipy.spatial.distance import cdist

# Calculate cosine distance between rows of x
cosine_distances = cdist(x, x, metric='cosine')

print(cosine_distances)
[[0.         0.01527569 0.06267575]
 [0.01527569 0.         0.05894094]
 [0.06267575 0.05894094 0.        ]]
In [10]:
y = ['fun', 'fun', 'serious']
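
The labels above define a tiny classification problem: each sentence's [CLS] embedding in x becomes a feature vector with a label in y. A minimal sketch of one way to close the loop, using scikit-learn's LogisticRegression (my choice here, not something prescribed above):

from sklearn.linear_model import LogisticRegression

# Fit a classifier on the [CLS] embeddings: x has shape (3, 768), y has one label per sentence
clf = LogisticRegression(max_iter=1000)
clf.fit(x, y)

# Classify a new sentence from its [CLS] embedding
text = "I like chocolate"
encoded_input = tokenizer(text, return_tensors='pt')
output = model(**encoded_input)
new_cls = output.last_hidden_state[0, 0, :].detach().cpu().numpy().reshape(1, -1)
print(clf.predict(new_cls))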

There are many details in this implementation, so I made a video exploring them all.

Activities

Questions

Remembering (Recall facts and basic concepts)

  1. What are the tasks BERT is trained for?
  2. What is next sentence prediction (NSP)?
  3. What is masked language modelling (MLM)?

Understanding (Explain ideas or concepts)

  1. Explain in your own words the core idea behind the use of the CLS token as a representation of the sentence contents.
  2. Why should we expect biases in the masked token prediction task?

Applying (Use information in new situations)

  1. How would you modify the code for text generation to incorporate concepts like temperature, as we have seen previously?
  2. How could we use BERT to generate long strings of text?

Analyzing (Draw connections among ideas, compare/contrast, break down)

  1. Is the model able to generate novel material, that is, phrases that have never been seen before?
  2. Can the model be considered "creative"?

Evaluating (Justify a stand or decision, critique)

  1. Critique the interpretability of the model (predicting probability for single words). While insightful, what potential inaccuracies or simplifications does this method introduce compared to how words contribute within a text?