Copied!







text = """
Llamas are fascinating animals that are found in the Andes Mountains.
Yoda says that cute animals they are.
Are llamas cute? Yes they are!
A llama never leaves its herd and will protect it with its life.
They are known for their long necks and thick fur.
Llamas are used as pack animals by indigenous people because they are strong and can carry heavy loads.
They are also very social animals and are often seen in groups.
Llamas ARE herbivores, which means they are only eating plants.
They are also known for their gentle and calm nature, which makes them popular in petting zoos.
Overall, llamas are remarkable creatures that are a joy to observe and are important to the cultures where they are found.
"""

text = """
Llamas are fascinating animals that are found in the Andes Mountains.
Yoda says that cute animals they are.
Are llamas cute? Yes they are!
A llama never leaves its herd and will protect it with its life.
They are known for their long necks and thick fur.
Llamas are used as pack animals by indigenous people because they are strong and can carry heavy loads.
They are also very social animals and are often seen in groups.
Llamas ARE herbivores, which means they are only eating plants.
They are also known for their gentle and calm nature, which makes them popular in petting zoos.
Overall, llamas are remarkable creatures that are a joy to observe and are important to the cultures where they are found.
"""





Copied!







text = """
Llamas are fascinating animals that are found in the Andes Mountains.
Yoda says that cute animals they are.
Are llamas cute? Yes they are!
A llama never leaves its herd and will protect it with its life.
They are known for their long necks and thick fur.
Llamas are used as pack animals by indigenous people because they are strong and can carry heavy loads.
They are also very social animals and are often seen in groups.
Llamas ARE herbivores, which means they are only eating plants.
They are also known for their gentle and calm nature, which makes them popular in petting zoos.
Overall, llamas are remarkable creatures that are a joy to observe and are important to the cultures where they are found.
"""

text = """
Llamas are fascinating animals that are found in the Andes Mountains.
Yoda says that cute animals they are.
Are llamas cute? Yes they are!
A llama never leaves its herd and will protect it with its life.
They are known for their long necks and thick fur.
Llamas are used as pack animals by indigenous people because they are strong and can carry heavy loads.
They are also very social animals and are often seen in groups.
Llamas ARE herbivores, which means they are only eating plants.
They are also known for their gentle and calm nature, which makes them popular in petting zoos.
Overall, llamas are remarkable creatures that are a joy to observe and are important to the cultures where they are found.
"""





Copied!







def find_words(word_to_find, text):
    words = text.split()
    matching_words = []
    for w in words:
        if w == word_to_find:
            matching_words.append(w)
    return len(matching_words)

word_to_find = "are"
count = find_words(word_to_find, text)
print(f'The word "are" appears {count} times in this text.')


def find_words(word_to_find, text):
    words = text.split()
    matching_words = []
    for w in words:
        if w == word_to_find:
            matching_words.append(w)
    return len(matching_words)

word_to_find = "are"
count = find_words(word_to_find, text)
print(f'The word "are" appears {count} times in this text.')

The word "are" appears 13 times in this text.





Copied!







def find_words(word_to_find, text):
    words = text.split()
    matching_words = []
    for w in words:
        if w == word_to_find:
            matching_words.append(w)
    return len(matching_words)

word_to_find = "are"
count = find_words(word_to_find, text)
print(f'The word "are" appears {count} times in this text.')


def find_words(word_to_find, text):
    words = text.split()
    matching_words = []
    for w in words:
        if w == word_to_find:
            matching_words.append(w)
    return len(matching_words)

word_to_find = "are"
count = find_words(word_to_find, text)
print(f'The word "are" appears {count} times in this text.')

The word "are" appears 13 times in this text.





Copied!







def find_words_less_naive(word_to_find, text):
    words = text.split()
    matching_words = []
    for w in words:
        if w == word_to_find:
            matching_words.append(w)
    return len(matching_words)

def find_words_less_naive(word_to_find, text):
    words = text.split()
    matching_words = []
    for w in words:
        if w == word_to_find:
            matching_words.append(w)
    return len(matching_words)





Copied!







def find_words_less_naive(word_to_find, text):
    words = text.split()
    matching_words = []
    for w in words:
        if w == word_to_find:
            matching_words.append(w)
    return len(matching_words)

def find_words_less_naive(word_to_find, text):
    words = text.split()
    matching_words = []
    for w in words:
        if w == word_to_find:
            matching_words.append(w)
    return len(matching_words)





Copied!







import re

print(re.match(r"aa?bb?cc?", "cba"))
print(re.match(r"aa?bb?cc?", "abc"))
print(re.match(r"aa?bb?cc?", "abc").group())

import re

print(re.match(r"aa?bb?cc?", "cba"))
print(re.match(r"aa?bb?cc?", "abc"))
print(re.match(r"aa?bb?cc?", "abc").group())

None
<re.Match object; span=(0, 3), match='abc'>
abc





Copied!







import re

print(re.match(r"aa?bb?cc?", "cba"))
print(re.match(r"aa?bb?cc?", "abc"))
print(re.match(r"aa?bb?cc?", "abc").group())

import re

print(re.match(r"aa?bb?cc?", "cba"))
print(re.match(r"aa?bb?cc?", "abc"))
print(re.match(r"aa?bb?cc?", "abc").group())

None
<re.Match object; span=(0, 3), match='abc'>
abc





Copied!







import re

import re





Copied!







import re

import re

Strings	Expressions
BAbaaaccc	`[aA][bB]c*`
Bccc	`[aA]?[bB]ccc`
ab	`[aA][bB]c+`
ABccccc	`[aAbB]*c+`

Strings	Expressions
SnakeCaseVariableNames123	`\w+.com`
nlp.com	`\w+`
123432412394	`\w\w\w+[.]com`
Brown	`[A-Za-z][A-Za-z0-9]+`
n!com	`\d+`

Keys	Action
`?`	Open this help
`n`	Next page
`p`	Previous page
`s`	Search

Regular expressions: finding text within text¶

Learning outcomes¶

Exercise 1¶

Exercise 2¶

Exercise 3¶

Exercise 4¶

Exercise 5¶