CS50P Problem Set 7

NUMB3RS

An IPv4 address is typically formatted in dot-decimal notation as #.#.#.#. But each # should be a number between 0 and 255, inclusive.

In a file called numb3rs.py, implement a functions called validatethat expects an IPv4 address as an input as a str and then returns True or False, respectively, if that input is a valid IPv4 address or not.

Structure numb3ers.py as follows, wherein you're welcome to modify main and/or implement other functions as you see fit, but you may not import any other libraries, You're welcome, but not required, to use re and/or sys.

import re
import sys

def main():
    print(validate(input("IPv4 Address: ")))

def validate(ip):
    ...

...

if __name__ == "__main__":
    main()

Either before or after you implement validate in numb3ers.py, additionally implement, in a file called test_numb3ers.py, two or more functions that collectively test your implementation of validate thoroughly.

import re
import sys

pattern = (r"^([0-9]|[1-9][0-9]|1[0-9][0-9]|2[0-4][0-9]|25[0-5])\.([0-9]|[1-9][0-9]|1[0-9][0-9]|2[0-4][0-9]|25[0-5])\.([0-9]|[1-9][0-9]|1[0-9][0-9]|2[0-4][0-9]|25[0-5])\.([0-9]|[1-9][0-9]|1[0-9][0-9]|2[0-4][0-9]|25[0-5])$")

match = re.search(pattern, ip)

import pytest
from numb3rs import validate

def test_one_digit():
    assert validate("1.2.3.4") == True

Watch on YouTube

In a YouTube video, if you click Share, then Embed, you will see the HTML code that you can copy into your own website's source code.

<iframe width="560" height="315" src="https://www.youtube.com/embed/xvFZjo5PgG0" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>

iframe is an HTML "element", and src is one of several HTML "attributes" therein, the value of which, between quotes, is the URL of the video.

Because some HTML attributes are optional, you could instead minimally embed:

<iframe src="https://www.youtube.com/embed/xvFZjo5PgG0"></iframe>

Suppose that you’d like to extract the URLs of YouTube videos that are embedded in pages (e.g., https://www.youtube.com/embed/xvFZjo5PgG0), converting them back to shorter, shareable youtu.be URLs (e.g., https://youtu.be/xvFZjo5PgG0) where they can be watched on YouTube itself.

Implement a function called parse that expects a str of HTML as input, extracts any YouTube URL that's the value of a src attribute of an iframe element, and returns its shorter, shareable youtu.be equivalent as a str.

Expect that any such URL will be in one of the formats below.
Assume that the value of src will be surrounded by double quotes.
Assume that the input will contain no more than one such URL.
If the input does not contain any such URL at all, return None.

http://youtube.com/embed/xvFZjo5PgG0
https://youtube.com/embed/xvFZjo5PgG0
https://www.youtube.com/embed/xvFZjo5PgG0

Structure watch.py as follows, wherein you're welcome to modify main and/or implement other functions as you see fit, but you may not import any other libraries. You are welcome, but not required, to use re and/or sys.

import re
import sys

def main():
    print(parse(input("HTML: ")))

def parse(s):
    ...

...

if __name__ == "__main__":
    main()

Main

if matches := re.search(r'src="https?://(?:www\.)?youtube\.com/embed/([a-zA-Z0-9_-]+)"', s):
    return f"https://youtu.be/{matches.group(1)}"

Working 9 to 5

Most countries use a 24-hour clock, the United States tends to use a 12-hour clock. Instead of 09:00 to 17:00, many Americans would say 9:00 AM to 5:00PM ("AM" is an abbreviation for "ante meridiem" and "PM" is an abbreviation of "post meridiem", "meridiem" means midday or noon).

Just as 12:00 AM in 12-hour format would be 00:00 in 24-hour format, so would 12:01 AM through 12:59 AM be 00:01 through 00:59, respectively.

In a file called working.py, implement a function called convert that expects a str in any of the 12-hour formats below and returns the corresponding str in 24-hour format.

Expect that AM and PM will be capitalized (with no periods) and that there will be a space before each.
Assume that these times are representative of actual times.

9:00 AM to 5:00 PM
9 AM to 5 PM
9:00 AM to 5 PM
9 AM to 5:00 PM

Raise a ValueError instead if the input to convert is not either of those formats or if either time is invalid (e.g., 12:60 AM, 13:00 PM, etc.).
Do not assume that someone's hours will start ante meridiem and end post meridiem.

Either before or after you implement convert in working.py, additionally implement, in a file called test_working.py, three or more functions that collectively test your implementation of convert thoroughly.

working.py

import re
import sys

pattern = r"^(1[0-2]|[1-9]):?([0-5][0-9])?\s(AM|PM)\sto\s(1[0-2]|[1-9]):?([0-5][0-9])?\s(AM|PM)$"

if match := re.search(pattern, s):

    if not "12" in match.group(1) and "PM" in match.group(3):
        hours1 = int(match.group(1)) + 12
    elif "12" in match.group(1) and "AM" in match.group(3):
                hours1 = 00
    else:
        hours1 = int(match.group(1))

    minutes1 = match.group(2) if match.group(2) else "00"

    ...
        ...

return f"{hours1:02}:{minutes1} to {hours2:02}:{minutes2}"

except ValueError:
    raise ValueError("Invalid time format")

test_working.py

import pytest
from working import convert

def test_hours():
    assert convert("9 AM to 5 PM") == "09:00 to 17:00"
    ...

def test_hours_minutes():
    assert convert("9:21 AM to 5:11 PM") == "09:21 to 17:11"
    ...

def test_exception():
    with pytest.raises(ValueError):
        convert("9:60 AM to 5:60 PM")

Regular, um, Expressions

In a file called um.py, implement a function called count that expects a line of text as input as a str and returns, as an int, the number of times that "um" appears in that text, case-insensitively, as a word unto itself, not as a substring of some other word (like in yummy).

Either before or after you implement count in um.py, additionally implement, in a file called test_um.py, three or more functions that collectively test your implementation of count thoroughly.

um.py

pattern = r"\bum\b"
matches = re.findall(pattern, s, re.IGNORECASE)

count = len(matches)

test_um.py

def test_just_um():
    assert count("um") == 1
    assert count("um um") == 2
    ...

def test_sentences():
    assert count("Thanks for, um, the ride.") == 1
    assert count("Um, what's this, um... thingy?") == 2
    ...

def test_part_of_word():
    assert count("circumstances") == 0
    assert count("instrument") == 0
    ...

Response Validation

When creating a Google Form that prompts users for an answer, it's possible to enable response validation and require that the user's input match a regular expression.

In a file called response.py, using either validator-collection or validators from PyPI, implement a program that prompts the user for an email address via input and then print Valid or Invalid, respectively, if the input is a syntactically valid email address.

You may not use re.
Do not validate whether the email address's domain name actually exists.

import validators

def validate(e):
    if validators.email(e):
        return "Valid"
    else:
        return "Invalid"

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

problem_set7.md

problem_set7.md

CS50P Problem Set 7

NUMB3RS

Watch on YouTube

Working 9 to 5

Regular, um, Expressions

Response Validation

Files

problem_set7.md

Latest commit

History

problem_set7.md

File metadata and controls

CS50P Problem Set 7

NUMB3RS

Watch on YouTube

Working 9 to 5

Regular, um, Expressions

Response Validation