A document from MCS 260 Fall 2021, instructor Emily Dumas. You can also get the notebook file.

MCS 260 Fall 2021 Worksheet 4

  • Course instructor: Emily Dumas

Topics

The main topics on this worksheets are strings, integers, and list comprehensions.

Instructions

  • Work on the exercises below during lab. Complete them afterward if you don't finish during the lab period.
  • This worksheet will prepare you for the upcoming homework assignment.
  • Most of the exercises ask you to write Python scripts to accomplish a certain task. We recommend making a folder (with a name like "worksheet3") to store the scripts you prepare today.
  • Seek assistance from your TA or from fellow students if you run into trouble.

Collaboration

Collaboration on worksheets is strongly encouraged.

Resources

The main course materials to refer to for this worksheet are:

(Lecture videos are not linked on worksheets, but are also useful to review while working on worksheets. Video links can be found in the course course Blackboard site.)

1. Indexing practice

Here is a list of lists:

In [1]:
Q = [ ["Elizabeth","Victoria","Elizabeth"],
      ["Eleanor","Blanche","Margaret","Isabella","Anne","Catherine","Marguerite","Mary","Anne"],
      ["Padmini","Chennamma","Sultana","Holkar"],
      ["Artemisia","Kratesipolis","Nikaia","Olympias"]
    ]

In the list Q, each item is itself a list. So Q[0] refers to the item at index 0,

In [4]:
Q[0]
Out[4]:
['Elizabeth', 'Victoria', 'Elizabeth']

And these items that are lists can be further indexed. Here we take the last element of Q[0]:

In [5]:
Q[0][-1]
Out[5]:
'Elizabeth'

Determine indexing or slicing expressions like this that match the following descriptions, and evaluate them to make sure they give the expected values:

  1. The second to last element of the last element of Q
  2. Q with its first and last elements dropped
  3. All but the last element of the second element of Q (index 1)
  4. All of the index expressions using only positive numbers that evaluate to "Anne"
    • Note: You don't need to write code to check for the value "Anne". Rather, this problem is checking whether you can find it yourself and then translate what you see into a set of numerical indices to use.
  5. The third letter of the third string in the third item of Q

2. List comprehension practice

We continue working with the list of lists Q from problem 1.

Write a list comprehension to do each of the following:

  1. A list of the lengths of the items of Q, which should be [3,9,4,4]
  2. A list of the middle letters of all the strings in Q[1] (use integer division to select the index, which selects the earlier of the two possibilites if a string has no middle letter)
  3. A list of all the strings from Q[1] that contain the letter n.
  4. A list of all the strings from Q[1] that contain the letter n at least two times.
  5. A list that looks like Q[3], but where every word has been replaced by the same number of stars, i.e. ['*********', '************', '******', '********']
  6. A list of strings that is like Q[2] but with any entry equal to "Sultana" dropped and consecutive number labels added to the remaining ones, i.e. ["'1. Padmini", "2. Chennamma", "3. Holkar"]

Note that you should be writing a single line of code consisting of a list comprehension for each answer. There are other ways you could solve these problems with the Python we've learned, but this exercise is testing your ability to solve them with list comprehensions.

3. Adventures in unicode and escape sequences

Look up code point numbers from the charts at https://www.compart.com/en/unicode/block or https://unicode.org/charts/ and use them to make Python code that uses \uXXXX escape sequences to print the following strings. This means your code should only contain characters that appear on a standard US keyboard, but should produce the accented or non-latin characters shown here when it runs:

  1. Pudín de limón
  2. That's the "φ" model
  3. Mathematicians often use the letter-like mathematical symbol ℂ for the set of complex numbers

Accented i and o are in the "Latin-1 supplement", while the letter phi is in the "Greek and Coptic" block.

Note: The default font used by Windows Powershell doesn't include ℂ, and instead of looking for another font which includes that character, it will just produce a symbol that looks like a square with a question mark inside. (That's a sign that a character is present, but can't be shown in the current font.) To confirm you've actually produced ℂ, you can copy the mystery character into the clipboard and paste it into another application (e.g. Discord, Word, the search bar in your web browser, ...).

4. Indexer

Write a Python program indexer.py that lets a user enter any number of strings, and which remembers each distinct string it has seen in a list. If it receives a string it has seen before, it tells the user the 0-based index where that string appears in the list.

More precisely, when the program is run, it starts with an empty list. It then enters an endless loop of doing the following:

  • Display a prompt -> and wait for the user to type a string
  • If the string they entered is already in the list, print a message showing the index where it appears
  • If the string they entered is not in the list, add it to the end of the list and print a message showing its new index

Here's a sample session of using the program:

->David
David = 0  (new)
->Grogu
Grogu = 1  (new)
->Groudon
Groudon = 2  (new)
->Lisa
Lisa = 3  (new)
->David
David = 0
->Tina
Tina = 4  (new)
->Chin-Sun
Chin-Sun = 5  (new)
->Grogu
Grogu = 1
->Chin-Sun
Chin-Sun = 5
->David
David = 0

Notice that the first time David is entered, it is new and gets the index 0, but the next time it is entered, the program recognizes that it was seen before and again reports the index as 0.

Bonus round

Work on these if you finish the problems above.

  • Modify the indexer program so that if you enter a question mark ?as the string, instead of having that added to the list, the program prints the entire list (and then continues as usual, returning to the -> prompt)
  • Modify the indexer program further so that if a string begins with -, then the program interprets the rest of the string as one that should be removed from the list. So, for example, entering David means to add David to the list or report its index if it is already there, while -David means to remove David from the list.
  • Of the first 100 powers of 3, how many contain the digit 7? Can you write a single line of code that evaluates to the answer, using a list comprehension?

Revision history

  • 2021-09-13 Initial release