XClose

COMP0233: Research Software Engineering With Python

Home
Menu

Types

We have seen that Python objects have a 'type':

In [1]:
type(5)
Out[1]:
int

Floats and integers

Python has two core numeric types, int for integer, and float for real number.

In [2]:
one = 1
ten = 10
one_float = 1.0
ten_float = 10.

The zero after a decimal point is optional - it is the Dot makes it a float. However, it is better to always include the zero to improve readability.

In [3]:
tenth= one_float / ten_float
In [4]:
tenth
Out[4]:
0.1
In [5]:
type(one)
Out[5]:
int
In [6]:
type(one_float)
Out[6]:
float

The meaning of an operator varies depending on the type it is applied to!

In [7]:
print(1 + 2)  # returns an integer
3
In [8]:
print(1.0 + 2.0)  # returns a float
3.0

The division by operator always returns a float, whether it's applied to floats or ints.

In [9]:
10 / 3
Out[9]:
3.3333333333333335
In [10]:
10.0 / 3
Out[10]:
3.3333333333333335
In [11]:
10 / 3.0
Out[11]:
3.3333333333333335

To perform integer division we need to use the divmod function, which returns the quotiant and remainder of the division.

In [12]:
quotiant, remainder = divmod(10, 3)
print(f"{quotiant=}, {remainder=}")
quotiant=3, remainder=1

Note that if either of the input type are float, the returned values will also be floats.

In [13]:
divmod(10, 3.0)
Out[13]:
(3.0, 1.0)

There is a function for every built-in type, which is used to convert the input to an output of the desired type.

In [14]:
x = float(5)
type(x)
Out[14]:
float
In [15]:
divmod(10, float(3))
Out[15]:
(3.0, 1.0)

I lied when I said that the float type was a real number. It's actually a computer representation of a real number called a "floating point number". Representing $\sqrt 2$ or $\frac{1}{3}$ perfectly would be impossible in a computer, so we use a finite amount of memory to do it.

In [16]:
N = 10000.0
sum([1 / N] * int(N))
Out[16]:
0.9999999999999062

Strings

Python has a built in string type, supporting many useful methods.

In [17]:
given = "Terry"
family = "Jones"
full = given + " " + family

So + for strings means "join them together" - concatenate.

In [18]:
print(full.upper())
TERRY JONES

As for float and int, the name of a type can be used as a function to convert between types:

In [19]:
ten, one
Out[19]:
(10, 1)
In [20]:
print(ten + one)
11
In [21]:
print(float(str(ten) + str(one)))
101.0

We can remove extraneous material from the start and end of a string:

In [22]:
"    Hello  ".strip()
Out[22]:
'Hello'

Note that you can write strings in Python using either single (' ... ') or double (" ... ") quote marks. The two ways are equivalent. However, if your string includes a single quote (e.g. an apostrophe), you should use double quotes to surround it:

In [23]:
"Terry's animation"
Out[23]:
"Terry's animation"

And vice versa: if your string has a double quote inside it, you should wrap the whole string in single quotes.

In [24]:
'"Wow!", said John.'
Out[24]:
'"Wow!", said John.'

Lists

Python's basic container type is the list.

We can define our own list with square brackets:

In [25]:
[1, 3, 7]
Out[25]:
[1, 3, 7]
In [26]:
type([1, 3, 7])
Out[26]:
list

Lists do not have to contain just one type:

In [27]:
various_things = [1, 2, "banana", 3.4, [1,2] ]

We access an element of a list with an int in square brackets:

In [28]:
various_things[2]
Out[28]:
'banana'
In [29]:
index = 0
various_things[index]
Out[29]:
1

Note that list indices start from zero.

We can use a string to join together a list of strings:

In [30]:
name = ["Sir", "Michael", "Edward", "Palin"]
print("==".join(name))
Sir==Michael==Edward==Palin

And we can split up a string into a list:

In [31]:
"Ernst Stavro Blofeld".split(" ")
Out[31]:
['Ernst', 'Stavro', 'Blofeld']
In [32]:
"Ernst Stavro Blofeld".split("o")
Out[32]:
['Ernst Stavr', ' Bl', 'feld']

And combine these:

In [33]:
"->".join("John Ronald Reuel Tolkein".split(" "))
Out[33]:
'John->Ronald->Reuel->Tolkein'

A matrix can be represented by nesting lists -- putting lists inside other lists.

In [34]:
identity = [[1, 0], [0, 1]]
In [35]:
identity[0][0]
Out[35]:
1

... but later we will learn about a better way of representing matrices.

Ranges

Another useful type is range, which gives you a sequence of consecutive numbers. In contrast to a list, ranges generate the numbers as you need them, rather than all at once.

If you try to print a range, you'll see something that looks a little strange:

In [36]:
range(5)
Out[36]:
range(0, 5)

We don't see the contents, because they haven't been generatead yet. Instead, Python gives us a description of the object - in this case, its type (range) and its lower and upper limits.

We can quickly make a list with numbers counted up by converting this range:

In [37]:
count_to_five = range(5)
print(list(count_to_five))
[0, 1, 2, 3, 4]

Ranges in Python can be customised in other ways, such as by specifying the lower limit or the step (that is, the difference between successive elements). You can find more information about them in the official Python documentation.

Sequences

Many other things can be treated like lists. Python calls things that can be treated like lists sequences.

A string is one such sequence type.

Sequences support various useful operations, including:

  • Accessing a single element at a particular index: sequence[index]
  • Accessing multiple elements (a slice): sequence[start:end_plus_one]
  • Getting the length of a sequence: len(sequence)
  • Checking whether the sequence contains an element: element in sequence

The following examples illustrate these operations with lists, strings and ranges.

In [38]:
print(count_to_five[1])
1
In [39]:
print("Palin"[2])
l
In [40]:
count_to_five = range(5)
In [41]:
count_to_five[1:3]
Out[41]:
range(1, 3)
In [42]:
"Hello World"[4:8]
Out[42]:
'o Wo'
In [43]:
len(various_things)
Out[43]:
5
In [44]:
len("Python")
Out[44]:
6
In [45]:
name
Out[45]:
['Sir', 'Michael', 'Edward', 'Palin']
In [46]:
"Edward" in name
Out[46]:
True
In [47]:
3 in count_to_five
Out[47]:
True

Unpacking

Multiple values can be unpacked when assigning from sequences, like dealing out decks of cards.

In [48]:
mylist = ['Hello', 'World']
a, b = mylist
print(b)
World
In [49]:
range(4)
Out[49]:
range(0, 4)
In [50]:
zero, one, two, three = range(4)
In [51]:
two
Out[51]:
2

If there is too much or too little data, an error results:

In [52]:
zero, one, two, three = range(7)
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
Cell In[52], line 1
----> 1 zero, one, two, three = range(7)

ValueError: too many values to unpack (expected 4)
In [53]:
zero, one, two, three = range(2)
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
Cell In[53], line 1
----> 1 zero, one, two, three = range(2)

ValueError: not enough values to unpack (expected 4, got 2)

Python provides some handy syntax to split a sequence into its first element ("head") and the remaining ones (its "tail"):

In [54]:
head, *tail = range(4)
print("head is", head)
print("tail is", tail)
head is 0
tail is [1, 2, 3]

Note the syntax with the *. The same pattern can be used, for example, to extract the middle segment of a sequence whose length we might not know:

In [55]:
one, *two, three = range(10)
In [56]:
print("one is", one)
print("two is", two)
print("three is", three)
one is 0
two is [1, 2, 3, 4, 5, 6, 7, 8]
three is 9