XClose

COMP0023: Research Software Engineering With Python

Home
Menu

Types

We have seen that Python objects have a 'type':

In [1]:
type(5)
Out[1]:
int

Floats and integers

Python has two core numeric types, int for integer, and float for real number.

In [2]:
one = 1
ten = 10
one_float = 1.0
ten_float = 10.

Zero after a point is optional. But the Dot makes it a float.

In [3]:
tenth= one_float / ten_float
In [4]:
tenth
Out[4]:
0.1
In [5]:
type(one)
Out[5]:
int
In [6]:
type(one_float)
Out[6]:
float

The meaning of an operator varies depending on the type it is applied to! (And on the python version.)

In [7]:
print(one // ten)
0
In [8]:
one_float / ten_float
Out[8]:
0.1
In [9]:
print(type(one / ten))
<class 'float'>
In [10]:
type(tenth)
Out[10]:
float

The divided by operator when applied to floats, means divide by for real numbers. But when applied to integers, it means divide then round down:

In [11]:
10 // 3
Out[11]:
3
In [12]:
10.0 / 3
Out[12]:
3.3333333333333335
In [13]:
10 / 3.0
Out[13]:
3.3333333333333335

So if I have two integer variables, and I want the float division, I need to change the type first.

There is a function for every type name, which is used to convert the input to an output of the desired type.

In [14]:
x = float(5)
type(x)
Out[14]:
float
In [15]:
10 / float(3)
Out[15]:
3.3333333333333335

I lied when I said that the float type was a real number. It's actually a computer representation of a real number called a "floating point number". Representing $\sqrt 2$ or $\frac{1}{3}$ perfectly would be impossible in a computer, so we use a finite amount of memory to do it.

In [16]:
N = 10000.0
sum([1 / N] * int(N))
Out[16]:
0.9999999999999062

Strings

Python has a built in string type, supporting many useful methods.

In [17]:
given = "Terry"
family = "Jones"
full = given + " " + family

So + for strings means "join them together" - concatenate.

In [18]:
print(full.upper())
TERRY JONES

As for float and int, the name of a type can be used as a function to convert between types:

In [19]:
ten, one
Out[19]:
(10, 1)
In [20]:
print(ten + one)
11
In [21]:
print(float(str(ten) + str(one)))
101.0

We can remove extraneous material from the start and end of a string:

In [22]:
"    Hello  ".strip()
Out[22]:
'Hello'

Note that you can write strings in Python using either single (' ... ') or double (" ... ") quote marks. The two ways are equivalent. However, if your string includes a single quote (e.g. an apostrophe), you should use double quotes to surround it:

In [23]:
"Terry's animation"
Out[23]:
"Terry's animation"

And vice versa: if your string has a double quote inside it, you should wrap the whole string in single quotes.

In [24]:
'"Wow!", said John.'
Out[24]:
'"Wow!", said John.'

Lists

Python's basic container type is the list.

We can define our own list with square brackets:

In [25]:
[1, 3, 7]
Out[25]:
[1, 3, 7]
In [26]:
type([1, 3, 7])
Out[26]:
list

Lists do not have to contain just one type:

In [27]:
various_things = [1, 2, "banana", 3.4, [1,2] ]

We access an element of a list with an int in square brackets:

In [28]:
various_things[2]
Out[28]:
'banana'
In [29]:
index = 0
various_things[index]
Out[29]:
1

Note that list indices start from zero.

We can use a string to join together a list of strings:

In [30]:
name = ["Sir", "Michael", "Edward", "Palin"]
print("==".join(name))
Sir==Michael==Edward==Palin

And we can split up a string into a list:

In [31]:
"Ernst Stavro Blofeld".split(" ")
Out[31]:
['Ernst', 'Stavro', 'Blofeld']
In [32]:
"Ernst Stavro Blofeld".split("o")
Out[32]:
['Ernst Stavr', ' Bl', 'feld']

And combine these:

In [33]:
"->".join("John Ronald Reuel Tolkein".split(" "))
Out[33]:
'John->Ronald->Reuel->Tolkein'

A matrix can be represented by nesting lists -- putting lists inside other lists.

In [34]:
identity = [[1, 0], [0, 1]]
In [35]:
identity[0][0]
Out[35]:
1

... but later we will learn about a better way of representing matrices.

Ranges

Another useful type is range, which gives you a sequence of consecutive numbers. In contrast to a list, ranges generate the numbers as you need them, rather than all at once.

If you try to print a range, you'll see something that looks a little strange:

In [36]:
range(5)
Out[36]:
range(0, 5)

We don't see the contents, because they haven't been generatead yet. Instead, Python gives us a description of the object - in this case, its type (range) and its lower and upper limits.

We can quickly make a list with numbers counted up by converting this range:

In [37]:
count_to_five = range(5)
print(list(count_to_five))
[0, 1, 2, 3, 4]

Ranges in Python can be customised in other ways, such as by specifying the lower limit or the step (that is, the difference between successive elements). You can find more information about them in the official Python documentation.

Sequences

Many other things can be treated like lists. Python calls things that can be treated like lists sequences.

A string is one such sequence type.

Sequences support various useful operations, including:

  • Accessing a single element at a particular index: sequence[index]
  • Accessing multiple elements (a slice): sequence[start:end_plus_one]
  • Getting the length of a sequence: len(sequence)
  • Checking whether the sequence contains an element: element in sequence

The following examples illustrate these operations with lists, strings and ranges.

In [38]:
print(count_to_five[1])
1
In [39]:
print("Palin"[2])
l
In [40]:
count_to_five = range(5)
In [41]:
count_to_five[1:3]
Out[41]:
range(1, 3)
In [42]:
"Hello World"[4:8]
Out[42]:
'o Wo'
In [43]:
len(various_things)
Out[43]:
5
In [44]:
len("Python")
Out[44]:
6
In [45]:
name
Out[45]:
['Sir', 'Michael', 'Edward', 'Palin']
In [46]:
"Edward" in name
Out[46]:
True
In [47]:
3 in count_to_five
Out[47]:
True

Unpacking

Multiple values can be unpacked when assigning from sequences, like dealing out decks of cards.

In [48]:
mylist = ['Hello', 'World']
a, b = mylist
print(b)
World
In [49]:
range(4)
Out[49]:
range(0, 4)
In [50]:
zero, one, two, three = range(4)
In [51]:
two
Out[51]:
2

If there is too much or too little data, an error results:

In [52]:
zero, one, two, three = range(7)
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
Cell In[52], line 1
----> 1 zero, one, two, three = range(7)

ValueError: too many values to unpack (expected 4)
In [53]:
zero, one, two, three = range(2)
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
Cell In[53], line 1
----> 1 zero, one, two, three = range(2)

ValueError: not enough values to unpack (expected 4, got 2)

Python provides some handy syntax to split a sequence into its first element ("head") and the remaining ones (its "tail"):

In [54]:
head, *tail = range(4)
print("head is", head)
print("tail is", tail)
head is 0
tail is [1, 2, 3]

Note the syntax with the *. The same pattern can be used, for example, to extract the middle segment of a sequence whose length we might not know:

In [55]:
one, *two, three = range(10)
In [56]:
print("one is", one)
print("two is", two)
print("three is", three)
one is 0
two is [1, 2, 3, 4, 5, 6, 7, 8]
three is 9