Part 6

Writing files

So far we have read data from files, but it is naturally also possible to write data to files. Typically a program processes data and stores the results in a file, so they can be used later or processed further by some other program.

We can create a new file every time we want to write data to a file, but we can also append new data to the end of an existing file. In both cases we use the open function from the previous section. For writing files the function requires a second argument.

Creating a new file

If you want to create a new file, you would call the open function with the additional argument w, to signify that the file should be opened in write mode. So, the function call could look like this:

with open("new_file.txt", "w") as my_file:
    # code to write something to the file

NB: if the file already exists, all the contents will be overwritten. It pays to be very careful when creating new files.

With the file open you can write data to it. You can use the method write, which takes the string that is to be written as its argument.

with open("new_file.txt", "w") as my_file:
    my_file.write("Hello there!")

When you execute the program, a new file named new_file.txt appears in the directory. The contents would look like this:

Sample data

Hello there!

If you want line breaks in the file, you will have to add them by hand - the write function doesn't work exactly like the more familiar print function, though they are similar. So, the following program

with open("new_file.txt", "w") as my_file:
    my_file.write("Hello there!")
    my_file.write("This is the second line")
    my_file.write("This is the last line")

would result in a file with these contents:

Sample data

Hello there!This is the second lineThis is the last line

Line breaks are achieved by adding new line characters \n to the argument strings:

with open("new_file.txt", "w") as my_file:
    my_file.write("Hello there!\n")
    my_file.write("This is the second line\n")
    my_file.write("This is the last line\n")

Now the contents of new_file.txt would look like this:

Sample data

Hello there! This is the second line This is the last line

Programming exercise:

Inscription

Points:
Loading...

/

Loading...

Please write a program which asks for the name of the user and then creates an "inscription" in a file specified by the user. Please see the example below.

Sample output

Whom should I sign this to: Ada Where shall I save it: inscribed.txt

The contents of the file inscribed.txt would be

Sample data

Hi Ada, we hope you enjoy learning Python with us! Best, Mooc.fi Team

NB: this exercise doesn't ask you to write any functions, so you should not place any code within an if __name__ == "__main__" block.

Appending data to an existing file

If you want to append data to the end of a file, instead of overwriting the entire file, you should open the file in append mode with the argument a.

If the file doesn't yet exist, append mode works exatly like write mode.

The following program opens the file new_file.txt and appends a couple of lines of text to the end:

with open("new_file.txt", "a") as my_file:
    my_file.write("This is the 4th line\n")
    my_file.write("And yet another line.\n")

After this program is executed the contents of the file would look like this:

Sample output

Hello there! This is the second line This is the last line This is the 4th line And yet another line.

In programming practice, appending data to files is not a very common task.

More often a file is read, processed and overwritten in its entirety. For example, when the contents should change in the middle of the file, it is usually easiest to overwrite the entire file.

Programming exercise:

Diary

Points:
Loading...

/

Loading...

Please write a program which works as a simply diary. The diary entries should be saved in the file diary.txt. When the program is executed, it should first read any entries already in the file.

NB: the automatic tests for this exercise will change the contents of the file. If you want to keep its contents, first make a copy of the file under a different name.

The program should work as follows:

Sample output

1 - add an entry, 2 - read entries, 0 - quit Function: 1 Diary entry: Today I ate porridge Diary saved

1 - add an entry, 2 - read entries, 0 - quit Function: 2 Entries: Today I ate porridge 1 - add an entry, 2 - read entries, 0 - quit Function: 1 Diary entry: I went to the sauna in the evening Diary saved

1 - add an entry, 2 - read entries, 0 - quit Function: 2 Entries: Today I ate porridge I went to the sauna in the evening 1 - add an entry, 2 - read entries, 0 - quit Function: 0 Bye now!

When the program is executed for the second time, this should happen:

Sample output

1 - add an entry, 2 - read entries, 0 - quit Function: 2 Entries: Today I ate porridge I went to the sauna in the evening 1 - add an entry, 2 - read entries, 0 - quit Function: 0 Bye now!

NB: this exercise doesn't ask you to write any functions, so you should not place any code within an if __name__ == "__main__" block.

Writing CSV files

CSV files can be written line by line with the write method just like any other file. The following example creates the file coders.csv, with each line containing the name, working environment, favourite language and years of experience of a single programmer. The fields are separated by a semicolon.

with open("coders.csv", "w") as my_file:
    my_file.write("Eric;Windows;Pascal;10\n")
    my_file.write("Matt;Linux;PHP;2\n")
    my_file.write("Alan;Linux;Java;17\n")
    my_file.write("Emily;Mac;Cobol;9\n")

Executing this program would result in the following file:

Sample output

Eric;Windows;Pascal;10 Matt;Linux;PHP;2 Alan;Linux;Java;17 Emily;Mac;Cobol;9

What if the data to be written is stored in computer memory in a list?

coders = []
coders.append(["Eric", "Windows", "Pascal", 10])
coders.append(["Matt", "Linux", "PHP", 2])
coders.append(["Alan", "Linux", "Java", 17])
coders.append(["Emily", "Mac", "Cobol", 9])

We can build the string we want to write as an f-string, and write the ready line to the file like so:

with open("coders.csv", "w") as my_file:
    for coder in coders:
        line = f"{coder[0]};{coder[1]};{coder[2]};{coder[3]}"
        my_file.write(line+"\n")

If each list of coder data was very long, with many more items, building the string by hand would be quite cumbersome. We can use a for loop to build the string instead:

with open("coders.csv", "w") as my_file:
    for coder in coders:
        line = ""
        for value in coder:
            line += f"{value};"
        line = line[:-1]
        my_file.write(line+"\n")

Clearing file contents and deleting files

Sometimes it is necessary to clear the contents of an existing file. Opening the file in write mode and closing the file immediately will achieve just this:

with open("file_to_be_cleared.txt", "w") as my_file:
    pass

Now the with block only contains the command pass, which doesn't actually do anything. Python does not allow empty blocks, so the command is necessary here.

It is possible to also bypass the with block by using the following oneliner:

open('file_to_be_cleared.txt', 'w').close()
Programming exercise:

Filtering the contents of a file

Points:
Loading...

/

Loading...

The file solutions.csv contains some solutions to mathematics problems:

Arto;2+5;7
Pekka;3-2;1
Erkki;9+3;11
Arto;8-3;4
Pekka;5+5;10
...jne...

As you can see above, on each line the format is name_of_student;problem;result. All the operations are either addition or subtraction, and each has exactly two operands.

Please write a function named filter_solutions() which

  • Reads the contents of the file solutions.csv
  • writes those lines which have a correct result into the file correct.csv
  • writes those lines which have an incorrect result into the file incorrect.csv

Using the example above, the file correct.csv would contain the lines

Arto;2+5;7
Pekka;3-2;1
Pekka;5+5;10

The other two would be in the file incorrect.csv.

Please write the lines in the same order as they appear in the original file. Do not change the original file.

NB: the function should have the exact same result, no matter how many times it is called. That is, it shouldn't matter if the function is called once

filter_solutions()

or multiple times in a row

filter_solutions()
filter_solutions()
filter_solutions()
filter_solutions()

After the execution, the contents of the files correct.csv and incorrect.csv should be exactly the same in either case.

Programming exercise:

Store personal data

Points:
Loading...

/

Loading...

Please write a function named store_personal_data(person: tuple), which takes a tuple containing some identifying information as its argument.

The tuple contains the following elements:

  • Name (string)
  • Age (integer)
  • Height (float)

This should be processed and written into the file people.csv. The file may already contain some data; the new entry goes to the end of the file. The data should be written in the format

name;age;height

Each entry should be on a separate line. If we call the function with the argument ("Paul Paulson", 37, 175.5), the function should write this line to the end of the file:

Paul Paulson;37;175.5

Handling data in a CSV format

Let's write a program which assesses students' performance on a course. The program reads a CSV file, which contains weekly exercise points received by the students. The program then calculates the points total and determines the grade attained by each student. Finally, the program creates a CSV file containing the points total and grade for each student.

The CSV file given as input to the program looks like this:

Sample data

Peter;4;2;3;5;4;0;0 Paula;7;2;8;3;5;4;5 Susan;3;4;3;5;3;4;4 Emily;6;6;5;5;0;4;8

The program logic is divided into three functions: reading the file and processing the contents into an accessible format, determining the grade, and writing the file.

The file is read following the principles covered in the previous section. The data is stored in a dictionary, where the key is the student's name, and the value is a list of the points received by the student, in integer format:

def read_weekly_points(filename):
    weekly_points = {}
    with open(filename) as my_file:
        for line in my_file:
            parts = line.split(";")
            point_list = []
            for points in parts[1:]:
                point_list.append(int(points))
            weekly_points[parts[0]] = point_list

    return weekly_points

The second function is for determining the grade based on the points received. This function is in turn used by the third function, which writes the results to the file.

def grade(points):
    if points < 20:
        return 0
    elif points < 25:
        return 1
    elif points < 30:
        return 2
    elif points < 35:
        return 3
    elif points < 40:
        return 4
    else:
        return 5

def save_results(filename, weekly_points):
    with open(filename, "w") as my_file:
        for name, point_list in weekly_points.items():
            point_sum = sum(point_list)
            my_file.write(f"{name};{point_sum};{grade(point_sum)}\n")

This structure lets us write a very simple main function. Notice how the filenames for the files whch are read and written are given as arguments in the main function:

weekly_points = read_weekly_points("weekly_points.csv")
save_results("results.csv", weekly_points)

When the main function is executed, the contents of the file results.csv created as a result looks like this:

Sample data

Peter;18;0 Paula;34;3 Susan;26;2 Emily;41;5

Notice how each function defined above is relatively simple, and they all have a single responsibility. This is a common and advisable approach when programming larger wholes. The single reponsibility principle makes verifying functionality easier. It also makes it easier to make changes to the program later, and to add new features.

Say we wanted to add a function for printing out the grade for a single student. We already have a function which determines the student's grade, so we can use this in our new function:

def get_grade(student_name, weekly_points):
    for name, point_list in weekly_points.items():
        if name == student_name:
            return grade(sum(point_list))


weekly_points = read_weekly_points("weekly_points.csv")
print(get_grade("Paula", weekly_points))
Sample data

3

If we determine a certain functionality in the program needs fixing, in a well designed program the change will affect only some select sections of code, and it will be easier to determine where the changes should be made. For example, if we wanted to change the grade boundaries, we'd only need to implement the change in the function for determining the grade, and it would work also in all the other functions utilizing this function. If the code for this single functionality was implemented in multiple places, there would be a definite risk that we would not remember to change all the instances when changing the functionality.

Programming exercise:

Course grading, part 4

Points:
Loading...

/

Loading...

Let's revisit the course grading project from the previous section.

As we left if last time, the program read and processed files containing student information, completed exercises and exam results. We'll add a file containing information about the course. An example of the format of the file:

Sample data
name: Introduction to Programming
study credits: 5

The program should then create two files. There should be a file called results.txt with the following contents:

Sample data
Introduction to Programming, 5 credits
======================================
name                          exec_nbr  exec_pts. exm_pts.  tot_pts.  grade
pekka peloton                 21        5         9         14        0
jaana javanainen              27        6         11        17        1
liisa virtanen                35        8         14        22        3

The statistics section is identical to the results printed out in part 3 of the project. The only addition here is the header section.

Additionally, there should be a file called results.csv with the following format:

Sample data
12345678;pekka peloton;0
12345687;jaana javanainen;1
12345699;liisa virtanen;3

When the program is executed, it should look like this:

Sample output

Student information: students1.csv Exercises completed: exercises1.csv Exam points: exam_points1.csv Course information: course1.txt Results written to files results.txt and results.csv

That is, the program only asks for the names of the input files. All output should be written to the files. The user will only see a message confirming this.

NB: this exercise doesn't ask you to write any functions, so you should not place any code within an if __name__ == "__main__" block.

Programming exercise:

Dictionary stored in a file

Points:
Loading...

/

Loading...

Please write a program which functions as a dictionary. The user can type in new entries or look for existing entries.

The program should work as follows:

Sample output

1 - Add word, 2 - Search, 3 - Quit Function: 1 The word in Finnish: auto The word in English: car Dictionary entry added 1 - Add word, 2 - Search, 3 - Quit Function: 1 The word in Finnish: roska The word in English: garbage Dictionary entry added 1 - Add word, 2 - Search, 3 - Quit Function: 1 The word in Finnish: laukku The word in English: bag Dictionary entry added 1 - Add word, 2 - Search, 3 - Quit Function: 2 Search term: bag roska - garbage laukku - bag 1 - Add word, 2 - Search, 3 - Quit Function: 2 Search term: car auto - car 1 - Add word, 2 - Search, 3 - Quit Function: 2 Search term: laukku laukku - bag 1 - Add word, 2 - Search, 3 - Quit Function: 3 Bye!

The dictionary entries should be written to a file called dictionary.txt. The program should first read the contents of the file. New entries are written to the end of the file whenever they are added to the dictionary.

The format of the data stored in the dictionary is up to you.

NB: the automatic tests for this exercise may change the contents of the file. If you want to keep its contents, first make a copy of the file under a different name.

NB2: this exercise doesn't ask you to write any functions, so you should not place any code within an if __name__ == "__main__" block.

You have reached the end of this section! Continue to the next section:

You can check your current points from the blue blob in the bottom-right corner of the page.