Version Control with Git

Version control is essential for effective software development, collaborations on articles or proposals and is increasingly used to manage data and analysis workflows. This session will introduce git, show how to set up and update a repository as well as setup online repositories and collaborations.

Git has become the standard tool for version control which can be invaluable for researchers:

  • Nothing that is committed to version control is ever lost, unless you work really, really hard at it. Since all old versions of files are saved, it’s always possible to go back in time to see exactly who wrote what on a particular day, or what version of a program was used to generate a particular set of results.

  • As we have this record of who made what changes when, we know who to ask if we have questions later on, and, if needed, revert to a previous version, much like the “undo” feature in an editor.

  • When several people collaborate in the same project, it’s possible to accidentally overlook or overwrite someone’s changes. The version control system automatically notifies users whenever there’s a conflict between one person’s work and another’s.

Teams are not the only ones to benefit from version control: lone researchers can benefit immensely. Keeping a record of what was changed, when, and why is extremely useful for all researchers if they ever need to come back to the project later on (e.g., a year later, when memory has faded).

Version control is the lab notebook of the digital world: it’s what professionals use to keep track of what they’ve done and to collaborate with other people. Every large software development project relies on it, and most programmers use it for their small jobs as well. And it isn’t just for software: books, papers, small data sets, and anything that changes over time or needs to be shared can and should be stored in a version control system.

Prerequisites

In order to complete the tutorial you should be familiar with:

  • The linux command line

Materials

Lesson material: https://arc-bath.github.io/version-control-git/00_schedule.html

Book a place (University of Bath students and staff only)

Future lessons:

Dates Programme
24th September 14:15-17:05 Linux, Python, Testing, Working with Data
4th December 13:15-16:05 Python, Working with Data

Previous lesson instructor notebooks:

Date Instructor
25th April 2019 James Grant