XClose

COMP0233: Research Software Engineering With Python

Home
Menu

Working with multiple remotes

NOTE: using bash/git commands is not fully supported on jupyterlite yet (due to single thread/process restriction), and the cells below might error out on the browser (jupyterlite) version of this notebook

Distributed versus centralised

Older version control systems (cvs, svn) were "centralised"; the history was kept only on a server, and all commits required an internet.

Centralised Distributed
Server has history Every user has full history
Your computer has one snapshot Many local branches
To access history, need internet History always available
You commit to remote server Users synchronise histories
cvs, subversion(svn) git, mercurial (hg), bazaar (bzr)

With modern distributed systems, we can add a second remote. This might be a personal fork on github:

In [1]:
import os
top_dir = os.getcwd()
git_dir = os.path.join(top_dir, 'learning_git')
working_dir = os.path.join(git_dir, 'git_example')
os.chdir(working_dir)
In [2]:
%%bash
git switch main
git remote add arc git@github.com:UCL-ARC-RSEing-with-Python/github-example.git
git remote -v
Switched to branch 'main'
Your branch and 'origin/main' have diverged,
and have 20 and 11 different commits each, respectively.
  (use "git pull" if you want to integrate the remote branch with yours)
arc	git@github.com:UCL-ARC-RSEing-with-Python/github-example.git (fetch)
arc	git@github.com:UCL-ARC-RSEing-with-Python/github-example.git (push)
origin	git@github.com:UCL/github-example.git (fetch)
origin	git@github.com:UCL/github-example.git (push)

We can push to a named remote:

In [3]:
%%writefile Pennines.md

Mountains In the Pennines
========================

* Cross Fell
* Whernside
Overwriting Pennines.md
In [4]:
%%bash
git add Pennines.md
git commit -m "Add Whernside"
[main dc41776] Add Whernside
 1 file changed, 1 insertion(+), 1 deletion(-)
In [5]:
%%bash
git push -uf arc main
To github.com:UCL-ARC-RSEing-with-Python/github-example.git
 + 41374a0...dc41776 main -> main (forced update)
branch 'main' set up to track 'arc/main'.

Referencing remotes

You can always refer to commits on a remote like this:

In [6]:
%%bash
git fetch
git log --oneline --left-right arc/main...origin/main
From github.com:UCL-ARC-RSEing-with-Python/github-example
 * [new branch]      gh-pages   -> arc/gh-pages
< dc41776 Add Whernside
< f34ec00 Add github pages YAML frontmatter
< 4f8ee3e Add a makefile and ignore generated files
< 19b10b8 Add Pennines
> 9e2c205 Merge branch 'main' of github.com:UCL/github-example
> 283eda3 Add a beacon
> 00bedc5 Translating from the Welsh
> e131e11 Merge branch 'main' of github.com:UCL/github-example
< 429f6ad Merge branch 'experiment'
> df3e73a Add wales
> 30410a7 Add Scotland
< 88ee098 Commit Aonach onto main branch
< 41b3b35 Add Cadair Idris
> 8a9615a Add Helvellyn
> 1a8249b Include lakes in the scope
> 7bef409 Add lakeland
< 8e1c96e Merge branch 'main' of github.com:UCL/github-example
< ff8a609 Add Glyder
< dbb2bcd Add another Beacon
< 4326076 Merge branch 'main' of github.com:UCL/github-example
< e59bb35 Translating from the Welsh
< ab1254f Add a beacon
< 212e4e6 Merge branch 'main' of github.com:UCL/github-example
< 30ceeb4 Add wales
< 36010d9 Add Scotland
> 8c41bc4 Add a lie about a mountain
> 8f2a031 First commit of discourse on UK topography
< eeaf05c Add Helvellyn
< 684a361 Include lakes in the scope
< be1d265 Add lakeland
< 9c8446b Add a lie about a mountain
< 4415406 First commit of discourse on UK topography

To see the differences between remotes, for example.

To see what files you have changed that aren't updated on a particular remote, for example:

In [7]:
%%bash
git diff --name-only origin/main
.gitignore
Makefile
Pennines.md
Scotland.md
Wales.md
index.md

When you reference remotes like this, you're working with a cached copy of the last time you interacted with the remote. You can do git fetch to update local data with the remotes without actually pulling. You can also get useful information about whether tracking branches are ahead or behind the remote branches they track:

In [8]:
%%bash
git branch -vv
  gh-pages f34ec00 [origin/gh-pages] Add github pages YAML frontmatter
* main     dc41776 [arc/main] Add Whernside

Hosting Servers

Hosting a local server

  • Any repository can be a remote for pulls
  • Can pull/push over shared folders or ssh
  • Pushing to someone's working copy is dangerous
  • Use git init --bare to make a copy for pushing
  • You don't need to create a "server" as such, any 'bare' git repo will do.
In [9]:
bare_dir = os.path.join(git_dir, 'bare_repo')
os.chdir(git_dir)
In [10]:
%%bash
mkdir -p bare_repo
cd bare_repo
git init --bare
Initialized empty Git repository in /home/runner/work/rsd-engineeringcourse/rsd-engineeringcourse/ch00git/learning_git/bare_repo/
In [11]:
os.chdir(working_dir)
In [12]:
%%bash
git remote add local_bare ../bare_repo
git push -u local_bare main
To ../bare_repo
 * [new branch]      main -> main
branch 'main' set up to track 'local_bare/main'.
In [13]:
%%bash
git remote -v
arc	git@github.com:UCL-ARC-RSEing-with-Python/github-example.git (fetch)
arc	git@github.com:UCL-ARC-RSEing-with-Python/github-example.git (push)
local_bare	../bare_repo (fetch)
local_bare	../bare_repo (push)
origin	git@github.com:UCL/github-example.git (fetch)
origin	git@github.com:UCL/github-example.git (push)

You can now work with this local repository, just as with any other git server. If you have a colleague on a shared file system, you can use this approach to collaborate through that file system.

Home-made SSH servers

Classroom exercise: Try creating a server for yourself using a machine you can SSH to:

ssh <mymachine>
mkdir mygitserver
cd mygitserver
git init --bare
exit
git remote add <somename> ssh://user@host/mygitserver
git push -u <somename> main

SSH keys and GitHub

Classroom exercise: If you haven't already, you should set things up so that you don't have to keep typing in your password whenever you interact with GitHub via the command line.

You can do this with an "ssh keypair". You may have created a keypair in the Software Carpentry shell training. Go to the ssh settings page on GitHub and upload your public key by copying the content from your computer. (Probably at .ssh/id_rsa.pub)

If you have difficulties, the instructions for this are on the GitHub website.