I would like to use git clone --depth [N]
, but to use such N
that guarantees that a particular sha is obtained.
How I can determine the depth of a sha of in a remote repo. Note cloning it locally to do that is catch 22. I would like to do that to avoid cloning it all.
Option 1:
If you have the ability to get to a full clone of the repository, you can find the depth using git rev-list HEAD ^42c6ee9 --count
.
This will find the depth of any particular commit. There is no remote version so this only works if you can maintain a full copy and then ssh
into it to figure out the depth.
This allows you to only have to clone once but then you'll be able to answer the question for all the following times you'd like to do a shallow copy.
Option 2:
Using git clone --depth 1
then iterating on git fetch --depth=i+1
is actually a good idea worth testing. (Also proposed by @leon above).
Depending on the characteristics of your repository, this will make sense.
E.g. Django repository has 23330 commits (at the time of testing)
./full.sh - pulling the full local repository
git clone https://github.com/django/django
./oracle.sh - if you magically knew the right answer. lower bound on time.
git clone --depth 10 https://github.com/django/django.git
./search.sh - iterating
git clone --depth 1 https://github.com/django/django.git
cd django
i=1
until git show 5d35181 > /dev/null
do
i=$((i+1))
git fetch --depth=$i
done
The search, while there is overhead, may still come out faster then a full clone.
./full.sh 21.34s
./oracle.sh 1.12s
./search.sh 3.05s