Clone all repositories of a GitHub user
curl -s https://api.github.com/users/milanboers/repos | grep \"clone_url\" | awk '{print $2}' | sed -e 's/"//g' -e 's/,//g' | xargs -n1 git clone
In my case the API returns 30 entries per page, not 100. Either way, the following solution is independent of the page size:
gubAll() {
    page=1
    repos=0
    # Fetch the first page of the user's repositories and keep only the clone_url lines.
    entries=$(curl -s "https://api.github.com/users/$1/repos?page=$page" | grep \"clone_url\")
    while [[ -n $entries ]]
    do
        repos=$(( repos + $(echo "$entries" | wc -l) ))
        # Extract the URL value, strip quotes and trailing commas, and clone each repo.
        echo "$entries" | awk '{print $2}' | sed -e 's/"//g' -e 's/,//g' | xargs -n1 git clone
        page=$(( page + 1 ))
        entries=$(curl -s "https://api.github.com/users/$1/repos?page=$page" | grep \"clone_url\")
    done
    echo "Pages: $(( page - 1 )), repos: $repos."
}
Then:
$ gubAll milanboers
will grab all 32 repos (at the time of writing) from milanboers.
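The GitHub API also accepts a per_page parameter (maximum 100), so a variant of the same loop needs far fewer requests. A rough sketch along the same lines; the gubAll100 name is just for illustration:

# Same loop as gubAll, but requesting 100 repos per page (the API maximum)
# so large accounts need far fewer requests.
gubAll100() {
    page=1
    entries=$(curl -s "https://api.github.com/users/$1/repos?per_page=100&page=$page" | grep \"clone_url\")
    while [[ -n $entries ]]
    do
        echo "$entries" | awk '{print $2}' | sed -e 's/"//g' -e 's/,//g' | xargs -n1 git clone
        page=$(( page + 1 ))
        entries=$(curl -s "https://api.github.com/users/$1/repos?per_page=100&page=$page" | grep \"clone_url\")
    done
}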
This downloads as many repos as possible from an organization simultaneously, using GNU parallel:
ORGANIZATION=myorg
NUM_REPOS=2000
gh repo list "$ORGANIZATION" -L "$NUM_REPOS" --json name |
    jq -r --arg org "$ORGANIZATION" '$org + "/" + .[].name' |
    parallel gh repo clone
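If cloning everything at once hits API rate limits or saturates the network, GNU parallel's -j option caps the number of concurrent jobs. A sketch of the same pipeline limited to 8 clones at a time (the limit of 8 is an arbitrary choice):

gh repo list "$ORGANIZATION" -L "$NUM_REPOS" --json name |
    jq -r --arg org "$ORGANIZATION" '$org + "/" + .[].name' |
    parallel -j 8 gh repo clone    # -j 8: run at most 8 clones concurrently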
((page_count = public_repos / 100 + 1))
+10 for this smart move: fetching the repository count from the API and then using it to drive the pagination expands the crawl limits.
I'll try your tool at some point; right now I have no task that needs it, but still, good job! 👍
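For reference, a rough sketch of that idea, assuming jq is available; the user variable name is just for illustration:

user=milanboers
# The profile endpoint reports the total number of public repositories.
public_repos=$(curl -s "https://api.github.com/users/$user" | jq .public_repos)
# With 100 repos per page, this is how many pages need to be fetched.
((page_count = public_repos / 100 + 1))
echo "Pages to fetch: $page_count"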