leoloobeek/get_gists.py

rene-d · 2018-07-16T06:28:06Z

Simple and efficient. I added a dictionary file to translate IDs to readable names, and I use this script to back up my gists (fork).

epogrebnyak · 2018-08-24T11:14:41Z

Thanks for sharing this code. I added directory-name slugs (using Django's slugify), which condense i['description'] to a valid path, so the description file can be omitted. See https://gist.github.com/epogrebnyak/c14d6d2ca2740d1e1018e701ea00472a
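
For readers who do not want to pull in Django, the slug idea can be approximated with the standard library. This is only a rough sketch, not Django's implementation and not the code in the linked fork; `slugify`, `folder_name`, and the `max_length` parameter are hypothetical names chosen for illustration.

```python
import re
import unicodedata


def slugify(text, max_length=100):
    """Reduce free text to lowercase ASCII words joined by hyphens."""
    # Strip accents by decomposing characters and dropping non-ASCII bytes.
    text = unicodedata.normalize("NFKD", text).encode("ascii", "ignore").decode("ascii")
    # Drop anything that is not a word character, whitespace, or a hyphen.
    text = re.sub(r"[^\w\s-]", "", text.lower())
    # Collapse runs of whitespace and hyphens into a single hyphen.
    text = re.sub(r"[-\s]+", "-", text).strip("-_")
    return text[:max_length].rstrip("-_")


def folder_name(gist):
    """Use the slugged description, falling back to the gist id."""
    return slugify(gist.get("description") or "") or gist["id"]
```

The fallback to the id matters because a description can be empty, or can consist only of characters the slug removes.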

pfuntner · 2019-01-18T15:12:02Z

Really useful! Thanks! I had no idea that each gist was stored in its own repository, but it makes sense.

brossetti1 · 2019-01-23T01:27:33Z

If anyone's interested in using the description as the folder name instead of the id:

# first: mkdir user && cd user && cp /path/to/get_gists.py .
# python3 get_gists.py user
import requests
import sys
from subprocess import call

user = sys.argv[1]

r = requests.get('https://api.github.com/users/{0}/gists'.format(user))

for i in r.json():
  folder = i['description'] if i['description'] else i['id']
  call(['git', 'clone', i['git_pull_url'], folder])

  description_file = './{0}/description.txt'.format(folder)
  with open(description_file, 'w') as f:
    f.write('{0}\n'.format(i['description']))

aabarbosa · 2019-03-30T08:28:54Z

Most Linux filesystems limit a single file or directory name to 255 bytes, so a long description fails as a folder name:

fatal: could not create leading directories of 'long description.': File name too long

Apart from naming the folders with those descriptions, the script works. I was wondering how to print the descriptions alongside the folder names (any help would be appreciated). For now:
cat **/*.txt

You may prefer this fix, which truncates the description:
folder = i['description'][0:255] if i['description'] else i['id']

This is the final code (for those against the clock)

# first: mkdir user && cd user && cp /path/to/get_gists.py .
# python3 get_gists.py user
import requests
import sys
from subprocess import call

user = sys.argv[1]

r = requests.get('https://api.github.com/users/{0}/gists'.format(user))

for i in r.json():
    folder = i['description'][0:255] if i['description'] else i['id']
    call(['git', 'clone', i['git_pull_url'], folder])
    description_file = './{0}/description.txt'.format(folder)
    with open(description_file, 'w') as f:
        f.write('{0}\n'.format(i['description']))
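
One caveat with slicing to 255: the slice counts characters, while the filesystem limit is usually counted in bytes, so a description with non-ASCII characters can still be too long. A possible way around that is sketched below. The helper names `truncate_utf8` and `safe_folder` are hypothetical, and replacing "/" with "-" is an extra assumption of this sketch, since a slash in a description would otherwise be read as a path separator.

```python
def truncate_utf8(name, max_bytes=255):
    """Shorten name so its UTF-8 encoding fits in max_bytes."""
    encoded = name.encode("utf-8")
    if len(encoded) <= max_bytes:
        return name
    # errors="ignore" drops a multi-byte character cut in half by the slice.
    return encoded[:max_bytes].decode("utf-8", errors="ignore")


def safe_folder(gist, max_bytes=255):
    """Pick a folder name from the description, falling back to the gist id."""
    description = (gist.get("description") or "").replace("/", "-").strip()
    return truncate_utf8(description, max_bytes) or gist["id"]
```

In the loop above, `folder = safe_folder(i)` would take the place of the slicing expression.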

jamesbrink · 2019-06-07T02:31:15Z

Thank you all for saving me time. Always appreciated.

pixelstorm · 2019-08-12T05:56:20Z

For some reason it does not download all the gists. I only get about a quarter of them.

renat-abbyazov · 2019-09-06T10:33:51Z

@pixelstorm It seems the GitHub API returns only 30 gists per response by default. Adding a page number to the request lets you fetch the rest:
https://stackoverflow.com/a/16233710
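
A minimal sketch of that paging loop, using only the standard library so it runs without installing requests. The `page` and `per_page` query parameters are part of the GitHub API; the function names `fetch_page` and `iter_gists`, and the injectable `fetch` argument, are assumptions made for this example.

```python
import json
import urllib.request

API = "https://api.github.com/users/{user}/gists?per_page={per_page}&page={page}"


def fetch_page(user, page, per_page=100):
    """Fetch one page of a user's public gists as parsed JSON."""
    url = API.format(user=user, per_page=per_page, page=page)
    with urllib.request.urlopen(url) as resp:
        return json.load(resp)


def iter_gists(user, fetch=fetch_page):
    """Yield every gist by requesting pages until an empty one comes back."""
    page = 1
    while True:
        gists = fetch(user, page)
        # An error payload (for example rate limiting) is a dict, not a list.
        if not isinstance(gists, list) or not gists:
            break
        yield from gists
        page += 1
```

Looping over `iter_gists(user)` instead of `r.json()` in the original script would then cover every page rather than just the first.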

zubair1024 · 2020-02-01T15:45:50Z

Try this if you're a NodeJS person:
https://github.com/zubair1024/gist-puller

selimslab · 2020-09-25T12:00:43Z

This only downloads the first page of gists. I forked it to download all of them: https://gist.github.com/selimslab/958e2255a105f9a3f4b421896bce715d

antonydevanchi · 2021-06-14T22:58:30Z

Just added some code style, a shebang, one useless comment, and parallelism.

🙃

#!/usr/bin/env python3

import os
import sys
import json
import hashlib
import requests

from subprocess import call
from concurrent.futures import ThreadPoolExecutor as PoolExecutor

def download_all_from_user(user: str):
    
    next_page = True
    page = 1
    
    while next_page:
        
        url = f"https://api.github.com/users/{user}/gists?page={page}"
        
        response = requests.get(url)

        if not len(response.json()):
            next_page = False
        else:
            page += 1

        download_all(response.json())

def download_all(gists: list):
    with PoolExecutor(max_workers=10) as executor:
        for _ in executor.map(download, gists):
            pass

def download(gist):
    
    target = gist["id"] + hashlib.md5(gist["updated_at"].encode('utf-8')).hexdigest()
    
    call(["git", "clone", gist["git_pull_url"], target])

    description_file = os.path.join(target, "description.txt")
    
    with open(description_file, "w") as f:
        f.write(f"{gist['description']}\n")

# Run

user = sys.argv[1]

download_all_from_user(user)

Kamalabot · 2022-07-21T02:59:31Z

There is a more direct way to get at these files, shared by Observable's Ian in the following notebook:
https://observablehq.com/@enjalot/blockbuilder-search-data
The data is already available in JSON format, which, as the notebook shows, gives more insight into the gists. I have made a crude page that lists the gist IDs and thumbnails here:
https://kamalabot.github.io/M3nD3/blocksD3.html

AysadKozanoglu · 2022-09-05T21:02:56Z

@antonydevanchi @leoloobeek Thanks, both scripts work fine.

santry · 2022-12-13T05:14:27Z

One-liner to get the first 100 of your own private gists:

curl -H "Authorization: Bearer <your_token>" 'https://api.github.com/gists?per_page=100' | jq '.[] | .git_pull_url' | xargs -n 1 git clone

Relies on jq.
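
For anyone without jq, roughly the same request can be made from Python. This is a sketch under the same assumptions as the one-liner (a personal access token sent as a Bearer token to the `/gists` endpoint); `auth_request`, `pull_urls`, and `fetch_own_gists` are hypothetical helper names, and only the standard library is used.

```python
import json
import urllib.request


def auth_request(token, per_page=100, page=1):
    """Build an authenticated request for the token owner's gists."""
    url = f"https://api.github.com/gists?per_page={per_page}&page={page}"
    return urllib.request.Request(url, headers={
        "Authorization": f"Bearer {token}",
        "Accept": "application/vnd.github+json",
    })


def pull_urls(gists):
    """Extract the clone URL of each gist in a parsed API response."""
    return [g["git_pull_url"] for g in gists if "git_pull_url" in g]


def fetch_own_gists(token, page=1):
    """Download and parse one page of the token owner's gists."""
    with urllib.request.urlopen(auth_request(token, page=page)) as resp:
        return json.load(resp)
```

Each URL from `pull_urls(fetch_own_gists(token))` can then be passed to `git clone`, as in the scripts above.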

graylan0 · 2023-12-25T19:49:12Z

import os
import sys
import json
import hashlib
import requests
import logging

from subprocess import call, CalledProcessError
from concurrent.futures import ThreadPoolExecutor as PoolExecutor

# Set up basic logging
logging.basicConfig(level=logging.INFO, format='%(asctime)s - %(levelname)s - %(message)s')

def download_all_from_user(user: str):
    next_page = True
    page = 1
    
    while next_page:
        url = f"https://api.github.com/users/{user}/gists?page={page}"
        response = requests.get(url)

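        try:
            gists = response.json()
        except json.JSONDecodeError:
            logging.error("Invalid JSON response")
            break

        # Stop on an empty page. An error payload (e.g. rate limiting) is a
        # dict rather than a list, and would otherwise keep the loop running.
        if not isinstance(gists, list) or not gists:
            break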

        page += 1
        download_all(gists)

def download_all(gists: list):
    with PoolExecutor(max_workers=10) as executor:
        for _ in executor.map(download, gists):
            pass

def download(gist):
    if "id" not in gist or "updated_at" not in gist or "git_pull_url" not in gist:
        logging.error("Missing required gist information")
        return

    target = gist["id"] + hashlib.md5(gist["updated_at"].encode('utf-8')).hexdigest()

    # subprocess.call returns the exit code and never raises
    # CalledProcessError, so check the return value instead.
    if call(["git", "clone", gist["git_pull_url"], target]) != 0:
        logging.error(f"Failed to clone gist {gist['id']}")
        return

    description_file = os.path.join(target, "description.txt")
    
    try:
        with open(description_file, "w") as f:
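            # The API sends "description": null for gists without one, so
            # .get() with a default would still write "None".
            f.write(f"{gist.get('description') or 'No description'}\n")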
    except IOError as e:
        logging.error(f"Failed to write description file: {e}")

# Main execution
if __name__ == "__main__":
    if len(sys.argv) > 1:
        user = sys.argv[1]
        download_all_from_user(user)
    else:
        logging.error("No user specified")

leoloobeek/get_gists.py

# first: mkdir user && cd user && cp /path/to/get_gists.py .
# python3 get_gists.py user
import requests
import sys
from subprocess import call

user = sys.argv[1]

r = requests.get('https://api.github.com/users/{0}/gists'.format(user))

for i in r.json():
    call(['git', 'clone', i['git_pull_url']])

    description_file = './{0}/description.txt'.format(i['id'])
    with open(description_file, 'w') as f:
        f.write('{0}\n'.format(i['description']))
