Last active
April 27, 2023 15:00
-
-
Save jbarratt/fa1d3473048e5f856aeb to your computer and use it in GitHub Desktop.
'nbgrep', search the code of all your ipython notebooks
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
#!/bin/bash | |
# usage: nbgrep 'pattern' | |
SEARCHPATH=~/work/ | |
# 'jq' technique lifted with gratitude | |
# from https://gist.github.com/mlgill/5c55253a3bc84a96addf | |
# Break on newlines instead of any whitespace | |
# IPython Notebook files often have spaces in it | |
SAVEIFS=$IFS | |
IFS=$(echo -en "\n\b") | |
if ! type mdfind > /dev/null 2>&1; then | |
# Use find from findutils | |
FILES=$(find $SEARCHPATH -name '*.ipynb') | |
else | |
# mdfind uses OSX's spotlight search, so it's almost instant | |
# generate a list of all the ipynb files in any of the directories | |
FILES=$(mdfind -onlyin $SEARCHPATH -name '.ipynb') | |
fi | |
# On the command line we get the argument to search for | |
PATTERN=$1 | |
for f in $FILES | |
do | |
# Use 'jq' to filter out only the code in input cells | |
# Then remove quoting | |
# Colorize it with pygments (give it the most context possible to get color right) | |
# And finally, search the remainder for a given pattern | |
OUTPUT=$(jq '.worksheets[]?.cells[]? | select(.cell_type=="code") | .input[]?//.input' $f \ | |
| sed 's/^"//g;s/"$//g;s/\\n$//g;s/\\"/"/g;s/\\\\/\\/g;s/\\n/\n/g' \ | |
| pygmentize -l python 2>/dev/null \ | |
| grep $PATTERN) | |
# If the grep matched anything, print it | |
if [ $? -eq 0 ]; then | |
echo -e "$f:\n\n$OUTPUT\n\n" | |
fi | |
done | |
IFS=$SAVEIFS |
I find that when I try to use this function (in Ubuntu Linux), I tend to get lot of repeats of the following. Any idea what gives?
I run ~/Programs/nbgrep.sh "phylo_join"
And get lots of repeats of this:
3 compile errors
error: Invalid character
.worksheets[]?.cells[]? | select(.cell_type=="code") | .input[]?//.input
^
error: Invalid character
.worksheets[]?.cells[]? | select(.cell_type=="code") | .input[]?//.input
^
error: Invalid character
.worksheets[]?.cells[]? | select(.cell_type=="code") | .input[]?//.input
^
Hey everyone, please try out nbcommands, it has a nbgrep
command too! And it can simply be installed using pip.
Would love to add any jq feature that it might be missing, please open an issue on the repo for that :)
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Thanks @jbarratt and @gmorain, it just works in ubuntu, a take