-
jq — https://jqlang.github.io/jq/ — "like sed for JSON data"
There are several options available for installing jq. I prefer to use Homebrew:
brew install jq
-
jq -s '.' input.jsonl > output.json
-
jq -c '.[]' input.json > output.jsonl
jq — https://jqlang.github.io/jq/ — "like sed for JSON data"
There are several options available for installing jq.
I prefer to use Homebrew: brew install jq
jq -s '.' input.jsonl > output.json
jq -c '.[]' input.json > output.jsonl
@spidyhackx See https://stedolan.github.io/jq/download/
what's jq and can i install it with
pip install jq
?
install using:sudo apt install jq
🫶
Legend
It's unbelievable, jq can solve this in one line. I was about to embark on writing yet-another-python-script.py to convert a gzipped nested JSON file to JSONL, but thankfully I came across this post.
Suppose you have a JSON like this:
{
"meta" : { }
, "data" : [
{ "idx" : 1
, "input" : "ABC"
, target : "123"
, some_other_field : "zzz"
},
{ "idx" : 2
, "input" : "DEF"
, target : "456"
, some_other_field : "zzz"
},
...
]
}
You can use the following one-line command to extract the 'data' array, keep the 'input' and 'target' fields only, and generate JSONL:
gunzip -c somefile.json.gz | jq .data | jq -c '.[] | {input, target}'
Output
{"input" : "ABC, "target" : "123"}
{"input" : "DEF, "target" : "456"}
...
what's jq and can i install it with
pip install jq
?