Skip to content

Instantly share code, notes, and snippets.

@sloanlance
Last active April 3, 2025 22:17
Show Gist options
  • Save sloanlance/c3bf746b6396f60d321f5535e1ced892 to your computer and use it in GitHub Desktop.
Save sloanlance/c3bf746b6396f60d321f5535e1ced892 to your computer and use it in GitHub Desktop.
jq: JSONL ↔︎ JSON conversion

jq: JSONL ↔︎ JSON conversion

Prerequisites

  • jqhttps://jqlang.github.io/jq/ — "like sed for JSON data"

    There are several options available for installing jq. I prefer to use Homebrew: brew install jq

  1. JSONL → JSON

    jq -s '.' input.jsonl > output.json
  2. JSON → JSONL

    jq -c '.[]' input.json > output.jsonl
@spidy0x0
Copy link

spidy0x0 commented Feb 7, 2022

what's jq and can i install it with pip install jq?

@klein0r
Copy link

klein0r commented Mar 17, 2022

@ten0hira
Copy link

what's jq and can i install it with pip install jq?
install using: sudo apt install jq

@frfernandezdev
Copy link

🫶

@delano
Copy link

delano commented Mar 22, 2024

Legend

@sloanlance
Copy link
Author

sloanlance commented Mar 22, 2024 via email

@sloanlance
Copy link
Author

sloanlance commented Mar 22, 2024 via email

@EdGaere
Copy link

EdGaere commented Apr 1, 2025

It's unbelievable, jq can solve this in one line. I was about to embark on writing yet-another-python-script.py to convert a gzipped nested JSON file to JSONL, but thankfully I came across this post.

Suppose you have a JSON like this:

{
   "meta" : { }

 , "data" : [
    { "idx" : 1
      , "input" : "ABC"
     , target : "123"
     , some_other_field : "zzz" 
    },

   { "idx" : 2
      , "input" : "DEF"
     , target : "456"
     , some_other_field : "zzz" 
    }, 
   ...

 ]
}

You can use the following one-line command to extract the 'data' array, keep the 'input' and 'target' fields only, and generate JSONL:
gunzip -c somefile.json.gz | jq .data | jq -c '.[] | {input, target}'

Output

{"input" : "ABC, "target" : "123"}
{"input" : "DEF, "target" : "456"}
...

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment