Deleting batches of records with SPARQL

I learned this when trying to clear our records in AWS Neptune. I was hitting the query timeout when trying to drop an entire graph. If you don't want to/can't raise the timeout, you can drop smaller parts of the graph in each transaction.

curl -sX POST http://<cluster-prefix>.rds.amazonaws.com:8182/sparql --data-urlencode 'update=
DELETE {
  GRAPH <http://aws.amazon.com/neptune/vocab/v01/DefaultNamedGraph> { ?s ?p ?o }
}
WHERE {
  GRAPH <http://aws.amazon.com/neptune/vocab/v01/DefaultNamedGraph> {
    {
      SELECT ?s ?p ?o
      WHERE {
        ?s ?p ?o .
      }
      LIMIT 10
    }
  }
}
'

This will delete 10 records, specifically the first 10 that are returned for a SELECT * WHERE { ?s ?p ?o } query. You can adjust the limit value to find a batch size that keeps you under the timeout.

Yeah, this is a dirty hack but there was a bit of pain to learn this so I want to store the knowledge.

Also, be sure to use --data-urlencode not --data-binary otherwise you might find the server ignores your input but doesn't give any indication of error.

tomsaleeba/README.md

Select an option

No results found

Select an option

No results found

agcunha commented Oct 7, 2019

Uh oh!

GeniJaho commented Oct 17, 2020

Uh oh!

nzewail commented Jan 5, 2022

Uh oh!

jimsmart commented Mar 12, 2023 •

edited

Loading

Uh oh!

tomsaleeba/README.md

agcunha commented Oct 7, 2019

Uh oh!

GeniJaho commented Oct 17, 2020

Uh oh!

nzewail commented Jan 5, 2022

Uh oh!

jimsmart commented Mar 12, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jimsmart commented Mar 12, 2023 •

edited

Loading