umakaparserが吐き出すJSONは以下のフィールドから成り立っています。
- meta_data
- prefixes
- classes
- properties
- inheritance_structure
それぞれどのように情報を取得しているのかについて記述します。
pip install torch | |
brew update | |
brew install rustup | |
rustup-init | |
source "$HOME/.cargo/env" | |
git clone https://github.com/huggingface/tokenizers | |
cd tokenizers/bindings/python | |
pip install setuptools_rust | |
python setup.py install | |
pip install transformers -U |
umakaparserが吐き出すJSONは以下のフィールドから成り立っています。
それぞれどのように情報を取得しているのかについて記述します。
{ | |
"@context": [ | |
"https://schema.org/docs/jsonldcontext.json", | |
{ | |
"@vocab": "http://json2ld.mapper.tokyo/ns/", | |
"json2ld": "http://json2ld.mapper.tokyo/ns/", | |
"name": { | |
"@id": "schema:name" | |
}, | |
"homepage": { |
<http://purl.org/allie/id/longform/1528191> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://purl.org/allie/ontology/201108#LongForm> . | |
<http://purl.org/allie/id/longform/1528191> <http://www.w3.org/2000/01/rdf-schema#label> "specific pathogen-free"@en . | |
<http://purl.org/allie/id/longform/1528191> <http://www.w3.org/2000/01/rdf-schema#label> "特定病原体除去"@ja . | |
<http://purl.org/allie/id/longform/1528191> <http://www.w3.org/2002/07/owl#sameAs> <http://dbpedia.org/resource/Specific_Pathogen_Free> . | |
<http://purl.org/allie/id/longform/1528191> <http://www.w3.org/2002/07/owl#sameAs> <http://dbpedia.org/resource/Specific_pathogen_free> . | |
<http://purl.org/allie/id/longform/1528191> <http://purl.org/allie/ontology/201108#frequency> "474" . | |
<http://purl.org/allie/id/pair/1547869> <http://purl.org/allie/ontology/201108#hasLongFormOf> <http://purl.org/allie/id/longform/1528191> . | |
<http://purl.org/allie/id/pair/1943614> <http://purl.org/allie/ontology/201108#hasLongFormOf> <http://purl.org/allie/id/longform/1528191> |
<http://identifiers.org/refseq/NC_011757.1#feature:567410-567778:-1:CDS.525> insdc:prote<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1//EN" "ht2 - WWW Error 500 Diagnostic</title>s"> true . | |
$ java -jar ~/src/ConvRDF-master/ConvRDF.jar -i:turtle /tmp/yayamamo_fifo > refseq.nt | |
14:54:43 WARN riot :: [line: 2669123396, col: 90] Bad IRI: <http://identifiers.org/obo.go/0003676; GO> Code: 17/WHITESPACE in PATH: A single whitespace character. These match no grammar rules of URIs/IRIs. These characters are permitted in RDF URI References, XML system identifiers, and XML Schema anyURIs. | |
14:54:43 WARN riot :: [line: 2669123397, col: 1 ] Bad IRI: <http://identifiers.org/obo.go/0003676; GO> Code: 17/WHITESPACE in PATH: A single whitespace character. These match no grammar rules of URIs/IRIs. These characters are permitted in RDF URI References, XML system identifiers, and XML Schema anyURIs. | |
14:54:43 WARN riot :: [line: 2669123398, col: 1 ] Bad IR |
$ java -version | |
java version "1.7.0_75" | |
Java(TM) SE Runtime Environment (build 1.7.0_75-b13) | |
Java HotSpot(TM) 64-Bit Server VM (build 24.75-b04, mixed mode) |
$ pip install -U setuptools | |
Downloading/unpacking setuptools from https://pypi.python.org/packages/8d/ae/766f375fc05b3d345b7082333da9f8b49af02d9c5680ff4eb15655fc5ae1/setuptools-27.3.0-py2.py3-none-any.whl#md5=53f779309b8d04422b4f65cd09af4a28 | |
Downloading setuptools-27.3.0-py2.py3-none-any.whl (467kB): 467kB downloaded | |
Installing collected packages: setuptools | |
Found existing installation: setuptools 5.8 | |
Uninstalling setuptools: | |
Successfully uninstalled setuptools | |
Successfully installed setuptools | |
Cleaning up... |
<http://purl.jp/bio/10/mhlword20140908/102> <http://www.w3.org/2000/01/rdf-schema#seeAlso> <http://purl.jp/bio/10/lsd/term/J078068> . | |
<http://purl.jp/bio/10/mhlword20140908/38> <http://www.w3.org/2000/01/rdf-schema#seeAlso> <http://purl.jp/bio/10/lsd/term/J038273> . | |
<http://purl.jp/bio/10/mhlword20140908/105> <http://www.w3.org/2000/01/rdf-schema#seeAlso> <http://purl.jp/bio/10/lsd/term/J146645> . | |
<http://purl.jp/bio/10/mhlword20140908/31> <http://www.w3.org/2000/01/rdf-schema#seeAlso> <http://www.orpha.net/ORDO/Orphanet_610> . | |
<http://purl.jp/bio/10/mhlword20140908/62> <http://www.w3.org/2000/01/rdf-schema#seeAlso> <http://www.orpha.net/ORDO/Orphanet_90035> . | |
<http://purl.jp/bio/10/mhlword20140908/13> <http://www.w3.org/2000/01/rdf-schema#seeAlso> <http://www.orpha.net/ORDO/Orphanet_71211> . | |
<http://purl.jp/bio/10/mhlword20140908/11> <http://www.w3.org/2000/01/rdf-schema#seeAlso> <http://purl.jp/bio/10/lsd/term/J013955> . | |
<http://purl.jp/bio/10/mhlword20140908/47> <http://www.w3.org/2000/01/rdf-schema#label> |
# Obtain the label of a given class (:class1). | |
SELECT DISTINCT ?c (STR(?l) AS ?lb) | |
WHERE { | |
?c a :class1 ; | |
<http://www.w3.org/2000/01/rdf-schema#label> ?l . | |
} | |
# Obtain a list of classes. | |
SELECT DISTINCT ?c | |
WHERE { |