Is it possible to get a dataset from three sources in DBPedia?
Suppose I have requested DBPedia like this :
select * where
{
?x ?y ?z .
filter (?x = <http://dbpedia.org/resource/Abracadabra>)
}
and the result is a lot of triplets:
x y z
http://dbpedia.org/resource/Abracadabra http://www.w3.org/2002/07/owl#sameAs http://de.dbpedia.org/resource/Abrakadabra
http://dbpedia.org/resource/Abracadabra http://www.w3.org/2002/07/owl#sameAs http://fr.dbpedia.org/resource/Abracadabra
http://dbpedia.org/resource/Abracadabra http://www.w3.org/2002/07/owl#sameAs http://ko.dbpedia.org/resource/아브라카다브라
...
Is it possible to determine which of these datasets is from each triplet? I want to download and use locally some of these datasets, but first I have to figure out which ones are useful to me based on their contained triplets.
In the worst case, I would like to know which dataset (s) contain rdfs: labels.
PS Doesn't this approach work ,? g is always http://dbpedia.org
source to share
From the directories in the listing you linked to, I think you would need to disable data from the English Wikipedia . However, there are still many files in there. The DBpedia dataset (3.9) contains more information about the various files that you can download. Perhaps most importantly, he says:
Find properties used across different DBpedia datasets here .
This link takes you to DBpedia 3.9 Dataset Properties which will answer you, I guess what properties are in datasets. To answer your "worst case", it specifically says that rdfs: label values are stored in the Titles dataset . Even though they call it credits, I think you will find how
labels_en.nq.bz2
labels_en.nt.bz2
labels_en.tql.bz2
labels_en.ttl.bz2
in the list you linked to. I don't know if there is a way to automate the search for datasets. It would be nice if the table in DBpedia 3.9 Data Set Properties was coded somewhere and could be queried because then it would be easy.
source to share