Is Semantic Web and Linked Data Good Enough? SPARQL & DBPedia vs. Python & IMDbPY

July 11, 2012 at 22:23

I think you used the wrong knowledge base. DBpedia is based on Wikipedia, not on IMDb, so of course the information is going to be different. If you had used LinkedMDB, which pulls its data from IMDb among other sources, you might have gotten different results:
http://www.linkedmdb.org/snorql/

Emre Sevinc

July 11, 2012 at 23:11

Hello Alexandru,

I did not mention it in my article, but I also tried LinkedMDB and its SPARQL endpoints. Unfortunately, even the simplest query fails to return a result, and its web pages serve content that is not very up to date. For example, see the LinkedMDB page of Rachel Nichols, who stars in Continuum: http://data.linkedmdb.org/page/actor/623 You will not see Continuum listed there.

coskun gunduz

July 12, 2012 at 10:47

Big companies (or all of them!) should find ways to publish their up-to-date data automatically through software, with no human effort or at most much less of it, for example just for confirmation.

Emre Sevinc

July 13, 2012 at 00:06

Yes, I agree, but I still can’t see the short-term incentive for many companies. The lack of widespread Semantic Web expertise and of easy-to-integrate tools only adds to this. Nevertheless, I still think there is hope; for example, this video alone is a very good indication that Semantic Web activities are going at full speed: http://videolectures.net/w3cworkshop2012_herman_w3c_semantic/

Çağatay Çallı

July 13, 2012 at 08:59

There is also the problem of query engines implementing the SPARQL standard correctly. Most projects fall short here and are too defective to handle even the simple facilities SPARQL offers.

Aside from Apache Jena (a great project) and professional products, finding a correct tool (in your preferred programming language) to work with your own linked data still wastes a lot of effort and remains a pain.

Emre Sevinc

July 13, 2012 at 10:30

Çağatay,

Would you care to give examples of SPARQL engines that return defective results for queries? Nowadays we are using OWLIM for the CUBIST project and it proves to be a very powerful solution. I also did a project in the past using AllegroGraph (and Common Lisp) and that platform was a convenient one, too.

On the other hand, yes, Jena is like the bread and butter of Semantic Web programming, and I’m glad that people are working on various bindings for it, such as those for Scala. Nevertheless, there is still a long way to go in terms of our ability to express our designs and queries as compactly and intuitively as possible.

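	# dbpedia-owl is declared explicitly so the query does not rely on
	# prefixes predefined by the endpoint.
	PREFIX dbpedia-owl: <http://dbpedia.org/ontology/>
	PREFIX dbpedia2: <http://dbpedia.org/property/>

	SELECT ?artist
	FROM NAMED <http://live.dbpedia.org>
	WHERE {
		<http://dbpedia.org/resource/The_Shining_%28film%29> dbpedia2:starring ?artist .
		<http://dbpedia.org/resource/Hoffa> dbpedia-owl:starring ?artist .
	}
	LIMIT 10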

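	# dbpedia-owl is declared explicitly so the query does not rely on
	# prefixes predefined by the endpoint.
	PREFIX dbpedia-owl: <http://dbpedia.org/ontology/>
	PREFIX dbpedia2: <http://dbpedia.org/property/>

	SELECT ?artist
	FROM NAMED <http://live.dbpedia.org>
	WHERE {
		<http://dbpedia.org/resource/The_Killing_%28U.S._TV_series%29> dbpedia2:starring ?artist .
		<http://dbpedia.org/resource/Continuum_%28TV_series%29> dbpedia-owl:starring ?artist .
	}
	LIMIT 10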
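
For readers who would rather send such a query from a script than paste it into a web form, here is a minimal sketch using only the Python 3 standard library. It is not what the article did. The endpoint URL `http://dbpedia.org/sparql` and the helper names `common_cast_query`, `build_request`, `artists_from_results` and `run` are my own assumptions for illustration, and the query uses only `dbpedia-owl:starring` for both resources to keep the example short.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# Assumption: the public DBpedia endpoint; swap in any other SPARQL endpoint.
ENDPOINT = "http://dbpedia.org/sparql"


def common_cast_query(resource_a, resource_b, limit=10):
    """Build a query for ?artist values starring in both resources."""
    return (
        "PREFIX dbpedia-owl: <http://dbpedia.org/ontology/>\n"
        "SELECT DISTINCT ?artist WHERE {\n"
        "  <%s> dbpedia-owl:starring ?artist .\n"
        "  <%s> dbpedia-owl:starring ?artist .\n"
        "}\n"
        "LIMIT %d" % (resource_a, resource_b, int(limit))
    )


def build_request(query, endpoint=ENDPOINT):
    """Prepare a SPARQL Protocol GET request that asks for JSON results."""
    url = endpoint + "?" + urlencode({"query": query})
    return Request(url, headers={"Accept": "application/sparql-results+json"})


def artists_from_results(payload):
    """Extract the ?artist values from a SPARQL JSON results document."""
    data = json.loads(payload)
    return [row["artist"]["value"] for row in data["results"]["bindings"]]


def run(query, endpoint=ENDPOINT):
    """Send the query over the network and return the ?artist values."""
    with urlopen(build_request(query, endpoint), timeout=30) as response:
        return artists_from_results(response.read().decode("utf-8"))
```

Calling `run(common_cast_query(uri_a, uri_b))` with two resource URIs performs the actual request. Whether it returns anything useful depends entirely on how current the data behind the endpoint is, which is the point of the article.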

	from imdb import IMDb

	imdb = IMDb()

	# Look the two series up by their IMDb IDs.
	the_killing = imdb.get_movie('1637727')
	continuum = imdb.get_movie('1954347')

	# Fetch the complete cast list of The Killing.
	imdb.update(the_killing, 'full credits')

	# Fetch the episode list of Continuum and pick season 1, episode 5.
	imdb.update(continuum, 'episodes')
	continuum_episode = continuum['episodes'][1][5]
	imdb.update(continuum_episode)

	cast_of_the_killing = the_killing['cast']
	cast_of_continuum_episode = continuum_episode['cast']

	# Print every actor who appears in both cast lists.
	for actor in set(cast_of_the_killing).intersection(cast_of_continuum_episode):
		print(actor)
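
The snippet above relies on the cast objects themselves being usable in a `set`. A more explicit variant, sketched here in Python 3 as an assumption rather than as the article's method, compares people by their IMDb person ID. It assumes every cast entry exposes a `personID` attribute, as IMDbPY `Person` objects do. `common_people` and `FakePerson` are hypothetical names, and the sample data is made up purely for illustration.

```python
from collections import namedtuple


def common_people(cast_a, cast_b):
    """Return the entries of cast_a whose personID also appears in cast_b."""
    ids_b = {person.personID for person in cast_b}
    seen = set()
    shared = []
    for person in cast_a:
        # Keep the order of cast_a and skip people listed more than once.
        if person.personID in ids_b and person.personID not in seen:
            seen.add(person.personID)
            shared.append(person)
    return shared


# Hypothetical stand-in for IMDbPY Person objects, for illustration only.
FakePerson = namedtuple("FakePerson", ["personID", "name"])

cast_one = [FakePerson("001", "Person A"), FakePerson("002", "Person B")]
cast_two = [FakePerson("002", "Person B"), FakePerson("003", "Person C")]

for person in common_people(cast_one, cast_two):
    print(person.name)
```

With real data you would pass `the_killing['cast']` and `continuum_episode['cast']` instead of the made-up lists.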

FZ Blogs