[DL] [ANNOUNCE] Fact Extraction from Wikipedia Text datasets released

Marco Fossati fossati at fbk.eu
Wed Sep 2 20:29:41 CEST 2015


[Apologies if you receive this message more than once]

The Italian DBpedia chapter, on behalf of the whole DBpedia Association, 
is thrilled to announce the release of new datasets extracted from 
Wikipedia text.

This is the outcome of an outstanding Google Summer of Code 2015 
project, which implements NLP techniques to acquire structured facts 
from a textual corpus.

The approach has been tested on the soccer use case, with the Italian 
Wikipedia as input.

The datasets are publicly available at:
http://it.dbpedia.org/downloads/fact-extraction/

and loaded into the SPARQL endpoint at:
http://it.dbpedia.org/sparql
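
As a minimal sketch of how the endpoint could be queried, the snippet below assumes it accepts standard SPARQL Protocol GET requests and can return JSON results. The query is a generic probe that makes no assumption about the vocabulary used by the new datasets; the helper names build_request and run_query are hypothetical and are not part of the fact-extractor codebase.

```python
import json
import urllib.parse
import urllib.request

ENDPOINT = "http://it.dbpedia.org/sparql"

# Generic probe query: it assumes nothing about the dataset vocabulary.
QUERY = "SELECT ?s ?p ?o WHERE { ?s ?p ?o } LIMIT 5"


def build_request(endpoint, query):
    """Build a SPARQL Protocol GET request that asks for JSON results."""
    url = endpoint + "?" + urllib.parse.urlencode({"query": query})
    return urllib.request.Request(
        url, headers={"Accept": "application/sparql-results+json"}
    )


def run_query(endpoint, query, timeout=30):
    """Send the query and return the list of result bindings."""
    request = build_request(endpoint, query)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        payload = json.load(response)
    return payload["results"]["bindings"]


if __name__ == "__main__":
    # Requires network access; prints one triple per line.
    for row in run_query(ENDPOINT, QUERY):
        print(row["s"]["value"], row["p"]["value"], row["o"]["value"])
```

Replace the probe query with one that targets the properties documented in the article linked below once you know which ones you need.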

You can check out this article for more details:
http://it.dbpedia.org/2015/09/meno-chiacchiere-piu-fatti-una-marea-di-nuovi-dati-estratti-dal-testo-di-wikipedia/?lang=en

If you feel adventurous, you can fork the codebase at:
https://github.com/dbpedia/fact-extractor

For anything else, contact Marco: fossati at fbk.eu

Best regards,
Marco Fossati


