[DL] [ANNOUNCE] Fact Extraction from Wikipedia Text datasets released
Marco Fossati
fossati at fbk.eu
Wed Sep 2 20:29:41 CEST 2015
[Begging pardon if you read this multiple times]
The Italian DBpedia chapter, on behalf of the whole DBpedia Association,
is thrilled to announce the release of new datasets extracted from
Wikipedia text.
This is the outcome of an outstanding Google Summer of Code 2015
project, which implements NLP techniques to acquire structured facts
from a textual corpus.
The approach has been tested on the soccer use case, with the Italian
Wikipedia as input.
The datasets are publicly available at:
http://it.dbpedia.org/downloads/fact-extraction/
and loaded into the SPARQL endpoint at:
http://it.dbpedia.org/sparql
You can check out this article for more details:
http://it.dbpedia.org/2015/09/meno-chiacchiere-piu-fatti-una-marea-di-nuovi-dati-estratti-dal-testo-di-wikipedia/?lang=en
If you feel adventurous, you can fork the codebase at:
https://github.com/dbpedia/fact-extractor
Get in touch with Marco at fossati at fbk.eu for everything else.
Best regards,
Marco Fossati
More information about the dl
mailing list