This project has retired. For details please refer to its Attic page.
Apache Stanbol - The Named Entity Tagging Engine: linking text annotations to (external) datasets of entities

The Named Entity Tagging Engine: linking text annotations to (external) datasets of entities

The Entity Linking Engine uses Referenced Sites to search for Entities based on given Text Annotations.

Configuration

The configuration decides, which dataset you want to use as linking target. The default value is "local" referencing to the default DBpedia index. You may also decide on whether given types should restrict the set of possible links. E.g. for DBpedia, some organisations are not tagged as such, therefore, you want get them with this engine although, you expect them from your dataset.

Example Result

For the sentence "The Stanbol enhancer can detect famous cities such as Paris and people such as Bob Marley.", you will get several EntityAnnotations for the terms "Paris" and "Bob Marley" from your linking target resource (in this case DBpedia) together with a confidence value, which can be used to sort the suggestions, e.g.:

{
  "@subject": "urn:enhancement-b98283ae-845d-6666-d68b-f649852a7959",
  "@type": ["enhancer:Enhancement","enhancer:EntityAnnotation"],
  "dc:created": "2012-02-29T11:34:56.383Z",
  "dc:creator": "org.apache.stanbol.enhancer.engines.entitytagging.impl.NamedEntityTaggingEngine",
  "dc:relation": "urn:enhancement-b3d4617d-1760-0374-f471-e0e746003f4e",
  "enhancer:confidence": 16641.191,
  "enhancer:entity-label": 
     {
       "@literal": "Bob Marley",
       "@language": "en"
     },
   "enhancer:entity-reference": "http://dbpedia.org/resource/Bob_Marley",
   "enhancer:entity-type": 
      ["dbp-ont:MusicalArtist", "foaf:Person", "dbp-ont:Artist",
        "dbp-ont:Person", "owl:Thing"],
   "enhancer:extracted-from": "urn:content-item-sha1-37c8a8244041cf6113d4ee04b3a04d0a014f6e10"
  },

or

{
  "@subject": "urn:enhancement-785a4c4f-dc7d-aa46-91a2-aef840542ae2",
  "@type": ["enhancer:Enhancement","enhancer:EntityAnnotation"],
  "dc:created": "2012-02-29T11:34:56.383Z",
  "dc:creator": "org.apache.stanbol.enhancer.engines.entitytagging.impl.NamedEntityTaggingEngine",
  "dc:relation": "urn:enhancement-c176f1bf-a1dd-830e-df7d-deecdfdc8375",
  "enhancer:confidence": 1323049.5,
  "enhancer:entity-label": 
     {
       "@literal": "Paris",
       "@language": "en"
     },
   "enhancer:entity-reference": "http://dbpedia.org/resource/Paris",
   "enhancer:entity-type":
      ["dbp-ont:PopulatedPlace","dbp-ont:Settlement",
      "http://www.opengis.net/gml/_Feature",
      "dbp-ont:Place","owl:Thing"],
   "enhancer:extracted-from": "urn:content-item-sha1-37c8a8244041cf6113d4ee04b3a04d0a014f6e10"
 }