Question of the Week: Kadaster!

From The QA Company
Revision as of 11:04, 6 March 2023 by 172.24.0.1 (talk) (Blog post for The QA Company's website)

Today, for the “question of the week”, I'm going to describe a use case of question answering technology: querying the Dutch Kadaster, i.e. the land registry that records real estate and real property in the Netherlands. We were asked to build a question answering system on top of this data, and I'm going to present the results.

Some context: Kadaster publishes several of its datasets (e.g. topography, addresses and buildings) as a Knowledge Graph, using Triply. It is available at https://data.labs.kadaster.nl/kadaster/kg/. The graph is fairly large, containing a bit more than 800 million triples. Most of the information is encoded in Dutch, using several vocabularies, including schema.org properties. It mainly contains information about cities, their boundaries, their populations, the real estate they contain, addresses and points of interest (those that are relevant for the Kadaster). This data is not indexed by search engines like Google or digital assistants like Siri.
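For readers who have not worked with triples before, here is a minimal sketch in Python of what "a graph made of triples" means and how a simple question can be turned into a lookup over them. Everything in it is an assumption made for illustration: the `ex:` identifiers, the `ex:population` property, the city name and the number are invented and are not taken from the Kadaster graph, and the helper functions `match` and `population_of` are hypothetical. It is not how QAnswer works internally; it only shows the shape of the data.

```python
from typing import Iterator, Optional

# A triple is (subject, predicate, object).
Triple = tuple[str, str, str]

# Toy data, invented for illustration only (NOT from the Kadaster graph).
# "ex:" identifiers and "ex:population" are hypothetical placeholders.
TRIPLES: list[Triple] = [
    ("ex:city/1", "rdf:type", "schema:City"),
    ("ex:city/1", "rdfs:label", "Voorbeeldstad"),
    ("ex:city/1", "ex:population", "12345"),
    ("ex:building/7", "rdf:type", "schema:Place"),
    ("ex:building/7", "schema:containedInPlace", "ex:city/1"),
]


def match(
    s: Optional[str] = None,
    p: Optional[str] = None,
    o: Optional[str] = None,
) -> Iterator[Triple]:
    """Yield every triple that fits the pattern; None acts as a wildcard."""
    for triple in TRIPLES:
        if (
            (s is None or triple[0] == s)
            and (p is None or triple[1] == p)
            and (o is None or triple[2] == o)
        ):
            yield triple


def population_of(label: str) -> Optional[str]:
    """Answer 'what is the population of <label>?' with two pattern lookups."""
    # Step 1: find the entity that carries this label.
    for subject, _, _ in match(p="rdfs:label", o=label):
        # Step 2: read the (hypothetical) population property of that entity.
        for _, _, value in match(s=subject, p="ex:population"):
            return value
    return None


if __name__ == "__main__":
    print(population_of("Voorbeeldstad"))
```

A real graph of this size would of course be queried through a triple store rather than a Python list, but the idea is the same: a question becomes one or more triple patterns with some positions left open.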

We indexed the dataset as is and trained QAnswer on top of it. We were able to do this even though we do not speak Dutch ; ). These are some of the questions we could answer on top of the dataset:


Our conclusion was that this looks a bit like Google Maps, just built on open government data. We enjoyed this contract and working on this geographical dataset. Also, our Dutch got slightly better ; )

That's it for today!


See you next week!

The QA Company