Survey the Use of Datatypes in RDF on the Web
RDF is a framework for the distributed representation of knowledge on the web. The Web Data Commons is an RDF representation of structured data extracted from the Common Crawl, a snapshot of the web. In RDF, values (e.g. names, prices, coordinates, …) are represented as literals, each of which has a specific datatype. We would like to survey the usage of different datatypes on the web, based on Web Data Commons.
Your task
- develop a tool to stream the Web Data Commons dataset and collect statistics on the usage of datatypes in RDF on the web
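
To give a rough idea of the task, here is a minimal Python sketch of a streaming datatype counter. It rests on assumptions that are not part of this announcement: that the data is read as gzip-compressed N-Quads files with one statement per line, and that a regular expression is good enough for a first pass (a real tool would likely use a proper RDF parser). The function names `datatype_of`, `count_datatypes` and `count_datatypes_in_gzip` are made up for illustration. Following RDF 1.1, literals without an explicit datatype are counted as `xsd:string`, and language-tagged literals as `rdf:langString`.

```python
import gzip
import re
from collections import Counter
from typing import Iterable, Optional

XSD_STRING = "http://www.w3.org/2001/XMLSchema#string"
RDF_LANGSTRING = "http://www.w3.org/1999/02/22-rdf-syntax-ns#langString"

# Assumption: one N-Triples/N-Quads statement per line.
# The pattern only matches statements whose object is a literal.
LITERAL_LINE = re.compile(
    r'^\s*(?:<[^>]*>|_:\S+)\s+<[^>]*>\s+'      # subject and predicate
    r'"(?:[^"\\]|\\.)*"'                       # quoted lexical form, escapes allowed
    r'(?:\^\^<(?P<datatype>[^>]*)>'            # explicit datatype IRI ...
    r'|@(?P<lang>[A-Za-z]+(?:-[A-Za-z0-9]+)*))?'  # ... or a language tag
    r'\s*(?:<[^>]*>|_:\S+)?\s*\.\s*$'          # optional graph label, final dot
)


def datatype_of(line: str) -> Optional[str]:
    """Return the datatype IRI of the literal object, or None if there is none."""
    match = LITERAL_LINE.match(line)
    if match is None:
        return None  # object is an IRI or blank node, or the line is malformed
    if match.group("datatype") is not None:
        return match.group("datatype")
    if match.group("lang") is not None:
        return RDF_LANGSTRING
    return XSD_STRING  # plain literal


def count_datatypes(lines: Iterable[str]) -> Counter:
    """Count datatype usage over an iterable of statement lines."""
    counts: Counter = Counter()
    for line in lines:
        datatype = datatype_of(line)
        if datatype is not None:
            counts[datatype] += 1
    return counts


def count_datatypes_in_gzip(path: str) -> Counter:
    """Stream a gzip-compressed file line by line without loading it fully."""
    with gzip.open(path, "rt", encoding="utf-8", errors="replace") as handle:
        return count_datatypes(handle)
```

Because the file is consumed line by line, memory use stays bounded by the number of distinct datatypes rather than by the size of the dataset. The resulting counters from several files can be merged with `+` before the totals are stored, for example in an SQL table.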
Your qualifications
- good programming skills (e.g. Java, Python, NodeJS, …)
- basic SQL skills
- preferably basic knowledge of RDF
Your benefits
- contribute to current research
- work on exciting technologies
- work with the structured data of the whole web
Your application
For further information, or to submit your informal application, please e-mail jan-martin.keil@uni-jena.de.