Now showing items 1-10 of 85
Validation of expressive XML keys with XML schema and XQuery
(ACS, 2015)
The eXtensible Markup Language (XML) is the de facto industry standard for exchanging data on the Web and elsewhere. While the relational model of data enjoys a well-accepted definition of a key, several competing notions ...
Random Indexing Explained with High Probability
(2015)
Random indexing (RI) is an incremental method for constructing a vector space model (VSM) with a reduced dimensionality. Previously, the method has been justified using the mathematical framework of Kanerva's sparse ...
On a Linked Data platform for Irish historical vital records
(Springer, 2015)
The Irish Record Linkage 1864-1913 is a multi-disciplinary project aiming to create a platform for analyzing events captured in historical birth, marriage and death records by applying semantic technologies for annotating, ...
SemEval-2016 Task 13: Taxonomy Extraction Evaluation (TExEval-2)
(Insight Centre for Data Analytics, 2016-06-16)
This paper describes the second edition of the shared task on Taxonomy Extraction Evaluation organised as part of SemEval 2016. This task aims to extract hypernym-hyponym relations between a given list of domain-specific ...
Community topic usage in social networks
(ACM, 2015-10)
When studying large social media data sets, it is useful to reduce the dimensionality of both the network (e.g. by finding communities) and user-generated data such as text (e.g. using topic models). Algorithms exist for ...
The ACL RD-TEC: A Dataset for Benchmarking Terminology Extraction and Classification in Computational Linguistics
(2014)
This paper introduces ACL RD-TEC: a dataset for evaluating the extraction and classification of terms from literature in the domain of computational linguistics. The dataset is derived from the Association for Computational ...
Semantic Tagging of Places Based on User Interest Profiles from Online Social Networks
(2013)
In recent years, location-based services (LBS) on mobile devices have become very popular. With the growing number of smartphone users, the demand for services that can provide recommendations of places based on ...
GenomeSnip: Fragmenting the Genomic Wheel to augment discovery in cancer research
(2014)
Cancer genomics researchers have greatly benefited from high-throughput technologies for the characterization of genomic alterations in patients. These voluminous genomics datasets when supplemented with the appropriate ...
On learnability of constraints from RDF data
(Springer International Publishing, 2016-05-14)
RDF is structured, dynamic, and schemaless data, which enables a great deal of flexibility for Linked Data to be available in an open environment such as the Web. However, for RDF data, flexibility turns out to be the source ...
Discovering Domain-Specific Public SPARQL Endpoints: A Life-Sciences Use-Case
(2014)
A significant portion of the LOD cloud consists of Life Sciences data sets. The LOD cloud contains billions of clinical facts linked together forming an interlinked Web of Clinical Data. However, tools for new publishers ...