Browsing Insight Centre for Data Analytics (Conference Papers) by Title
Now showing items 1-20 of 79
-
The ACL RD-TEC: A Dataset for Benchmarking Terminology Extraction and Classification in Computational Linguistics
(2014)This paper introduces ACL RD-TEC: a dataset for evaluating the extraction and classification of terms from literature in the domain of computational linguistics. The dataset is derived from the Association for Computational ... -
Analysing and improving embedded markup of learning resources on the web
(ACM, 2017-04-03)Web-scale reuse and interoperability of learning resources have been major concerns for the technology-enhanced learning community. While work in this area traditionally focused on learning resource metadata, provided ... -
Analyzing Social Behavior of Software Developers Across Different Communication Channels
(2013)Software developers use different project repositories (i.e., mailing list, bug tracking repositories, discussion forums etc.) to interact with each other or to solve software related problems. The growing interest in the ... -
Benchmarking Domain-Specific Expert Search Using Workshop Program Committees
(ACM, 2013)Traditionally, relevance assessments for expert search have been gathered through self-assessment or based on the opinions of co-workers. We introduce three benchmark datasets1 for expert search that use conference workshops ... -
Challenges with image event processing: Poster
(ACM, 2017-06-19)There has been substantial research in the area of event processing where systems are focused on event processing of structured data. However, in the context of smart cities, signi cant number of realtime applications ... -
Classifying sentential modality in legal language: A use case in financial regulations, acts and directives
(ACM, 2017-06-12)Texts expressed in legal language are often di cult and time consuming for lawyers to read through, particularly for the purpose of identifying relevant deontic modalities (obligations, prohibitions and permissions). ... -
Community topic usage in social networks
(ACM, 2015-10)When studying large social media data sets, it is useful to reduce the dimensionality of both the network (e.g. by finding communities) and user-generated data such as text (e.g. using topic models). Algorithms exist for ... -
DAW: Duplicate-AWare Federated Query Processing over the Web of Data
(Springer, 2013)Over the last years the Web of Data has developed into a large compendium of interlinked data sets from multiple domains. Due to the decentralised architecture of this compendium, several of these datasets contain duplicated ... -
Discovering Domain-Specific Public SPARQL Endpoints: A Life-Sciences Use-Case
(2014)A significant portion of the LOD cloud consists of Life Sciences data sets. The LOD cloud contains billions of clinical facts linked together forming an interlinked Web of Clinical Data . However, tools for new publishers ... -
Domain-independent term extraction through domain modelling
(10th International Conference on Terminology and Artificial Intelligence, 2013-09-11)Extracting general or intermediate level terms is a relevant problem that has not received much attention in literature. Current approaches for term extraction rely on contrastive corpora to identify domain-specific terms, ... -
Entity Linking with Multiple Knowledge Bases: An Ontology Modularization Approach
(Springer, 2014-10-19)The recognition of entities in text is the basis for a series of applications. Synonymy and Ambiguity are among the biggest challenges in identifying such entities. Both challenges are addressed by Entity Linking, the task ... -
Event Analysis in Social Media Using Clustering of Heterogeneous Information Networks
(The 28th International FLAIRS Conference (AAAI Publications) (AAAI), 2015)In this paper, we propose a novel approach for social media event finding in order to support fast access to information that users find relevant. While there are many approaches related to this problem, they mainly focus ... -
Financial Industry Ontologies for Risk and Regulation Data (FIORD): a position paper
(2013)This paper presents a proposed approach to address risk andregulation management within the highly active and volatile financial domainby employing semantic based technologies within a collaborative networksenvironment. ... -
A Formal Investigation of Semantic Interoperability of HCLS Systems
(IGI Global, 2013)Semantic interoperability facilitates Health Care and Life Sciences (HCLS) systems in connecting stakeholders (e.g., patient, physician, pharmacy) at various levels as well as ensure seamless use of healthcare resources ... -
Fostering Serendipity through Big Linked Data
(2013)The amount of bio-medical data available over the Web grows exponentially with time. The large volume of the currently available data makes it difficult to explore, while the velocity at which this data changesand the ... -
GenomeSnip: Fragmenting the Genomic Wheel to augment discovery in cancer research
(2014)Cancer genomics researchers have greatly benefited from high-throughput technologies for the characterization of genomic alterations in patients. These voluminous genomics datasets when supplemented with the appropriate ... -
Grand challenge: Automatic anomaly detection over sliding windows
(Association for Computing Machinery ACM, 2017-06-19)With the advances in the Internet of Things and rapid generation of vast amounts of data, there is an ever growing need for leveraging and evaluating event-based systems as a basis for building realtime data analytics ... -
Hot Topics and Schisms in NLP: Community and Trend Analysis with Saffron on ACL and LREC Proceedings
(2014)In this paper we present a comparative analysis of two series of conferences in the field of Computational Linguistics, the LREC conference and the ACL conference. Conference proceedings were ...
