Showing results for: [ Information Retrieval and Web Search ]
The CSIRO Knowledge Network (KN) is an online platform that enables discovery and access to datasets across research, government and the innovation sector.
EcoScience Research Data Cloud - Knowledge Network development - Published 04 Oct 2018
The CSIRO 5-star Data Rating tool provides a self-assessment rating scheme against the social, technical and informational attributes of data. This tool provides implementations of the FORCE 11 FAIR d... moreata principles. The 5-star scheme aims to help users understand how mature some data or a service is. less
Operations DAMbusters - - Published 15 Jun 2018
The need to tag digital objects in various repositories such as datasets or scientific papers to allow for thematic classification is a well-known problem. In some cases a free tagging approach is use... mored where contributors can add any keywords they desire while in other cases, selection from a controlled vocabulary is mandated. To enable repository managers to allow both free tagging and the use of controlled vocabularies, we have created the Keyword Aggregator (KWA) system, key aspects of which are outlined next. A web service that: provides fast search access to vocabulary content; stores multiple, controlled, vocabularies of terms; permits the addition of new keywords. An example widget that makes use of the web service and can be embedded in a web page. Use of a relational database that captures keyword use statistics. A management methodology that: allows particular vocabularies to be selected for use on a per-widget instance basis; allows vocabulary updating through storage in versioned repositories; stores vocabularies with vocabulary-level metadata enabling vocabulary discovery; permits one vocabulary to link to a single term in another vocabulary (allows a 'vocabulary-of-vocabularies'). The focus of KWA is on the creation of a flexible keyword aggregation, management and search system for science keywords, however vocabularies of any kind could, in principle, be handled by this system. This release allows KWA to be run under Python 2 or 3 and more importantly, adds containerisation support via Docker to simplify deployment. In addition, graph loading scripts were added and various internal improvements and bug fixes were made. See the README.pdf supporting attachment for more detail.less
eReefs - Science of GBR - Semantic Tools - Published 19 Jan 2018
This software package provides a web-based search tool for users to search on content over SISSVoc endpoints.
OzNome for Water - - Published 10 Nov 2017
The following is an information retrieval test collection that contains:
* 204,855 publicly available clinical trails was crawled from ClinicalTrials.gov.
* 60 topics made up of three types: patient ... morecase descriptions, patient case summaries and assessor provided ad-hoc queries, totalling an average of 10.2 queries per topic.
* 4,000 assessor provided relevance assessment for topic, trial pairs.
Further details about the test collection can be found in the following publication:
B. Koopman and G. Zuccon. *A test collection for matching patient trials*. In Proceedings of the 39th annual international ACM SIGIR conference on research and development in information retrieval, Pisa, July 2016.less
JV Health Search - Information Retrieval Relevance Assessment - Published 04 Apr 2017
The file InformationNeeds-2014-Q02R03T04.tsv contains a set of 180 information needs or topic backstories that were authored in 2014 collectively by:
Peter Bailey, Alistair Moffat, Falk Scholer, Paul... more Thomas
The file contains:
1. a unique 3 digit, 0-padded id for each of the entries
2. a compound code that identifies the topic and source TREC collection from which the information need was derived
3. the task complexity type
4. the primary title field from the corresponding TREC source topic
5. the information need or topic backstory that we wrote for our experiments involving user variability. See our ADCS 2014, SIGIR 2015, and CIKM 2015 publications for more details.
Legacy data - Information retrieval - Published 03 Sep 2015
We are making publicly available a set of additional relevance assessments (qrels) for the TREC Medical Records Track. These are in additional to the officials qrels for this task provided by TREC org... moreanisers. The qrels are for the same set TREC query topics but are assessments for an additional 950 documents not previously judged by TREC assessors. As with TREC, the assessments were conducted by medical professionals.
Full details about how the assessments were obtained is available in Chapter 7 of:
B. Koopman. Semantic Search as Inference: Applications in Health Informatics. PhD thesis, Queensland University of Technology, Brisbane, Australia, 2014. http://koopman.id.au/papers/KoopmanPhDThesis-SemanticSearchAsInference.pdf
Or for more analysis of this assessment task see also:
B. Koopman and G. Zuccon. Why assessing relevance in medical IR is demanding. In Proceedings of the SIGIR Workshop on Medical Information Retrieval (MedIR), Gold Coast, Australia, July 2014. http://koopman.id.au/papers/medIR2014-relevance_assessment.pdfless
1057.1 HIE Research JV - n/a - Published 25 Jul 2014
A set of files containing RDF representations of the International [Chrono]stratigraphic Chart, including RDF/XML and Turtle serializations of data from the versions published in 2004, 2005, 2006, 200... more8, 2009, 2010, 2012, 2013, 2014
To accompany publication of paper
"A geologic timescale ontology and service" by SJD Cox and SM Richard
Submitted to Earth Science Informaticsless
Legacy data - AuScope - Published 15 May 2014