Showing results for: [ Information and Computing Sciences ]
This software package provides a web-based search tool for users to search on content over SISSVoc endpoints.
OzNome for Water - - Published 10 Nov 2017
This Software Package is a Vocabulary Viz Tool for visualising SKOS-based vocabularies deployed on SISSVoc service endpoints. This was developed collaboratively with Monash University.
A set of software tools for privacy preserving entity linkage.
- anonlink: A library for carrying out the low level hash comparisons required server side
- entity-service: Our linkage server implementation utilising the anonlink library
- clkhash: A client utility and library for turning personally identifiable information into bloom filter hashes
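To illustrate the general idea of turning personally identifiable information into bloom filter hashes, here is a minimal sketch. It is not the clkhash implementation: the bigram tokenisation, the HMAC-SHA256 keyed hashing, the 1024-bit filter size and the names `bigrams`, `bloom_encode` and `dice` are all assumptions made for illustration.

```python
import hashlib
import hmac


def bigrams(value: str) -> list[str]:
    """Split a padded, lower-cased string into overlapping character bigrams."""
    padded = f" {value.strip().lower()} "
    return [padded[i:i + 2] for i in range(len(padded) - 1)]


def bloom_encode(value: str, secret: bytes, size: int = 1024, k: int = 4) -> set[int]:
    """Return the bit positions set for `value` (hypothetical encoding, for illustration)."""
    bits = set()
    for gram in bigrams(value):
        for seed in range(k):
            # Keyed hash so that only parties holding `secret` can produce comparable filters.
            digest = hmac.new(secret, f"{seed}:{gram}".encode(), hashlib.sha256).digest()
            bits.add(int.from_bytes(digest[:8], "big") % size)
    return bits


def dice(a: set[int], b: set[int]) -> float:
    """Dice coefficient of two bit sets, a common similarity measure for such filters."""
    if not a and not b:
        return 1.0
    return 2 * len(a & b) / (len(a) + len(b))


secret = b"shared-secret"  # assumption: both data holders agree on this key beforehand
smith = bloom_encode("Smith", secret)
smyth = bloom_encode("Smyth", secret)
garcia = bloom_encode("Garcia", secret)
```

Similar names share most bigrams, so `dice(smith, smyth)` is much higher than `dice(smith, garcia)`, which is what lets a server compare records without seeing the underlying names.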
Confidential Computing - - Published 15 Nov 2017
This collection contains data used in the 2017 pilot of an Australia-wide Open Data survey conducted by the Environmental Informatics group in CSIRO L&W. It contains a list of open data collections and records (government and research), and files with lists of web endpoints queried for their size, format and date. The data can then be used with a companion software package which performs the analytics and visualisation for the pilot survey.
The majority of open data collections and records used in this survey come from sources listed in the Credit field, which were indexed in the CSIRO Knowledge Network (http://kn.csiro.au). Data from THREDDS data service sources (NCI, TPAC, AODN, CSIRO TDS) were not indexed by the CSIRO Knowledge Network but are included in this pilot survey because they represent significant parts of the open research data sector.
Files contained in this repository summarise various perspectives of the survey and use Tab-separated values (TSV) format and JSON format. These formats were chosen for ease of use in the software package for analysis and visualisation. The software used will be published as a separate entry (DOI to be added soon).
Operations DAMbusters - Digital asset metrics - Published 01 Nov 2017
A globally-optimal inlier set cardinality maximisation algorithm that jointly estimates optimal camera pose and optimal correspondences. The approach employs branch-and-bound to search the 6D space of camera poses, guaranteeing global optimality without requiring a pose prior. The geometry of SE(3) is used to find novel upper and lower bounds for the number of inliers and local optimisation is integrated to accelerate convergence.
CPS - Basic Research - Vision - 2D-3D Registration - Published 07 Nov 2017
A functional big data execution framework with dependency and data provenance management. https://github.com/tiantianwdy/hdm
Legacy data - Data61 Engineering and Design - Published 07 Nov 2017
This is a tutorial demonstration of a DCM plug-in for drawing 3D geometrical shapes, and embedding them in an HTML document.
To view this document, the Google Chrome browser is recommended on the MS-Windows system.
The native Safari browser on an iOS device or the native browser on an Android device should also be fine.
Non-destructive quality evaluation of additive-manufactured components - Data-constrained modelling - Published 01 Sep 2017
A program for generating atomistic models in real space by a combination of empirical potential energy minimization and the fitting of experimentally motivated constraints. Suitable inputs include experimental data such as diffraction data (structure factor and radial distribution functions) and computational data (bond angle, bond length, coordination distributions). Output is a model in Cartesian coordinates in xyz format that can be visualized in standard visualization packages. A manual and tutorial are provided.
MMM Research & Applications - MMM Software - Published 10 Oct 2017
A collection of images that are used for benchmarking insect classifications, with the aim of supporting reproducible research
RESEARCH - Machine Learning - Reproducible Research - Published 29 Jul 2017
These are benchmark datasets collected from public sources for reproducibility of experimental results.
The final dataset contains 25551 unique pedestrians, allowing for a dataset of over 50K images with mirroring. Additionally, TDB allows the generation of several permutations per source image in order to further bolster the training set. Large negative datasets will also be provided, although researchers training cascaded classifiers may require their own bootstrapped negative datasets. Apart from the datasets linked here the authors are willing, within reason, to produce further pedestrian datasets for the scientific community.
6066000 - AutoMap Business - Toyota Grant - Pedestrian Dataset - Published 19 Jul 2017
The Data61 / 2D3D dataset has been prepared for outdoor scene understanding and consists of a series of 2D panoramic images with corresponding 3D LIDAR point clouds. It contains 10 outdoor scenes, each of which includes a block of 3D point cloud together with several panoramic images. The number of 3D points in the scenes varies from 1 to 2 million, and each scene contains between 11 and 21 panoramic images.
6066000 - AutoMap Business - Toyota Grant - 2D3D Dataset - Published 17 Jul 2017
These models are the product of research undertaken under the 2017 APS Data Fellowship program. This collection includes trained MITIE NER models that are ready for use with a standard MITIE library classifier. A summary table of Precision, Recall and F1 performance statistics is included for future reference.
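For readers unfamiliar with the three statistics in that summary table, the sketch below shows how they are conventionally computed from entity-level counts. The function name and the example counts are hypothetical and are not taken from the collection.

```python
def precision_recall_f1(tp: int, fp: int, fn: int) -> tuple[float, float, float]:
    """Compute precision, recall and F1 from true-positive, false-positive and false-negative counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall) if (precision + recall) else 0.0
    return precision, recall, f1


# Hypothetical counts: 8 entities found correctly, 2 spurious, 2 missed.
p, r, f1 = precision_recall_f1(tp=8, fp=2, fn=2)
```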
Legacy data - NER model training - Published 02 Jun 2017
AliStat has been written to allow users to evaluate how complete their multiple sequence alignments (MSA) of nucleotides or amino acids are.
In particular, AliStat allows a user to generate four useful metrics (i.e. the completeness of an MSA (Ca), of a row in an MSA (Cr), of a column in an MSA (Cc), and of homologous sites in a sub-alignment of sequences i and j (Cij)), each of which will tell the user something useful about the distribution of missing data in an MSA.
The information thus obtained can be used to make educated decisions about how the data in the MSA should be analysed phylogenetically.
The output is a set of informative figures, tables, and sub-alignments.
In addition, AliStat produces information that meets the Minimum Reporting Standard (MRS) for MSAs, a concept that we introduce in Wong TFK, Kalyaanamoorthy S, Meusemann K, Yeates DK, Misof B, Jermiin LS. 2017. AliStat: Quality-control metrics for multiple sequence alignments. Nature Methods (in prep).
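The four metrics named above can be sketched as simple proportions. This is an illustration only, not AliStat's code: it assumes that completeness means the fraction of characters that are not missing, that Cij is the fraction of sites where both sequences i and j have data, and that `-`, `?` and `N` mark missing data in a nucleotide alignment. AliStat's own definitions and symbol handling may differ.

```python
MISSING = set("-?N")  # assumption: symbols treated as missing in a nucleotide MSA


def completeness(msa: list[str]) -> dict:
    """Return Ca, Cr, Cc and Cij for an alignment given as equal-length strings."""
    n_rows, n_cols = len(msa), len(msa[0])
    # known[i][j] is True where sequence i has a non-missing character at site j.
    known = [[ch.upper() not in MISSING for ch in row] for row in msa]
    cr = [sum(row) / n_cols for row in known]
    cc = [sum(known[i][j] for i in range(n_rows)) / n_rows for j in range(n_cols)]
    ca = sum(sum(row) for row in known) / (n_rows * n_cols)
    cij = {
        (i, j): sum(a and b for a, b in zip(known[i], known[j])) / n_cols
        for i in range(n_rows)
        for j in range(i + 1, n_rows)
    }
    return {"Ca": ca, "Cr": cr, "Cc": cc, "Cij": cij}


# Toy alignment of three sequences and four sites (made up for this example).
result = completeness(["ACGT", "A-GT", "NCG-"])
```

In the toy alignment the third sequence is only half complete, and it shares just one fully observed site with the second sequence, which is the kind of pattern these metrics are meant to expose.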
CLSD Strategic: Solving Biodiversity Cha - Bioinformatics Software - Published 02 Jun 2017
A program to simulate the evolution of DNA under the mixture models of heterogeneity across lineage and heterogeneity across sites.
Hetero was written to facilitate the development and testing of phylogenetic methods.
In particular, Hetero 2 allows the user to generate nucleotide sequences on a phylogenetic tree under complex conditions.
The complexity of the conditions is such that the evolutionary process can differ from site to site in the DNA as well as from edge to edge in the phylogenetic tree, with the evolutionary process being the variable.
The output is a set of alignments of nucleotides.
For further details, see the manual, which is included with the software.
HAL-HAS is a program package, comprising HAL-BU, HAL-TD, and HAS, that was written to allow users to infer an optimal model of evolution for a given data set and a given tree.
We introduce a family of non-stationary and non-homogeneous mixture models that approximate the rate Heterogeneity Across Lineages (HAL) and the rate Heterogeneity Across Sites (HAS) without the assumption of an underlying predefined statistical distribution.
A set of RDF graphs representing the International [Chrono]stratigraphic Chart, comprising Turtle serializations of data from the 2016 version, along with updated ontologies that define the structure of the data.
In this version the Geological Timescale Model is aligned with the W3C OWL-Time ontology https://www.w3.org/TR/owl-time/ for the temporal topology, with OGC GeoSPARQL http://www.opengeospatial.org/standards/geosparql for location data, and with the W3C SOSA/SSN ontology for samples.
Connecting Information and People Strategic 16-17 - Spatial data on the web - Published 12 May 2017
A conical-shaped pin of cold sprayed titanium was prepared using a CGT Kinetiks 4000 system. After a synchrotron X-ray micro-CT scan the titanium pin was heat treated at 850 °C for 4 hours. The second micro-CT scan was then performed. Both X-ray CT data sets were analysed using DCM for their 3D porosity distributions.
Non-destructive quality evaluation of additive-manufactured components - Data-constrained modelling - Published 10 Aug 2017
A library for the Arduino platform which provides communication with a NATS streaming server.
Smart Infrastructure Systems - Build decentralised data processing system - Published 23 Feb 2017
This workbook contains the survey data reported in: "Sanderson, Todd; Reeson, Andrew; Box, Paul. Understanding and unlocking the value of public research data: OzNome social architecture report. Canberra: CSIRO; 2017. csiro:EP168075. https://doi.org/10.4225/08/58a5e8d940904"
There are two CSIRO Data Access Portal (DAP) surveys reported in the workbook: (1) a survey of depositors to the DAP, and (2) a survey of DAP withdrawers (users). Each of these was conducted under CSIRO Social Science Human Research Ethics Committee Approval: project 055/16 “Data Access Portal – costs and benefits for depositors and users”.
(1) The survey of depositors involved semi-structured interviews with depositors of research data collections on the DAP. This group was composed entirely of researchers within CSIRO. Depositors were interviewed using guiding questions which are presented here and in Appendix A of the report. A total of 15 data depositors from a wide variety of research disciplines accepted invitations and were interviewed. Because the interviews were semi-structured, not all questions were answered by respondents; the corresponding cells have been left blank. The interviews were conducted exclusively over the phone for a period of 30 minutes, during April 2016. In order to maintain the respondents' anonymity, some responses and some sections of responses have been removed. In these instances, a "YYYYY" will appear in the cell or text to indicate redaction.
(2) The survey of withdrawers (users) of the DAP involved a structured online survey with a focus on eliciting their assessment of the value of the data collections they were using. Withdrawers were presented with questions reported here and in Appendix B of the report. The survey was administered using a Survey Monkey application, to which a link was presented in a banner on the DAP webpage inviting users to participate. Banner advertising text: “Please help us understand the value of data in the Data Access Portal - to take part in our survey please visit https://www.surveymonkey.com/r/529MT2P”. The survey captured responses from 23 users over the period October 2016 - February 2017.
OzNome Core - Data Access Portal - value of data survey - Published 22 Feb 2017
This references the open source release of the 3rd Eye Scene code library. 3rd Eye Scene is a visual debugger and debugging aid in the vein of the Robot Operating System (ROS) rviz or physics engine viewers such as Havok Visual Debugger or PhysX Visual Debugger. Whereas those tools are tightly bound to their respective SDKs, 3rd Eye Scene can be used to remotely visualise and debug any real time or non real time 3D algorithm. Conceptually, it can be thought of as a simple remote rendering application. A 3rd Eye Scene (3es) server may be embedded into any program, then 3es render commands may be interspersed throughout the program. The 3es viewer client application is then used to view, record and playback these render commands.
Legacy data - Debugging - Published 08 Dec 2016
This is one of two collection records. Please see the link below for the other collection of associated audio files.
Both collections together comprise an open clinical dataset of three sets of 101 nursing handover records, very similar to real documents in Australian English. Each record consists of a patient profile, spoken free-form text document, written free-form text document, and written structured document.
This collection contains 3 sets of text documents.
Data Set 1 for Training and Development
The data set, released in June 2014, includes the following documents:
Folder initialisation: Initialisation details for speech recognition using Dragon Medical 11.0 (i.e., i) DOCX for the written, free-form text document that originates from the Dragon software release and ii) WMA for the spoken, free-form text document by the RN)
Folder 100profiles: 100 patient profiles (DOCX)
Folder 101writtenfreetextreports: 101 written, free-form text documents (TXT)
Folder 100x6speechrecognised: 100 speech-recognized, written, free-form text documents for six Dragon vocabularies (TXT)
Folder 101informationextraction: 101 written, structured documents for information extraction that include i) the reference standard text, ii) features used by our best system, iii) form categories with respect to the reference standard and iv) form categories with respect to our best information extraction system (TXT in CRF++ format).
An Independent Data Set 2
The aforementioned data set was supplemented in April 2015 with an independent set that was used as a test set in the CLEFeHealth 2015 Task 1a on clinical speech recognition and can be used as a validation set in the CLEFeHealth 2016 Task 1 on handover information extraction. Hence, when using this set, please avoid its repeated use in evaluation – we do not wish to overfit to these data sets.
The set released in April 2015 consists of 100 patient profiles (DOCX), 100 written, and 100 speech-recognized, written, free-form text documents for the Dragon vocabulary of Nursing (TXT). The set released in November 2015 consists of the respective 100 written free-form text documents (TXT) and 100 written, structured documents for information extraction.
An Independent Data Set 3
For evaluation purposes, the aforementioned data sets were supplemented in April 2016 with an independent set of another 100 synthetic cases.
Legacy data - Generation of synthetic nursing handover data set - Published 21 Mar 2017
This is one of two collection records. Please see the link below for the other collection of associated text files.
The two collections together comprise an open clinical dataset of three sets of 10 nursing handover records, very similar to real documents in Australian English. Each record consists of a patient profile, spoken free-form text document, written free-form text document, and written structured document.
This collection contains 3 X 100 spoken free-form audio files in WAV format.
Lansdown pasture photograph series - nadir-pointing cameras on two separate 2.8m x 2.0m plots.
Series of digital photographs of pasture captured by two digital cameras 2.5 m above the ground in a downward-pointing position. Camera field of view approximately 2.8m x 2.0m at ground level. Images captured at 30 minute intervals from 2011-09-07 to 2013-08-10.
Cameras were Pentax Optio WG-1 digital cameras (13.8 megapixels). Images saved in JPG format.
Node 1081 (fenced): Lon. 146.85058, Lat. -19.66143
Node 1080 (unfenced): Lon. 146.85090, Lat. -19.66113
Distance between nodes approx. 50m.
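The stated separation can be checked from the two node coordinates with the haversine formula. The sketch below assumes a spherical Earth with a mean radius of 6,371 km; the function name `haversine_m` is hypothetical.

```python
import math


def haversine_m(lat1: float, lon1: float, lat2: float, lon2: float,
                radius_m: float = 6_371_000.0) -> float:
    """Great-circle distance in metres between two points given in decimal degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * radius_m * math.asin(math.sqrt(a))


# Coordinates as listed above, written as (latitude, longitude).
node_1081 = (-19.66143, 146.85058)  # fenced
node_1080 = (-19.66113, 146.85090)  # unfenced
distance = haversine_m(*node_1081, *node_1080)
```

Under these assumptions the result is a little under 50 m, consistent with the approximate figure given in the record.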
CLOSED-Application real-time sensor net - A pilot project combining multispectral proximal sensors and digital cameras for monitoring tropical pastures - Published 08 Sep 2016
A set of machine readable resources that are part of the output of a project undertaken for the Commonwealth Department of Environment, under the Essential Environmental Measures program. These represent an information model for vegetation sites data, comprising
- a UML model in a .EAP file (database for use with Sparx Systems Enterprise Architect)
- an OWL Ontology implementation
- RDF files (TTL format) of some example data
- XML files of the same example data
LWF-Water Resource Management-Consultanc - Information model for vegetation sites data - Published 28 Jul 2016