Secondary analysis of existing data is common practice in many academic fields. Existing data can be used to conduct new research, to test hypotheses or replicate findings from previous studies.
Existing data can complement your own data collection and provide rich enhancements including demographic, temporal or geospatial layers. Using existing data can save time and effort in the data collection process.
Researchers can access existing clinical trials data for health research, including evidence synthesis, secondary analysis, reproducibility, replication and validation studies, or education and methods development.
Australia's Productivity Commission report Data Availability and Use highlights the benefits of existing data for research, policy, decision making and innovation. Australian Research Data Commons (ARDC) provide case studies and examples on the many uses of existing data for research. The datasets available via this guide are good examples of F.A.I.R data, as they are findable, accessible, interoperable and reusable.
Use the discipline tabs to find existing data available publicly or to researchers by application and approval. Consider browsing outside your discipline as datasets from different scientific domains can be of value to diverse research projects.
For information on sources and tools for textual data, refer to the Text mining and analysis library guide.
For guidance on respectful and ethical access and use of data from research relating to First Nations peoples, refer to the Aboriginal and Torres Strait Islanders Research guide.
Assessing secondary data is much like evaluating the quality of a research paper. Consider factors that relate to the reliability and validity of research results, such as whether:
A data reuser is often not familiar with the secondary data. Take some time to:
More guidance on assessing existing data for secondary research is available from Rosinger & Ice (2019).
For advice on assessing data from research relating to First Nations peoples, including factors such as bias and diversity and complexity vs stereotypes and generalisation, refer to the Aboriginal and Torres Strait Islanders Research guide.
Published research data is cited in the same way as other scholarly outputs, with variation in styles and formats.
Griffith University Library Referencing Guides illustrates how to cite data according to common styles.
The same professional and ethical treatment is required for research that uses existing or new data. See Griffith University's Responsible Conduct of Research Policy for further details. In many cases access to external data is by application and reviewed by the custodian organization, who may assess the validity of the research project proposal and ethics approvals. Explore The Data Science Ethos tool for ways to apply an ethics centred approach to research projects.
Terms and conditions will be placed on most external datasets which may include, use for research purposes only, statistical training or user competency, strict methods of de-identification, secure storage and destruction, attribution, and submission of findings or publications for review or approval.
Whilst raw data is not protected under Australian copyright law, the presentation of data, including images, tables, or database structures can still be protected by copyright and require expressed permission for publishing replications. See the Australian Bureau of Statistics (ABS) copyright example.
Some data may be available under Creative Commons or other open licenses, providing clear guidance how the data can be used and attributed. Most data from data.qld.gov.au is available under an open license.
Access to existing data can also be provided via contractual agreements, seek advice from the Office for Research on appropriate or standard terms and conditions.
Training, specialist advice and support
Griffith University digital skills self-paced tutorials
See the Collect and analyse tab under Workshops for self-paced tutorials on LimeSurvey, REDCap at Griffith, Introduction to data wrangling with OpenRefine, and Advanced data wrangling with OpenRefine.
Griffith University data workshops for researchers via RED
Training includes, managing data, data cleaning and processing, data analysis, visualisation methods and tools, survey tools and more.
eResearch Services at Griffith University
Specialist IT services for researchers including high performance computing, research data storage, data collection tools, and programming workshops.
Hacky Hour
Get online help from eResearch & Library staff with OpenRefine, R, Python, SQL and Bash coding. Practice your new programming skills in a supportive environment. Learn about HPC or virtual machines, what is available and how to use them, and catch up with other researchers learning to wrangle their data. Access the online sessions each Thursday via the calendar.
External training
Programming Historian - novice-friendly, peer-reviewed online tutorials that help humanists learn a wide range of digital tools, techniques, and workflows to facilitate research and teaching. Available in English, Spanish and French.
Ebooks