
Manage Your Research Data: Data Privacy and Security

Data privacy and security are crucial in AI research, especially when working with sensitive data such as personal health information or financial records. Researchers must implement rigorous data management practices to protect information from unauthorised access, misuse, and breaches, particularly before the data is input into AI tools.

For example, in healthcare applications, managing health data and addressing the ethical implications of AI require careful attention. Researchers must obtain informed consent, implement strict data-sharing protocols, and ensure AI systems do not misuse or expose personal information.

To ensure sensitive data does not leak into AI tools, follow these best-practice measures:

Only input the minimum amount of sensitive data required for the AI tool’s operation. From an ethical standpoint, data minimisation helps prevent unnecessary exposure of personal data and reduces the risk of misuse or discrimination.
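Data minimisation can be as simple as stripping every field the AI tool does not need before a record leaves your environment. The sketch below illustrates the idea; the field names and the patient record are hypothetical examples, not part of any real dataset.

```python
# Hypothetical sketch: keep only the fields an AI tool actually needs.
REQUIRED_FIELDS = {"age_band", "diagnosis_code"}  # defined by the tool's purpose

def minimise(record: dict) -> dict:
    """Return a copy of the record containing only the required fields."""
    return {k: v for k, v in record.items() if k in REQUIRED_FIELDS}

patient = {
    "name": "Jane Citizen",             # direct identifier - never sent
    "medicare_number": "1234 56789 0",  # direct identifier - never sent
    "age_band": "40-49",
    "diagnosis_code": "E11",
}

print(minimise(patient))  # only the two required fields remain
```

Defining the allowed fields explicitly (an allow-list) is safer than listing fields to remove, because any new identifier added to the source data is excluded by default.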

Encrypt sensitive data both at rest (when stored) and in transit (when being transferred). This ensures that data remains protected from unauthorised access at all stages, including before and after being input into AI tools.
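As a sketch of encryption at rest, the snippet below uses the widely used third-party `cryptography` package (its Fernet recipe provides authenticated symmetric encryption); encryption in transit is normally handled by TLS rather than application code. The data shown is a made-up example, and in practice the key would live in a secure key store, never alongside the data.

```python
# Sketch of encrypting data at rest, assuming the third-party
# `cryptography` package is installed (pip install cryptography).
from cryptography.fernet import Fernet

key = Fernet.generate_key()  # store in a key vault, separate from the data
fernet = Fernet(key)

sensitive = b"patient_id=123, hba1c=7.2"   # hypothetical record
token = fernet.encrypt(sensitive)          # ciphertext safe to write to disk
restored = fernet.decrypt(token)           # recoverable only with the key
```

Anyone who obtains the stored `token` without the key sees only ciphertext, which is what "protected at rest" means in practice.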

Ensure that all data handling, storage, and processing practices fully comply with applicable privacy laws, including the Privacy Act 1988 (Cth) and any sector-specific regulations. This is crucial before any data is entered into AI tools.

To protect sensitive data, access to it must be restricted to authorised personnel only. Encryption helps to prevent unauthorised access or misuse. Regular security audits are essential to check compliance with access restrictions and encryption protocols.
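Restricting access to authorised personnel is often implemented as a role check backed by an audit trail. The following is a minimal sketch under assumed roles and usernames; real systems would use an identity provider rather than an in-code table.

```python
# Hypothetical role-based access check with an audit trail.
AUTHORISED_ROLES = {"data_custodian", "approved_researcher"}

USER_ROLES = {
    "alice": "approved_researcher",
    "bob": "visitor",
}

audit_log: list[str] = []  # reviewed during regular security audits

def can_access(user: str) -> bool:
    """Allow access only to users holding an authorised role, and log the attempt."""
    allowed = USER_ROLES.get(user) in AUTHORISED_ROLES
    audit_log.append(f"{user}: {'granted' if allowed else 'denied'}")
    return allowed

can_access("alice")  # granted
can_access("bob")    # denied
```

Logging every attempt, not just successes, is what makes the regular audits mentioned above meaningful.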

Before inputting data into AI models, anonymise or de-identify it to remove personally identifiable information. This ensures that even if data is exposed, it cannot be linked back to individuals.
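One common de-identification step is replacing direct identifiers with a salted one-way hash and dropping fields with no analytic purpose. Note the hedge: this is pseudonymisation, not full anonymisation, since rare attribute combinations can still re-identify people; the field names below are illustrative.

```python
import hashlib

SALT = b"project-specific-secret"  # stored separately from the data

def pseudonymise(record: dict) -> dict:
    """Replace the direct identifier with a salted hash and drop the name field."""
    out = dict(record)
    digest = hashlib.sha256(SALT + record["patient_id"].encode()).hexdigest()
    out["patient_id"] = digest[:12]  # stable token, not reversible without the salt
    del out["name"]                  # serves no analytic purpose - remove entirely
    return out

row = {"patient_id": "123", "name": "Jane Citizen", "age_band": "40-49"}
print(pseudonymise(row))
```

Because the same salt maps the same identifier to the same token, records can still be linked across files for analysis without exposing the original identifier to the AI tool.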

Ensure that the data input into AI tools is accurate, relevant, and up-to-date. Maintaining data integrity helps prevent biases and ensures that sensitive information is used correctly and responsibly in AI-driven research. Maintaining data integrity also involves documenting the source and history of the data (data provenance) to ensure its authenticity and reliability for AI-driven analysis.
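Data provenance can be recorded as a small metadata document kept alongside the dataset: a checksum to detect tampering or corruption, plus where and when the data came from. This is a sketch with illustrative field names and a made-up dataset.

```python
import hashlib
import json
from datetime import datetime, timezone

def provenance_record(name: str, data: bytes, source: str) -> dict:
    """Return a provenance entry: checksum plus origin metadata (fields are illustrative)."""
    return {
        "dataset": name,
        "source": source,
        "sha256": hashlib.sha256(data).hexdigest(),
        "recorded_at": datetime.now(timezone.utc).isoformat(),
    }

data = b"id,age_band\n1,40-49\n"  # hypothetical extract
rec = provenance_record("cohort.csv", data, "hospital_export_v2")
print(json.dumps(rec, indent=2))
```

Before the file is later fed into an AI pipeline, recomputing the SHA-256 digest and comparing it with the recorded value confirms the data has not been altered since it was documented.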