A Framework for Safe AI: Data Governance and Ecosystem Structure

  • Wei Tek Tsai*
  • , Li Zhang
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Large Language Models have become a foundational component of modern artificial intelligence, but their development is often hindered by inadequate data governance, resulting in challenges such as hallucinations, intellectual property violations, and security vulnerabilities. In light of emerging regulatory requirements, this paper presents a Collaborative Safe AI Framework (CSAIF) for building safe AI systems through robust data lifecycle management and ecosystem collaboration. This paper analyzes governance principles drawn from the U.S. Blueprint for an AI Bill of Rights and the EU AI Act, emphasizing transparency, traceability, explainability, and auditability. Existing industry practices are reviewed to identify current strengths and limitations. This paper then introduces an approach that treats data as verifiable digital assets and a Data Container architecture to encapsulate both content and governance metadata. This design enables version control, access management, data sovereignty, and usage logging across the AI model lifecycle. The CSAIF defines the responsibilities of data providers, validation entities, application developers, and regulatory actors, and outlines a process that ensures data integrity, lawful use, and accountability. By integrating technical safeguards with operational oversight, the proposed CSAIF establishes a trustworthy foundation for developing and deploying AI models in compliance with legal and ethical standards.

Original languageEnglish
Title of host publicationProceedings - 2025 IEEE International Conference on Artificial Intelligence Testing, AITest 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages11-23
Number of pages13
ISBN (Electronic)9798331589134
DOIs
StatePublished - 2025
Externally publishedYes
Event7th IEEE International Conference on Artificial Intelligence Testing, AITest 2025 - Tucson, United States
Duration: 21 Jul 202524 Jul 2025

Publication series

NameProceedings - 2025 IEEE International Conference on Artificial Intelligence Testing, AITest 2025

Conference

Conference7th IEEE International Conference on Artificial Intelligence Testing, AITest 2025
Country/TerritoryUnited States
CityTucson
Period21/07/2524/07/25

Keywords

  • blockchain
  • data governance
  • large language models
  • regulatory compliance

Fingerprint

Dive into the research topics of 'A Framework for Safe AI: Data Governance and Ecosystem Structure'. Together they form a unique fingerprint.

Cite this