APAC CIOOutlook

Advertise

with us

  • Technologies
      • Artificial Intelligence
      • Big Data
      • Blockchain
      • Cloud
      • Digital Transformation
      • Internet of Things
      • Low Code No Code
      • MarTech
      • Mobile Application
      • Security
      • Software Testing
      • Wireless
  • Industries
      • E-Commerce
      • Education
      • Logistics
      • Retail
      • Supply Chain
      • Travel and Hospitality
  • Platforms
      • Microsoft
      • Salesforce
      • SAP
  • Solutions
      • Business Intelligence
      • Cognitive
      • Contact Center
      • CRM
      • Cyber Security
      • Data Center
      • Gamification
      • Procurement
      • Smart City
      • Workflow
  • Home
  • CXO Insights
  • CIO Views
  • Vendors
  • News
  • Conferences
  • Whitepapers
  • Newsletter
  • Awards
Apac
  • Artificial Intelligence

    Big Data

    Blockchain

    Cloud

    Digital Transformation

    Internet of Things

    Low Code No Code

    MarTech

    Mobile Application

    Security

    Software Testing

    Wireless

  • E-Commerce

    Education

    Logistics

    Retail

    Supply Chain

    Travel and Hospitality

  • Microsoft

    Salesforce

    SAP

  • Business Intelligence

    Cognitive

    Contact Center

    CRM

    Cyber Security

    Data Center

    Gamification

    Procurement

    Smart City

    Workflow

Menu
    • Data Center
    • Cyber Security
    • Hotel Management
    • Workflow
    • E-Commerce
    • Business Intelligence
    • MORE
    #

    Apac CIOOutlook Weekly Brief

    ×

    Be first to read the latest tech news, Industry Leader's Insights, and CIO interviews of medium and large enterprises exclusively from Apac CIOOutlook

    Subscribe

    loading

    THANK YOU FOR SUBSCRIBING

    • Home
    • Data Center
    Editor's Pick (1 - 4 of 8)
    left
    Hybrid IT - The New Reality

    Ramesh Munamarty, Group CIO, International SOS

    Faster, Safer Data Hosting Starts with a Map

    Mark Bauer, Managing Director & Co-Lead Of The Data Center Solutions, JLL

    Universal Digital Identity-How to Get it Right?

    Dr. Michael Gorriz, Group CIO, Standard Chartered Bank

    Data Center of the Future

    Ari Bose, CIO, Brocade

    Centricity of Data Science in the IT World

    Harpreet Kaintel, CIO, ZenithOptimedia Group

    Encryption-is it enough?

    Jerry Irvine, EVP, CIO, Prescient Solutions

    The Data Experience Revolution: Moving Beyond Access to Action

    Mazen Kassis, Head of Data & Analytics, Foodstuffs North Island

    Harvesting The Future: The Transformative Impact Of Ai On Agriculture

    Jeremy Groeteke, Global Head Of It & Digital Strategy, Vegetables & Flowers, Computational Agronomy, Syngenta Group

    right

    Ethical AI with Synthetic Data

    Sachin Tonk, Director Data , Advanced Analytics and Privacy, Standard Chartered Bank

    Tweet
    content-image

    Sachin Tonk, Director Data , Advanced Analytics and Privacy, Standard Chartered Bank

    Recent advancements in Big Data have encouraged enterprises to invest in data strategies to fully expand into the operationalization of ethical AI principles, policies, and frameworks. For example, Netflix saves $1 billion annually by using Big Data. This is evident by the amount of investments organizations are willing to make in the data domain, as it is heralded as the new currency of the digital age. However, real data is often biased, which means that there is an unexpected or unwanted correlation among different data features. This is because collecting real data is an uncontrolled process that naturally encodes biases present in the real world. The problem is, biased data can have serious implications for enterprises.

    To elaborate the above claim, let us look at an example. Enterprises use fraud detection solutions that heavily rely on transaction patterns, geo-location, and demographics data to identify unusual behaviours or suspicious activities. This may potentially lead to unintended as well as unknown data biases such as a particular demographic group being flagged out for no apparent reason. If we ask ourselves, in this situation, how can the AI / ML models, their testing and data handling process be made fair and ethical to ensure that there are no biases present in a dataset against a particular group of individuals while protecting against potential fraud, is an open question.

    Biased data can cause inaccurate AI / ML models to be trained and deployed, which can perpetuate discriminations. Regulations around the world are starting to get stricter on AI, especially those that could potentially harm the fundamental rights of citizens such as being non-discriminatory. In April 2021, The European Commission (EC) proposed a brand-new regulatory framework to protect EU citizens from AI because of its huge transparency issues. This has widespread implications for businesses globally. Concretely, it affects foreign companies selling AI-based services in the EU and also, any providers of AI systems that are located in a non-EU country but their infrastructure is hosted in the EU such as cloud services.

    Regulations around the world are starting to get stricter on AI, especially those that could potentially harm the fundamental rights of citizens such as being non-discriminatory

    Any non-compliance can result in heavy penalties of up to 6 percent of the annual revenue of a company.

    Data used to be a data science problem, but it is now becoming a compliance problem as well. With the release of Ethical Guidelines for Trustworthy AI by the EU and Fairness, Ethics, Accountability and Transparency (FEAT) principles by the Monetary Authority of Singapore (MAS), AI models and consequently, the data they are trained on, must be fair and ethical. This poses a question, if real data is biased and its acquisition process cannot be controlled, then what is the alternative? The answer to the above question is AI powered synthetic data to give data owners the option to control the generation process itself. This means that biases present in the real data can be removed with synthetic data. How? Let us find out.

    If an ML model is trained on a dataset that contains 70 percent males and 30 percent females, it will be biased towards males because of the skewed distribution. This introduces AI transparency issues that is a huge debate in the financial industry as well as the extended research community. This can be effectively rectified by giving enterprises the ability to generate a dataset that contains balanced distributions, i.e., 50 percent males and 50 percent females from a dataset that had an imbalanced male-to-female ratio. This is possible with AI generative models that operate on data types, so the scope of bias correction is much broader. As such, it can be applied among any number of data features, giving organizations the ability to be fair, ethical, and inclusive with their data practices. Extending this example, it can be applied to literally any data feature that is present in real data that can include gender, underrepresented regions, and new users with no background credit history, salary levels, and different age bands.

    Data bias is now becoming one of biggest challenges for enterprises and a barrier towards mass AI adoption. It may be introduced anywhere within AI / ML model pipelines, from data acquisition to modelling and how model outputs are used for actionable insights and decisions. Even these issues are somehow magically fixed, the inherent challenge is still left; real datasets are not fair because the real world is not fair. Therefore, the only feasible way to train AI / ML models at scale for the world we want to live in, is via AI powered data synthesis to create synthetic datasets that are fair in terms of legally protected qualities and other important dimensions/features. AI fairness is a relatively new topic, but synthetic data will be the most important tool for avoiding damaging trends in the actual world. Indeed, there are early indications that synthetic data can be very useful while maintaining AI / ML metrics that match the ones organizations will get with real data.

    tag

    Big Data

    Fraud

    Financial

    Weekly Brief

    loading
    Data Center Cooling Solution Company of the Year in APAC - 2025
    ON THE DECK

    Data Center 2024

    Previous Next

    I agree We use cookies on this website to enhance your user experience. By clicking any link on this page you are giving your consent for us to set cookies. More info

    Read Also

    Loading...
    Copyright © 2025 APAC CIOOutlook. All rights reserved. Registration on or use of this site constitutes acceptance of our Terms of Use and Privacy and Anti Spam Policy 

    Home |  CXO Insights |   Whitepapers |   Subscribe |   Conferences |   Sitemaps |   About us |   Advertise with us |   Editorial Policy |   Feedback Policy |  

    follow on linkedinfollow on twitter follow on rss
    This content is copyright protected

    However, if you would like to share the information in this article, you may use the link below:

    https://data-center.apacciooutlook.com/views/ethical-ai-with-synthetic-data-nwid-8501.html