Original article was published by Abhishek Dabas on Artificial Intelligence on Medium
Ethics of Artificial Intelligence
Bias & Fairness
Check out my previous blogs if you want to read more and understand topics such as what is bias, its real-world use cases, and the mitigation methods.
We will conclude with an overview of Bias & Fairness in AI with some key takeaways:
- A lot of people think that AI is some kind of magic when it is not!! It reflects the data(training data). AI system takes the examples(datasets) and learns from it. The essential thing is to know when and where to use AI models. “If there is nothing to learn from learning is impossible”.
- The best way to check the model or the system is to Test It!!! (just like we test everything else, if you need a driving license you give a driving test for that, to prove your driving skills are good enough for you to drive on the road). We trust what we have experienced before if a calculator works well and gives us the right results, then we don’t need to get into how it works but just trust it, as it never failed us!! There are many methods to do this check here.
- Ensuring that AI systems can effectively make decisions and researchers focus on these long term goals with AI by considering the impact that the technology can make towards the real-world in the near future!! Such things can increase research and standards to reduce bias in AI.
- There are still a lot of biases that we are not even aware of. More than 180 Human biases have been found. Read more here
- Bias can ever be totally removed. Even the attempt to remove bias creates bias of its own — it’s a myth to even try to achieve a bias-free world!!
- Because there are different kinds of bias and it is impossible to minimize all kinds simultaneously, this will always be a trade-off. The best approach will have to be decided on a case by case basis, by carefully considering the potential harms from using the algorithm to make decision
- Machine learning, by nature, is a form of statistical discrimination: we train machine learning models to make decisions (to discriminate between options) based on past data. I think it’s debatable!!
- The more we learn about bias in AI, the more we learn about bias in Humans, which can ultimately help us, humans, in making fair decisions.
- “In particular, we need to stop building computer systems that merely get better and better at detecting statistical patterns in data sets — often using an approach known as deep learning — and start building computer systems that from the moment of their assembly innately grasp three basic concepts: time, space and causality. Without the concepts of time, space, and causality, much of common sense is impossible.”— New York Times
- Taking this problem into consideration, we face a choice. We can stick with today’s approach to A.I. and greatly restrict what the machines are allowed to do (we end up with autonomous-vehicle crashes and machines that perpetuate bias rather than reduce it). Or we can shift our approach to A.I. in the hope of developing machines that have a rich enough conceptual understanding of the world that we need not fear about their operation.
- If the underlying data is inherently biased or doesn’t contain a diverse representation of the target groups, the AI algorithms cannot produce accurate and fair outputs.
- ML Models should reflect the data and the data should reflect the reality.
- A lot of companies are working on building Responsible AI, which shows that the future is bright for AI.
- AI Now: A research institute at New York University that examines the social implications of artificial intelligence. Bias and inclusion is one of their core themes for research.
- EU general data protection regulation contains a right to explanation, which has raised the concern for building an accountable and responsible system.
- Data really really matters, we need to know things like:
- Understanding the skews and correlations within the data
- Testing amongst multiple training and testing dataset
- combining data source from multiple sources
- specifying held out test set for hard use cases
- domain expertise to identify additional signals