Why we can’t blame tech for the exam result scandal

Jonathan Birch

26 Aug 2020

‘Like the sinking of the Titanic’. A contributor to the Times Educational Supplement did not hold back in their assessment of the UK Government’s management of this year’s academic results.

And who could blame them? The algorithm used by Ofqual to determine final grades – in lieu of students physically sitting the exams – was not only deeply flawed but regarded by many as purposely cruel.

And just as the cruise liner famously sank, so did the hearts of thousands after A-Level results day revealed that their final grades weren’t as they had hoped. Simply because a grading model dreamt up in April said so.

A staggering 40% of results were downgraded, and those from poorer backgrounds were said to be most negatively affected by the model backed by the Secretary of State.

Whilst the UK Government is working to fix this disaster, the manner in which this has all happened raises questions about how we use algorithms and equations to make such important decisions for society.

Not the first time

A recent Wired article by Matt Burgess explores these questions in detail, and highlights how the A-Level results debacle isn’t actually the first major failure of this kind for the UK public sector. Earlier this month, the Home Office dropped a “racist” visa decision algorithm that graded people on their nationalities, and the current system deciding Universal Credit has also been judged by many to be unfit for purpose.

The difference between the exam results fiasco and the other examples referenced by Burgess is that the model used by Ofqual wasn’t powered by technology but by an equation known as ‘P_kj = (1 - r_j)C_kj + r_j(C_kj + q_kj - p_kj)’. Those not already familiar with how this equation is structured could do worse than read the Guardian’s thorough breakdown of each term and why it was always doomed to fail.
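To make the equation a little less abstract, here is a minimal Python sketch of it. The article does not define the symbols, so the readings used here are assumptions based on how the model was commonly reported: C_kj as the historical share of grade k at school j, q_kj and p_kj as the shares predicted from prior attainment for this year’s and previous cohorts, and r_j as the fraction of pupils with matched prior-attainment data. The function name and the sample numbers are hypothetical and purely illustrative.

```python
def predicted_share(c_kj: float, q_kj: float, p_kj: float, r_j: float) -> float:
    """Evaluate P_kj = (1 - r_j) * C_kj + r_j * (C_kj + q_kj - p_kj).

    Assumed meanings (not defined in the article):
      c_kj: historical share of grade k at school j
      q_kj: share predicted from this cohort's prior attainment
      p_kj: share predicted from previous cohorts' prior attainment
      r_j:  fraction of pupils with matched prior-attainment data (0 to 1)
    """
    return (1 - r_j) * c_kj + r_j * (c_kj + q_kj - p_kj)


# Illustrative, made-up inputs: 20% historical share, half the cohort matched.
share = predicted_share(c_kj=0.20, q_kj=0.25, p_kj=0.22, r_j=0.5)
print(f"Predicted share of grade k: {share:.3f}")
```

Under these assumed readings, no individual pupil’s work appears anywhere in the calculation: the output depends only on the school’s past results and cohort-level prior attainment. When r_j is 0, the prediction collapses to the school’s historical share.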

What these models all have in common is that they have done a catastrophic job of making automated socio-economic choices based on historic data. But their failings do not stem from a decision to use tailored equations, artificial intelligence (AI) or machine learning. They come from the thinking applied to each of these tools.

Thinking ethically

Take AI for example. Kathy Baxter, an Architect of Ethical AI Practice at Salesforce, explained that we can only trust AI to make ethically-sound decisions when the technology has been structured and used in a truly ethical way.

One way to do this is through diversity of thought. Organisations can reduce exclusion and bias in their AI models by looking at the inherent discrimination – accidental or otherwise – sitting within the teams building and applying them. The most obvious place to start is by assessing who is being hired and promoted based on gender, age, race, and social class.

Working with more diverse teams can help prevent thinking from becoming restricted and thereby limit discriminatory bias. But striking this kind of cultural balance is incredibly difficult when there is a major shortage of skilled AI workers to choose from. Tencent research found that there are only 300,000 skilled AI engineers globally, even though millions are needed. And a recent report from the World Economic Forum found that gender diversity alone is a huge issue within the AI field – only 22% of the global AI workforce is female.

Hiring challenges

The Telegraph’s Technology Editor Robin Pagnamenta points to these challenges as contributing factors to the government’s recent algorithmic and modelling woes. He stresses that only limited expertise exists within the government, which is forcing civil servants to rely on third-party providers who may have little vested interest in the consequences of their work.

But hiring that expertise in-house is difficult when salaries for skilled workers have skyrocketed due to the skills shortage. Building a diverse team of technically skilled experts is even more challenging for the same reasons, especially when the talent pool is so restricted.

In the case of this year’s academic results, the skillsets of the people responsible for pulling together the statistical model are decidedly less important. After all, AI and machine learning technologies were not part of the process that has caused so much stress and upheaval to young people across the UK.

But when Ofqual’s chief regulator Sally Collier announced yesterday that she was stepping down in light of the chaos, it was said that “the fault was not hers alone”, and that “ministers have questions to answer over the extent to which they scrutinised and challenged the methodology and reliability of the statistical model, particularly given the enormity of the task and the importance of getting it right”.

The exact skillsets might not be crucially important. But the range of minds involved will be. And how the government and Ofqual thought up this mess is a question we would all surely like an answer to.

