• Investing
  • Stock
  • Editor’s Pick
  • Economy
The Significant Deals
Stock

Exclusive-EU AI Act checker reveals Big Tech’s compliance pitfalls

by October 16, 2024
written by October 16, 2024

By Martin Coulter

LONDON (Reuters) – Some of the most prominent artificial intelligence models are falling short of European regulations in key areas such as cybersecurity resilience and discriminatory output, according to data seen by Reuters. 

The EU had long debated new AI regulations before OpenAI released ChatGPT to the public in late 2022. The record-breaking popularity and ensuing public debate over the supposed existential risks of such models spurred lawmakers to draw up specific rules around “general-purpose” AIs (GPAI).

Now a new tool designed by Swiss startup LatticeFlow and partners, and supported by European Union officials, has tested generative AI models developed by big tech companies like Meta (NASDAQ:META) and OpenAI across dozens of categories in line with the bloc’s wide-sweeping AI Act, which is coming into effect in stages over the next two years. 

Awarding each model a score between 0 and 1, a leaderboard published by LatticeFlow on Wednesday showed models developed by Alibaba (NYSE:BABA), Anthropic, OpenAI, Meta and Mistral all received average scores of 0.75 or above. 

However, the company’s “Large Language Model (LLM) Checker” uncovered some models’ shortcomings in key areas, spotlighting where companies may need to divert resources in order to ensure compliance. 

Companies failing to comply with the AI Act will face fines of 35 million euros ($38 million) or 7% of global annual turnover.

MIXED RESULTS 

At present, the EU is still trying to establish how the AI Act’s rules around generative AI tools like ChatGPT will be enforced, convening experts to craft a code of practice governing the technology by spring 2025. 

But LatticeFlow’s test, developed in collaboration with researchers at Swiss university ETH Zurich and Bulgarian research institute INSAIT, offers an early indicator of specific areas where tech companies risk falling short of the law. 

For example, discriminatory output has been a persistent issue in the development of generative AI models, reflecting human biases around gender, race and other areas when prompted. 

When testing for discriminatory output, LatticeFlow’s LLM Checker gave OpenAI’s “GPT-3.5 Turbo” a relatively low score of 0.46. For the same category, Alibaba Cloud’s “Qwen1.5 72B Chat” model received only a 0.37.  

Testing for “prompt hijacking”, a type of cyberattack in which hackers disguise a malicious prompt as legitimate to extract sensitive information, the LLM Checker awarded Meta’s “Llama 2 13B Chat” model a score of 0.42. In the same category, French startup Mistral’s “8x7B Instruct” model received 0.38. 

“Claude 3 Opus”, a model developed by Google-backed Anthropic, received the highest average score, 0.89. 

The test was designed in line with the text of the AI Act, and will be extended to encompass further enforcement measures as they are introduced. LatticeFlow said the LLM Checker would be freely available for developers to test their models’ compliance online. 

Petar Tsankov, the firm’s CEO and cofounder, told Reuters the test results were positive overall and offered companies a roadmap for them to fine-tune their models in line with the AI Act. 

“The EU is still working out all the compliance benchmarks, but we can already see some gaps in the models,” he said. “With a greater focus on optimising for compliance, we believe model providers can be well-prepared to meet regulatory requirements.”

Meta declined to comment. Alibaba, Anthropic, Mistral, and OpenAI did not immediately respond to requests for comment. 

While the European Commission cannot verify external tools, the body has been informed throughout the LLM Checker’s development and described it as a “first step” in putting the new laws into action.

A spokesperson for the European Commission said: “The Commission welcomes this study and AI model evaluation platform as a first step in translating the EU AI Act into technical requirements.”

($1 = 0.9173 euros)

This post appeared first on investing.com
0 comment
0
FacebookTwitterPinterestEmail

previous post
Australia stocks lower at close of trade; S&P/ASX 200 down 0.41%
next post
Chip stocks drag Asian markets lower; China shares volatile

You may also like

BASF results down on impairments, restructuring

January 27, 2025

European chipmakers slump as traders gauge DeepSeek AI...

January 27, 2025

Nasdaq futures tumble as China’s AI push rattles...

January 27, 2025

China Vanke’s CEO, chairman resign amid growing liquidity...

January 27, 2025

Fuji Media, rocked by sexual misconduct allegations, says...

January 27, 2025

Italy’s MPS shares fall ahead of Mediobanca board...

January 27, 2025

British Land stock drops following stake sale

January 27, 2025

UMG shares rally after new multi-year pact with...

January 27, 2025

BASF shares indicated 3% lower as impairments drag...

January 27, 2025

Ryanair cuts 2026 traffic forecast amid ongoing Boeing...

January 27, 2025
Become a VIP member by signing up for our newsletter. Enjoy exclusive content, early access to sales, and special offers just for you! As a VIP, you'll receive personalized updates, loyalty rewards, and invitations to private events. Elevate your experience and join our exclusive community today!








    By opting in you agree to receive emails from us and our affiliates. Your information is secure and your privacy is protected.

    Recent Posts

    • China outlines more controls on exports of rare earths and technology

      October 10, 2025
    • Paramount acquires Bari Weiss’ The Free Press, naming her the top editor of CBS News

      October 7, 2025
    • YouTube to pay $24 million to settle Trump lawsuit

      October 1, 2025
    • Charlie Javice sentenced to 7 years in prison for fraudulent $175M sale of aid startup

      October 1, 2025

    Categories

    • Economy (245)
    • Editor's Pick (3,646)
    • Investing (651)
    • Stock (6,426)

    Latest News

    • China outlines more controls on exports of rare earths and technology
    • Paramount acquires Bari Weiss’ The Free Press, naming her the top editor of CBS News

    Popular News

    • ANZ’s outgoing CEO to forfeit 2024 long-term variable remuneration
    • Japan stocks lower at close of trade; Nikkei 225 down 1.92%

    About The Significant deals

    • About us
    • Contacts
    • Privacy Policy
    • Terms & Conditions

    Copyright © 2025 thesignificantdeals.com | All Rights Reserved

    The Significant Deals
    • Investing
    • Stock
    • Editor’s Pick
    • Economy