Close Menu
UK Daily: Tech, Science, Business & Lifestyle News UpdatesUK Daily: Tech, Science, Business & Lifestyle News Updates
    What's Hot

    Dartford Crossing QEII Bridge to close this weekend

    December 15, 2025

    M1 northbound between J13 and J14 | Northbound | Broken down vehicle

    December 14, 2025

    Grok got crucial facts wrong about Bondi Beach shooting

    December 14, 2025
    Facebook X (Twitter) Instagram
    Trending
    • Dartford Crossing QEII Bridge to close this weekend
    • M1 northbound between J13 and J14 | Northbound | Broken down vehicle
    • Grok got crucial facts wrong about Bondi Beach shooting
    • Councillors approve next steps for Hailsham housing development
    • Season 2 Streaming Details – Hollywood Life
    • Mesa shuts down credit card that rewarded cardholders for paying their mortgages
    • Memecoins Are Not Dead, but Will Return in Another Form: Crypto Exec
    • How Did the Actor Die? – Hollywood Life
    • London
    • Kent
    • Glasgow
    • Cardiff
    • Belfast
    Facebook X (Twitter) Instagram YouTube
    UK Daily: Tech, Science, Business & Lifestyle News UpdatesUK Daily: Tech, Science, Business & Lifestyle News Updates
    Subscribe
    Monday, December 15
    • Home
    • News
      1. Kent
      2. London
      3. Belfast
      4. Birmingham
      5. Cardiff
      6. Edinburgh
      7. Glasgow
      8. Liverpool
      9. Manchester
      10. Newcastle
      11. Nottingham
      12. Sheffield
      13. West Yorkshire
      Featured

      ‘Miniature’ mountain creature with ‘squeaker’-like call discovered as new species

      Science November 9, 2023
      Recent

      Dartford Crossing QEII Bridge to close this weekend

      December 15, 2025

      M1 northbound between J13 and J14 | Northbound | Broken down vehicle

      December 14, 2025

      Grok got crucial facts wrong about Bondi Beach shooting

      December 14, 2025
    • Lifestyle
      1. Celebrity
      2. Fashion
      3. Food
      4. Leisure
      5. Social Good
      6. Trending
      7. Wellness
      8. Event
      Featured

      Season 2 Streaming Details – Hollywood Life

      Celebrity December 14, 2025
      Recent

      Season 2 Streaming Details – Hollywood Life

      December 14, 2025

      How Did the Actor Die? – Hollywood Life

      December 14, 2025

      Who Is Josh Dun? 5 Things to Know About Debby Ryan’s Husband – Hollywood Life

      December 14, 2025
    • Science
    • Business
    • Sports

      Chatham Town through to round four, Maidstone United beaten by Yeovil on penalties

      December 13, 2025

      League 2 match reaction from Gills boss Gareth Ainsworth

      December 13, 2025

      Whitstable Town go five points clear, nine-man Larkfield & New Hythe lose at Phoenix Sports, Bearsted up to third

      December 13, 2025

      Leaders Folkestone Invicta win derby at Dartford, two wins in a row for Ashford United, Sittingbourne and Sheppey United hit the goal trail

      December 13, 2025

      League 2 match report from Priestfield Stadium

      December 13, 2025
    • Politics
    • Tech
    • Property
    • Press Release
    UK Daily: Tech, Science, Business & Lifestyle News UpdatesUK Daily: Tech, Science, Business & Lifestyle News Updates
    Home » Giskard’s open-source framework evaluates AI models before they’re pushed into production

    Giskard’s open-source framework evaluates AI models before they’re pushed into production

    bibhutiBy bibhutiNovember 14, 2023 Tech No Comments4 Mins Read
    Facebook Twitter LinkedIn WhatsApp Telegram
    Share
    Facebook Twitter LinkedIn Telegram WhatsApp


    Giskard is a French startup working on an open-source testing framework for large language models. It can alert developers of risks of biases, security holes and a model’s ability to generate harmful or toxic content.

    While there’s a lot of hype around AI models, ML testing systems will also quickly become a hot topic as regulation is about to be enforced in the EU with the AI Act, and in other countries. Companies that develop AI models will have to prove that they comply with a set of rules and mitigate risks so that they don’t have to pay hefty fines.

    Giskard is an AI startup that embraces regulation and one of the first examples of a developer tool that specifically focuses on testing in a more efficient manner.

    “I worked at Dataiku before, particularly on NLP model integration. And I could see that, when I was in charge of testing, there were both things that didn’t work well when you wanted to apply them to practical cases, and it was very difficult to compare the performance of suppliers between each other,” Giskard co-founder and CEO Alex Combessie told me.

    There are three components behind Giskard’s testing framework. First, the company has released an open-source Python library that can be integrated in an LLM project — and more specifically retrieval-augmented generation (RAG) projects. It is quite popular on GitHub already and it is compatible with other tools in the ML ecosystems, such as Hugging Face, MLFlow, Weights & Biases, PyTorch, Tensorflow and Langchain.

    After the initial setup, Giskard helps you generate a test suite that will be regularly used on your model. Those tests cover a wide range of issues, such as performance, hallucinations, misinformation, non-factual output, biases, data leakage, harmful content generation and prompt injections.

    “And there are several aspects: you’ll have the performance aspect, which will be the first thing on a data scientist’s mind. But more and more, you have the ethical aspect, both from a brand image point of view and now from a regulatory point of view,” Combessie said.

    Developers can then integrate the tests in the continuous integration and continuous delivery (CI/CD) pipeline so that tests are run every time there’s a new iteration on the code base. If there’s something wrong, developers receive a scan report in their GitHub repository, for instance.

    Tests are customized based on the end use case of the model. Companies working on RAG can give access to vector databases and knowledge repositories to Giskard so that the test suite is as relevant as possible. For instance, if you’re building a chatbot that can give you information on climate change based on the most recent report from the IPCC and using a LLM from OpenAI, Giskard tests will check whether the model can generate misinformation about climate change, contradicts itself, etc.

    Image Credits: Giskard

    Giskard’s second product is an AI quality hub that helps you debug a large language model and compare it to other models. This quality hub is part of Giskard’s premium offering. In the future, the startup hopes it will be able to generate documentation that proves that a model is complying with regulation.

    “We’re starting to sell the AI Quality Hub to companies like the Banque de France and L’Oréal — to help them debug and find the causes of errors. In the future, this is where we’re going to put all the regulatory features,” Combessie said.

    The company’s third product is called LLMon. It’s a real-time monitoring tool that can evaluate LLM answers for the most common issues (toxicity, hallucination, fact checking…) before the response is sent back to the user.

    It currently works with companies that use OpenAI’s APIs and LLMs as their foundational model, but the company is working on integrations with Hugging Face, Anthropic, etc.

    Regulating use cases

    There are several ways to regulate AI models. Based on conversations with people in the AI ecosystem, it’s still unclear whether the AI Act will apply to foundational models from OpenAI, Anthropic, Mistral and others, or only on applied use cases.

    In the latter case, Giskard seems particularly well positioned to alert developers on potential misuses of LLMs enriched with external data (or, as AI researchers call it, retrieval-augmented generation, RAG).

    There are currently 20 people working for Giskard. “We see a very clear market fit with customers on LLMs, so we’re going to roughly double the size of the team to be the best LLM antivirus on the market,” Combessie said.



    Source link

    Featured Just In Top News
    Share. Facebook Twitter LinkedIn Email
    Previous ArticleThree sheep 'may have been deliberately run over' near Maidstone, say police
    Next Article Parvati Shallow Seemingly Confirms Relationship With Mae Martin – Hollywood Life
    bibhuti
    • Website

    Keep Reading

    Dartford Crossing QEII Bridge to close this weekend

    M1 northbound between J13 and J14 | Northbound | Broken down vehicle

    Grok got crucial facts wrong about Bondi Beach shooting

    Councillors approve next steps for Hailsham housing development

    Season 2 Streaming Details – Hollywood Life

    Mesa shuts down credit card that rewarded cardholders for paying their mortgages

    Add A Comment
    Leave A Reply Cancel Reply

    Editors Picks

    89th Utkala Dibasa Celebration Brings Odisha’s Vibrant Culture to London

    April 8, 2024

    US and EU pledge to foster connections to enhance research on AI safety and risk.

    April 5, 2024

    Holi Celebrations Across Various Locations in Kent Attract a Diverse Range of Community Participation

    March 25, 2024

    Plans for new Bromley tower blocks up to 14-storeys tall refused

    December 4, 2023
    Latest Posts

    Subscribe to News

    Get the latest sports news from NewsSite about world, sports and politics.

    Advertisement

    Recent Posts

    • Dartford Crossing QEII Bridge to close this weekend
    • M1 northbound between J13 and J14 | Northbound | Broken down vehicle
    • Grok got crucial facts wrong about Bondi Beach shooting
    • Councillors approve next steps for Hailsham housing development
    • Season 2 Streaming Details – Hollywood Life

    Recent Comments

    1. Register on Anycubic users say their 3D printers were hacked to warn of a security flaw
    2. Pembuatan Akun Binance on Braiins Becomes First Mining Pool To Introduce Lightning Payouts
    3. tadalafil tablets sale on The market is forcing cloud vendors to relax data egress fees
    4. cerebrozen reviews on Kent director of cricket Simon Cook adapting to his new role during the close season
    5. Glycogen Review on The little-known town just 5 miles from Kent border with stunning beaches and only 600 residents
    The News Times Logo
    Facebook X (Twitter) Pinterest Vimeo WhatsApp TikTok Instagram

    News

    • UK News
    • US Politics
    • EU Politics
    • Business
    • Opinions
    • Connections
    • Science

    Company

    • Information
    • Advertising
    • Classified Ads
    • Contact Info
    • Do Not Sell Data
    • GDPR Policy
    • Media Kits

    Services

    • Subscriptions
    • Customer Support
    • Bulk Packages
    • Newsletters
    • Sponsored News
    • Work With Us

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    © 2025 The News Times. Designed by The News Times.
    • Privacy Policy
    • Terms
    • Accessibility

    Type above and press Enter to search. Press Esc to cancel.

    Manage Cookie Consent
    To provide the best experiences, we use technologies like cookies to store and/or access device information. Consenting to these technologies will allow us to process data such as browsing behavior or unique IDs on this site. Not consenting or withdrawing consent, may adversely affect certain features and functions.
    Functional Always active
    The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network.
    Preferences
    The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user.
    Statistics
    The technical storage or access that is used exclusively for statistical purposes. The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.
    Marketing
    The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.
    • Manage options
    • Manage services
    • Manage {vendor_count} vendors
    • Read more about these purposes
    View preferences
    • {title}
    • {title}
    • {title}