Recherche : [IA] - Au fil du web

Hospitals use a transcription tool powered by an error-prone OpenAI model

A few months ago, my doctor showed off an AI transcription tool he used to record and summarize patient meetings. In my case, the summary was fine, but researchers cited in this report by The Associated Press have found that’s not always the case for transcriptions created by OpenAI’s Whisper, which powers a tool many hospitals use — sometimes it just makes things up entirely.

Whisper is used by a company called Nabla for a tool that it estimates has transcribed 7 million medical conversations, according to AP. More than 30,000 clinicians and 40 health systems use it, the outlet writes. The report says that Nabla officials “are aware that Whisper can hallucinate and are addressing the problem.” In a blog post published Monday, execs wrote that their model includes improvements to account for the “well-documented limitations of Whisper.”

A group of researchers from Cornell University, the University of Washington, and others described their findings in a peer-reviewed study presented in June at the Association for Computing Machinery FAccT conference.

According to the researchers, “While many of Whisper’s transcriptions were highly accurate, we find that roughly one percent of audio transcriptions contained entire hallucinated phrases or sentences which did not exist in any form in the underlying audio... 38 percent of hallucinations include explicit harms such as perpetuating violence, making up inaccurate associations, or implying false authority.”

The researchers noted that “hallucinations disproportionately occur for individuals who speak with longer shares of non-vocal durations,” which they said is more common for those with a language disorder called aphasia.

Sante · IA

May 14, 2026 at 12:01:29 AM GMT+2 * · permalien

·

https://www.theverge.com/2024/10/27/24281170/open-ai-whisper-hospitals-transcription-hallucinations-studies

o1 et Claude sont-ils capables de nous MANIPULER ? Deux études récentes aux résultats troublants

0:00 - Intro
1:42 - Qu'est-ce qu'un agent autonome ?
4:01 - Un LLM peut-il mentir et manipuler sans qu'on le lui demande ?
5:30 - 1er cas : quand o1 s'exfiltre sur un autre serveur
9:25 - Limite : contamination par la fiction et "Nothing else matters"
13:28 - 2e cas : quand o1 ment effrontément
17:02 - Sans "Nothing else matters" : un cas plus convaincant
18:58 - Un objectif long terme en prompt suffit à pousser à la manipulation
20:19 - Sans objectif long terme en prompt : les cas le plus troublants
24:20 - Sandbagging et objectif long terme acquis lors du RLHF
27:26 - Claude peut-il comprendre spontanément qu'il est testé ?
29:13 - Le résultat sur le sandbagging est curieusement négligé
30:41 - Conclusion et synthèse
31:28 - Eh non, c'est pas fini.
32:41 - Le principal résultat de l'article d'Anthropic : quand Claude feint l'alignement
37:45 - Version "prompt", version "fine-tuned", version RL
42:16 - Les scrupules de Claude
44:58 - La dimension morale des valeurs que protège Claude est-elle importante ?
48:08 - Conclusion de l'article
49:09 - Outro

IA · Video

February 1, 2025 at 11:50:35 AM GMT+1 * · permalien

·

https://www.youtube.com/watch?v=cw9wcNKDOtQ

🤖 Les machines à faire douter - DEFAKATOR

De l’usage bénéfique, malveillant ou débile de la puissance de la rhétorique artificielle.
Avec la participation de Flefgraph.

IA

January 5, 2025 at 5:04:18 PM GMT+1 * · permalien

·

https://www.youtube.com/watch?v=CTfparMJSSQ

Ivre, l’IA générative ne sait pas bien générer des verres de vin · Numerama.com

L’intelligence artificielle générative a un nouveau souci : les verres de vin. Et pas n’importe lesquels : les verres de vin remplis à ras bord. À moins de faire un prompt tarabiscoté et de multiplier les essais, les IA génératives comme Midjourney et Dall-E peinent beaucoup à verser le liquide jusqu’en haut du récipient.

Longtemps, l’intelligence artificielle avait une faiblesse bien connue des spécialistes, qui permettait d’ailleurs de facilement repérer les images factices. Il suffisait de regarder attentivement les mains : elles avaient bien souvent trop de doigts. Depuis, les systèmes d’IA ont progressé et on ne peut plus trop compter sur cette astuce.

Avec le temps, d’autres points faibles de l’IA générative ont cependant été repérés. Par exemple, on a constaté que certaines plateformes peinaient à créer des spaghettis et des monocycles. Mais, depuis peu, un autre sujet a l’air de mettre en grande difficulté les Midjourney et autres Dall-E : ce sont les verres de vin.

IA

October 31, 2024 at 6:31:56 PM GMT+1 * · permalien

·

https://www.numerama.com/tech/1832248-ivre-lia-generative-ne-sait-pas-bien-generer-des-verres-de-vin.html

What's Next for AI in Video Games? | Humble Bundle

Learn how Generative AI is used in the gaming industry, how it may evolve, and the controversies AI brings to video game development.

What’s Next for AI in Video Games?

As the video game industry looks ahead to 2024, many topics dominate the conversations and minds of developers, creators, and gamers alike. One of the most prominent and imminent subjects is AI in video games. Recent developments in technology for AI-generated video games have some people concerned or intrigued with what’s ahead for the industry.

IA · JeuVideo

February 9, 2024 at 12:23:53 PM GMT+1 * · permalien

·

https://blog.humblebundle.com/2024/02/01/whats-next-for-ai-in-video-games/

Create images with your words - Bing Image Creator comes to the new Bing - The Official Microsoft Blog

Last month we introduced the new AI-powered Bing and Microsoft Edge, your copilot for the web – delivering better search, complete answers, a new chat experience and the ability to create content. Already, we have seen that chat is reinventing how people search with more than 100 million chats to date. We’ve seen people use chat in a variety of ways, from refining answers to complex questions to using it as a form of entertainment or for creative inspiration. Today we’re taking the chat experience to the next level by making the new Bing more visual.

We’re excited to announce we are bringing Bing Image Creator, new AI-powered visual Stories and updated Knowledge Cards to the new Bing and Edge preview. Powered by an advanced version of the DALL∙E model from our partners at OpenAI, Bing Image Creator allows you to create an image simply by using your own words to describe the picture you want to see. Now you can generate both written and visual content in one place, from within chat.

IA · image

March 21, 2023 at 5:29:34 PM GMT+1 * · permalien

·

https://blogs.microsoft.com/blog/2023/03/21/create-images-with-your-words-bing-image-creator-comes-to-the-new-bing/

De quoi ChatGPT est-il VRAIMENT capable ? | Ft. Science4All · Tribunes sur Zeste de Savoir

Soyons clair : ne peut jamais savoir si une réponse que donne ChatGPT n’est pas complètement à côté de la plaque, et ceci d’une façon difficile à détecter si on ne s’y connaît pas.

Sommaire :
0:00 - Intro - Qu'est-ce que ChatGPT ?
2:49 - À quoi les modèles de langage sont-ils entraînés ?
6:10 - Test : où l'on voit que ça ne RÉPOND pas, ça PRÉDIT
10:04 - Mais quand on parle de sujets sensibles...
11:42 - Lê sur l'"éducation" des modèles de langage
14:25 - Lê présente Tournesol
16:51 - ChatGPT : transformer un modèle de langage en chatbot
17:33 - Le problème de Blake Lemoine : un chatbot conscient ?
21:30 - ChatGPT prétend répondre correctement...
24:07 - Mais ChatGPT n'est pas du tout fiable.
27:05 - La machine à bullshit
30:13 - ChatGPT peut-il surpasser l'intelligence humaine ?
33:14 - Quand un modèle de langage joue aux échecs...
36:19 - Outro

IA

January 12, 2023 at 9:48:38 AM GMT+1 * · permalien

·

https://zestedesavoir.com/billets/4408/signet-de-quoi-chatgpt-est-il-vraiment-capable-ft-science4all/

This Person Does Not Exist

Imagined by a GAN (generative adversarial network) StyleGAN2 (Dec 2019) - Karras et al. and Nvidia

https://github.com/NVlabs/stylegan2

image · IA

March 8, 2022 at 12:28:56 PM GMT+1 * · permalien

·

https://thispersondoesnotexist.com/