Enfrentando el desafío de la "materia oscura" digital.

Un exceso de información extraña, o «ruido», ha enmascarado características cruciales en el análisis del ADN por IA, un problema similar al de la «materia oscura» digital. Ahora, los científicos pueden tener una manera de arreglar esto.

La inteligencia artificial ha permeado nuestra existencia diaria. Al principio, era visible en ChatGPT y, actualmente, es visible en los anuncios de pizza y cerveza generados por IA. Si bien la IA puede no ser completamente confiable, parece que, a veces, nuestro enfoque de la IA tampoco es del todo confiable.

Un profesor asistente del Laboratorio Cold Spring Harbor (CSHL) descubrió que los científicos que usan herramientas computacionales comunes para interpretar las predicciones de IA captan mucho «ruido» o información adicional al analizar[{» attribute=»»>DNA. And he’s found a way to fix this. Now, with just a couple new lines of code, scientists can get more reliable explanations out of powerful AIs known as deep neural networks. That means they can continue chasing down genuine DNA features. Those features might just signal the next breakthrough in health and medicine. But scientists won’t see the signals if they’re drowned out by too much noise.

So, what causes the meddlesome noise? It’s a mysterious and invisible source like digital “dark matter.” Physicists and astronomers believe most of the universe is filled with dark matter, a material that exerts gravitational effects but that no one has yet seen. Similarly, Koo and his team discovered the data that AI is being trained on lacks critical information, leading to significant blind spots. Even worse, those blind spots get factored in when interpreting AI predictions of DNA function.

Koo says: “The deep neural network is incorporating this random behavior because it learns a function everywhere. But DNA is only in a small subspace of that. And it introduces a lot of noise. And so we show that this problem actually does introduce a lot of noise across a wide variety of prominent AI models.”

The digital dark matter is a result of scientists borrowing computational techniques from computer vision AI. DNA data, unlike images, is confined to a combination of four nucleotide letters: A, C, G, T. But image data in the form of pixels can be long and continuous. In other words, we’re feeding AI an input it doesn’t know how to handle properly.

By applying Koo’s computational correction, scientists can interpret AI’s DNA analyses more accurately.

Koo says: “We end up seeing sites that become much more crisp and clean, and there is less spurious noise in other regions. One-off nucleotides that are deemed to be very important all of a sudden disappear.”

Koo believes noise disturbance affects more than AI-powered DNA analyzers. He thinks it’s a widespread affliction among computational processes involving similar types of data. Remember, dark matter is everywhere. Thankfully, Koo’s new tool can help bring scientists out of the darkness and into the light.

Reference: “Correcting gradient-based interpretations of deep neural networks for genomics” by Antonio Majdandzic, Chandana Rajesh and Peter K. Koo, 9 May 2023, Genome Biology.
DOI: 10.1186/s13059-023-02956-3

The study was funded by the National Institutes of Health and the Simons Center for Quantitative Biology.

Pili Onate

«Alborotador. Amante de la cerveza. Total aficionado al alcohol. Sutilmente encantador adicto a los zombis. Ninja de twitter de toda la vida».

READ Investigadores han descubierto un fósil de una nueva especie de pangolín en Europa

Enfrentando el desafío de la «materia oscura» digital.

SpaceX ha lanzado 23 satélites Starlink desde Florida

Restaurar la testosterona después de dejar el TDA: ¿quién corre el riesgo de no recuperarse a largo plazo?

Los científicos finalmente han descubierto signos de vida en un planeta dos veces más grande que la Tierra.

Astros de Houston vs. Colorado Rockies: Cómo ver el partido de la MLB Mexico City Series de hoy

WBD, el acuerdo de SkyShowtime con la española Movistar Plus+

ESPAÑA – La Facultad de Teología de Sevilla acoge las II Jornadas sobre Deporte y Fe

SpaceX ha lanzado 23 satélites Starlink desde Florida

Entradas recientes

Deja una respuesta Cancelar la respuesta

More Stories

SpaceX ha lanzado 23 satélites Starlink desde Florida

Restaurar la testosterona después de dejar el TDA: ¿quién corre el riesgo de no recuperarse a largo plazo?

Los científicos finalmente han descubierto signos de vida en un planeta dos veces más grande que la Tierra.

You may have missed

Astros de Houston vs. Colorado Rockies: Cómo ver el partido de la MLB Mexico City Series de hoy

WBD, el acuerdo de SkyShowtime con la española Movistar Plus+

ESPAÑA – La Facultad de Teología de Sevilla acoge las II Jornadas sobre Deporte y Fe

SpaceX ha lanzado 23 satélites Starlink desde Florida