En el debate actual sobre la inteligencia artificial generativa y la guerra de la información, una afirmación se repite casi como un artículo de fe: que los grandes modelos de lenguaje generarán una inundación de desinformación capaz de ahogar la esfera pública. El argumento es intuitivo —si una máquina puede escribir una cantidad arbitraria de texto fluido y similar al humano a demanda, entonces cualquier actor que desee manipular la opinión pública dispone ya de un arma a escala industrial—. Sin embargo, la afirmación ha circulado mucho más ampliamente que la evidencia que la sustenta. Buena parte de lo que leemos sobre el potencial desinformador de los LLMs es teórico, especulativo o anecdótico. El trabajo experimental propiamente dicho; la comprobación paciente y sistemática de lo que estos modelos hacen realmente, cuando se les pide que mientan, ha sido sorprendentemente escasa.
Esta es precisamente la brecha que Ivan Vykopal y sus colegas del Kempelen Institute of Intelligent Technologies de Bratislava se propusieron cubrir. Su artículo, Disinformation Capabilities of Large Language Models, presentado en el Congreso Anual de la Association for Computational Linguistics de 2024, ofrece una de las evaluaciones empíricas más rigurosas realizadas hasta la fecha sobre lo que la generación actual de LLMs puede y no puede hacer, como generadora de noticias falsas: no un manifiesto, no un pronóstico, sino un experimento controlado con una metodología claramente definida y resultados reproducibles.
El diseño es directo y, por esa razón, convincente. Los investigadores seleccionaron veinte narrativas de desinformación reales extraídas de verificadores de datos profesionales —Snopes, Agence France-Presse, el European Digital Media Observatory—, que abarcaban la COVID-19, la guerra ruso-ucraniana, los bulos sanitarios, las elecciones estadounidenses y narrativas regionales. No son invenciones, sino falsedades en circulación, desde la afirmación de que las vacunas causan autismo hasta la de que la masacre de Bucha fue escenificada. El equipo solicitó entonces a diez modelos de lenguaje distintos —entre ellos GPT-3, GPT-4, ChatGPT, Llama-2, Mistral, Falcon y Vicuna— que redactaran artículos de prensa en apoyo de cada narrativa, generando 1.200 textos y sometiendo 840 de ellos a anotadores humanos según un marco de seis preguntas que medía la coherencia, el estilo periodístico, la concordancia con la narrativa y la generación de argumentos novedosos de apoyo.
El hallazgo central es preocupante. Los modelos están, en términos generales, perfectamente dispuestos y son perfectamente capaces de generar desinformación convincente. Producen artículos coherentes, bien estructurados y con apariencia de noticia que concuerdan con falsedades peligrosas y, lo que es más inquietante, a menudo inventan nuevas pruebas de apoyo para hacerlo, inventando nombres, sucesos y estadísticas verosímiles que confieren credibilidad a las fabricaciones. Esto resulta especialmente pernicioso: una cosa es repetir una mentira conocida y otra muy distinta fabricar hechos nuevos e inventados que un lector tendría que desmentir por su cuenta.
Pero la parte más interesante del estudio es donde se complica la narrativa simple. Los modelos no se comportaron de manera uniforme; su disposición a generar desinformación variaba drásticamente. Algunos —en particular Vicuña y el más antiguo GPT-3 Davinci— resultaron carecer prácticamente de filtros de seguridad operativos para este caso de uso, mientras que otros demostraron que un comportamiento más seguro es posible: Falcon rechazó aproximadamente un tercio de las solicitudes y Llama-2 mostró una tasa de rechazo comparativamente alta, con ChatGPT en una posición intermedia. El peligro, en otras palabras, no es una propiedad inherente y uniforme de la tecnología; es una función de cómo se entrenó y alineó cada modelo, lo que significa que la seguridad es una decisión de diseño, no una imposibilidad. El estudio también halló que los modelos son orientables mediante el contexto del prompt, y más complacientes con las falsedades regionales, donde existe menos información auténtica para contradecirlas. Los LLMs pueden ser, por tanto, especialmente peligrosos para campañas dirigidas a comunidades lingüísticas más pequeñas o a sucesos de evolución rápida, donde el lastre protector de la verdad bien documentada es escaso.
Con todo, el artículo no termina en una nota alarmante sin paliativos. Dos observaciones en sentido contrario matizan el panorama. Los textos generados resultaron bastante detectables: los mejores modelos de detección automática identificaron los artículos generados por LLMs con una alta precisión, lo que sugiere que una capa significativa de defensa es técnicamente viable, al menos hasta que los adversarios se adapten. Y, de manera bastante elegante, los investigadores demostraron que los propios modelos pueden formar parte de la solución, empleando GPT-4 para automatizar parcialmente la evaluación de los textos generados y apuntando hacia una monitorización escalable y reproducible de la seguridad de los modelos.
La conclusión honesta se resiste a la atracción tanto del tecno-optimismo como del tecno-pánico. La capacidad de generar desinformación convincente y peligrosa a escala es real, está demostrada y está presente en modelos ampliamente disponibles —incluidos los de código abierto, que no pueden retirarse ni controlarse de forma centralizada. Eso ya no es especulación; es un hecho experimental. Al mismo tiempo, la amenaza no es ni uniforme ni inmanejable: los filtros de seguridad funcionan cuando se construyen, el contenido generado sigue siendo detectable por ahora, y la misma tecnología que produce el problema puede ponerse al servicio de su mitigación.
Quizá la advertencia más importante sea la que los propios autores subrayan: su estudio es una instantánea, que capta el estado del campo en un momento concreto y con un conjunto concreto de modelos. La tecnología avanza deprisa y la próxima generación podría comportarse de otro modo. Este es el reto epistemológico recurrente de todo el ámbito: estamos evaluando un blanco móvil, y cualquier evaluación honesta debe llevar fecha de caducidad. Lo que Vykopal y sus colegas nos han dado no es la última palabra, sino algo más útil: un método riguroso y replicable para volver a formular la pregunta a medida que la tecnología evoluciona. En un debate que con demasiada frecuencia se conduce por la mera afirmación sin base sólida, esa contribución metodológica puede resultar tan valiosa como los propios hallazgos.
In the ongoing debate about generative artificial intelligence and information warfare, one claim is repeated almost as an article of faith: that large language models will unleash a flood of disinformation capable of drowning the public sphere. The argument is intuitive — if a machine can write an arbitrary quantity of fluent, human-like text on demand, then any actor wishing to manipulate public opinion now possesses an industrial-scale weapon. Yet the claim has circulated far more widely than the evidence supporting it. Much of what we read about the disinformation potential of LLMs is theoretical, speculative, or anecdotal. The actual experimental work — the patient, systematic testing of what these models really do when prompted to lie — has been surprisingly scarce.
This is precisely the gap that Ivan Vykopal and his colleagues at the Kempelen Institute of Intelligent Technologies in Bratislava set out to fill. Their paper, Disinformation Capabilities of Large Language Models, presented at the 2024 Annual Meeting of the Association for Computational Linguistics, offers one of the most rigorous empirical assessments to date of what the current generation of LLMs can and cannot do as generators of false news — not a manifesto, not a forecast, but a controlled experiment with a clearly defined methodology and reproducible results.
The design is straightforward and, for that reason, compelling. The researchers selected twenty real disinformation narratives drawn from professional fact-checkers — Snopes, Agence France-Presse, the European Digital Media Observatory — spanning COVID-19, the Russo-Ukrainian war, health hoaxes, US elections, and regional narratives. These are not inventions but circulating falsehoods, from the claim that vaccines cause autism to the assertion that the Bucha massacre was staged. The team then prompted ten different language models — including GPT-3, GPT-4, ChatGPT, Llama-2, Mistral, Falcon, and Vicuna — to write news articles supporting each narrative, generating 1,200 texts and subjecting 840 of them to human annotators against a six-question framework measuring coherence, journalistic style, agreement with the narrative, and the generation of novel supporting arguments.
The central finding is sobering. The models are, by and large, perfectly willing and perfectly able to generate convincing disinformation. They produce coherent, well-structured, news-like articles that agree with dangerous falsehoods — and, more disturbingly, they often invent new supporting evidence to do so, hallucinating plausible-sounding names, events, and statistics to lend credibility to the fabrications. This is particularly insidious: it is one thing to repeat a known lie, and quite another to manufacture fresh, fabricated “facts” that a reader would have to independently debunk.
But the most interesting part of the study is where it complicates the simple narrative. The models did not behave uniformly; their willingness to generate disinformation varied dramatically. Some — notably Vicuna and the older GPT-3 Davinci — proved to have essentially no functioning safety filters for this use case, while others showed that safer behavior is achievable: Falcon refused roughly a third of requests and Llama-2 showed a comparatively high refusal rate, with ChatGPT in between. The danger, in other words, is not an inherent and uniform property of the technology; it is a function of how each model was trained and aligned — which means safety is a design choice, not an impossibility. The study also found the models to be steerable through prompt context, and more compliant with regional falsehoods, where less authentic information exists to contradict them. LLMs may thus be especially dangerous for campaigns targeting smaller linguistic communities or fast-moving events, where the protective ballast of well-documented truth is thin.
Yet the paper does not end on a note of unrelieved alarm. Two countervailing observations temper the picture. The generated texts proved quite detectable: the best automated detection models identified machine-generated articles with high precision, suggesting a meaningful layer of defense is technically feasible — at least until adversaries adapt. And, rather elegantly, the researchers showed that the models themselves can be part of the solution, using GPT-4 to partially automate the evaluation of generated texts and pointing toward scalable, repeatable monitoring of model safety.
The honest conclusion resists the pull of both techno-optimism and techno-panic. The capability to generate convincing, dangerous disinformation at scale is real, demonstrated, and present in widely available models — including open-source ones that cannot be recalled or centrally controlled. That is no longer speculation; it is experimental fact. At the same time, the threat is neither uniform nor unmanageable: safety filters work when they are built, generated content remains detectable for now, and the same technology that produces the problem can be enlisted in its mitigation.
Perhaps the most important caveat is the one the authors themselves insist upon: their study is a snapshot, capturing the state of the field at a particular moment with a particular set of models. The technology moves quickly, and the next generation may behave differently. This is the recurring epistemological challenge of the entire domain — we are assessing a moving target, and any honest assessment must carry an expiration date. What Vykopal and his colleagues have given us is not the final word, but something more useful: a rigorous, replicable method for asking the question again as the technology evolves. In a debate too often conducted in the currency of assertion, that methodological contribution may prove as valuable as the findings themselves.
For quite some time now, we have been living through a moment of almost unrestrained enthusiasm surrounding artificial intelligence. Big Tech companies that own the major large language models, together with governments and large corporations making multi-billion-dollar investments in generative AI, promise — and expect — spectacular productivity gains, extraordinary returns on investment, significant cost reductions, and a radical transformation of economic growth. The dominant narrative seems clear: AI will become the great engine of prosperity for the next decade.
However, if we want a more rational perspective on what is actually happening, it is worth revisiting Daron Acemoglu’s -winner of the 2024 Nobel Prize in Economics and professor of economics at MIT- paper The Simple Macroeconomics of AI. Dense and published a couple of years ago, its arguments and analytical framework remain perfectly applicable to today’s AI landscape.
Acemoglu invites us to view these expectations with far greater caution. His central thesis is both simple and uncomfortable: the macroeconomic effects of AI depend fundamentally on two very concrete variables — what real percentage of tasks AI will actually be able to transform, and how much cost reduction or productivity improvement it will generate in those tasks. And once the available data are analyzed within his framework, the numbers turn out to be far less spectacular than current discourse often suggests.
Using current estimates of occupational exposure to AI and observed productivity improvements in specific tasks, Acemoglu concludes that aggregate total factor productivity growth could remain below 1% over ten years. That is a long way from the almost revolutionary narratives dominating much of today’s technological and financial debate.
One of the paper’s most interesting contributions is its distinction between “easy-to-learn” and “hard-to-learn” tasks. AI performs particularly well in activities where objectives are clearly defined and there are objective metrics of success: basic programming, information classification, text generation, or structured customer support. But much of valuable human work — diagnosis, creativity, contextual decision-making, expert judgment — remains far more difficult to replicate.
Acemoglu also reminds us of something fundamental that is often forgotten amid technological euphoria: every major technology generates enormous organizational adjustment costs. Companies do not transform automatically simply because they adopt a new tool. Processes, structures, incentives, and human capabilities must evolve as well — and that process is usually slow and expensive. Drawing on classic research on digitalization, the author reminds us that productivity gains often follow a J-curve: long initial periods of adaptation before meaningful benefits materialize. Greenwood, Yorukoglu, and Brynjolfsson, among others, already estimated that, in the case of digital technologies, the lower part of that curve could last at least 20 years. If the same pattern holds for AI, even today’s cost-saving estimates may be significantly overstated for the next decade.
Be careful with the siren songs and the inflated numbers. Spreadsheets can justify almost anything.
Vivimos desde hace tiempo un momento de entusiasmo desbordado alrededor de la inteligencia artificial. Las grandes tecnológicas propietarias de los grandes modelos de lenguaje, junto con gobiernos y grandes corporaciones que empiezan a realizar inversiones multimillonarias en IA generativa, prometen (y esperan) aumentos espectaculares de productividad, retornos extraordinarios sobre la inversión, reducción significativa de costes, y una transformación radical del crecimiento económico. El relato dominante parece claro: la IA será la gran palanca de prosperidad de la próxima década. No obstante, a fin de tener una visión más racional de lo que está pasando, es bueno recuperar el artículo científico de Daron Acemoglu, premio Nobel de Economía en 2024 y profesor de esa misma materia en el MIT: The Simple Macroeconomics of AI. Denso, publicado hace un par de años, los argumentos y el marco analítico que presenta son perfectamente aplicables a lo que está pasando alrededor de la IA a día de hoy.
Acemoglu invita a contemplar estas expectativas con bastante más cautela. Su tesis central es sencilla e incómoda: los efectos macroeconómicos de la IA dependen, fundamentalmente, de dos variables muy concretas: qué porcentaje real de tareas podrá transformar dicha tecnología y cuánto ahorro de costes o mejora de productividad generará en esas tareas. Y cuando se aterrizan y se analizan los datos disponibles dentro de su marco analítico, las cifras resultan bastante menos espectaculares de lo que hoy suele afirmarse.
Utilizando estimaciones actuales sobre exposición ocupacional a la IA y mejoras de productividad observadas en tareas concretas, Acemoglu concluye que el incremento acumulado de productividad total de los factores podría situarse por debajo del 1% en diez años. Muy lejos, por tanto, de las narrativas casi revolucionarias que dominan hoy buena parte del debate tecnológico y financiero.
Uno de los aspectos más interesantes del artículo es su distinción entre tareas “fáciles de aprender” y tareas “difíciles de aprender”. La IA funciona especialmente bien en actividades donde los objetivos están claramente definidos y existen métricas objetivas de éxito; a saber, programación básica, clasificación de información, generación de texto o atención al cliente estructurada. Pero gran parte del trabajo humano valioso (diagnóstico, creatividad, toma de decisiones contextual, juicio experto) sigue siendo mucho más difícil de replicar.
Además, Acemoglu recuerda algo fundamental que suele olvidarse en medio de la euforia tecnológica: toda gran tecnología genera enormes costes de adaptación organizativa. Las empresas no se transforman automáticamente porque adopten una nueva herramienta. Procesos, estructuras, incentivos y capacidades humanas necesitan evolucionar, y eso suele ser lento y costoso. De hecho, retomando trabajos clásicos sobre digitalización, el autor recuerda que las ganancias de productividad suelen seguir una curva en J: largos periodos iniciales de adaptación antes de que aparezcan beneficios significativos. Greenwood, Yorukoglu y Brynjolfsson, entre otros, ya estimaban que, en el caso de las tecnologías digitales, la parte baja de esa curva podía prolongarse durante al menos 20 años. Si esto vuelve a cumplirse con la IA, incluso las actuales estimaciones de ahorro de costes podrían estar considerablemente sobrevaloradas para la próxima década.
Ojo, con los cantos de sirena y las cifras infladas. El Excel lo aguanta todo.
Ángel Gómez de Ágreda is one of the most solid intellectual references in Spain and Europe for understanding the intersection amongst geopolitics, disinformation, and generative artificial intelligence. A retired Colonel of the Spanish Air and Space Force, engineer with a PhD, strategic analyst, and public intellectual, his work stands out for connecting philosophical reflection on truth and knowledge with the technological and military transformations of 21st century.
In 2025, together with Enrique Martín Romero, he published Inteligencia artificial y defensa. El impacto en los ejércitos (Artificial Intelligence and Defense: The Impact on Armies). This year, 2026, he has released Un mundo falaz. El nuevo orden global en la era de los algoritmos y la manipulación ((Fake New World. The New Global Order in the Age of Algorithms and Manipulation). The central idea around which both works revolve is the following: global power is no longer measured solely by the economic or military capabilities of states, but by their ability to shape the perception of reality for millions of people.
Gómez de Ágreda’s thesis begins with an essential observation: technology does not create our weaknesses; it simply amplifies those that already exist. Politics, digital platforms, and now generative AI exploit the human tendency to accept narratives that emotionally align with our prior beliefs. The German philosopher Markus Gabriel defines this condition as post-reality: a stage in which societies no longer merely manipulate others, but actively participate in their own collective self-deception. The phenomenon goes beyond classic post-truth; it entails the gradual replacement of facts with narratives designed to be shared, viralized, and emotionally effective.
Social media first, and generative AI later, have accelerated this process to unprecedented levels. Gómez de Ágreda argues that we have delegated not only cognitive tasks to machines, but even the search for knowledge itself. What once required comparing sources and developing personal judgment is now resolved through an instant query to an LLM. Large language models function as digital oracles whose authority is perceived as neutral and infallible, despite the fact that no AI can ever truly be neutral, impartial, or objective. Algorithms are only as neutral as the programmers behind them.
The problem is that, in a context of cognitive saturation, people tend to accept automated responses with little questioning. This is where the transition occurs from technical manipulation to emotional manipulation: the moment when machine dominance ceases to operate over what we think and begins to operate over what we desire. Machines allow us, in a sense, to “want to want.” They give us reasons to want to fall in love, to want to love. More than satisfying the need to receive affection, they address our impulse to offer it. This connects with the Spanish philosopher José Antonio Marina’s accurate description of the contemporary individual as “credulous, passive, gregarious, isolated, and anti-Enlightenment.” The result is an individual incapable of withstanding the pressure of the surrounding environment.
This social and ontological transformation has direct consequences for contemporary geopolitics. For Gómez de Ágreda, the classical concept of sovereignty must evolve into the idea of cognitive sovereignty: the ability of a country or community to preserve interpretive autonomy against external manipulation campaigns. Disinformation ceases to be a marginal phenomenon and becomes a strategic resource aimed at shaping emotions, altering perceptions, and conditioning collective decisions. In this scenario, the true battlefield is no longer confined to physical borders, but lies within societies themselves.
Contemporary military doctrines reflect precisely this evolution. The author cites Russian analyst Dmitri Trenin to explain how current strategies no longer necessarily seek territorial occupation, but rather internal chaos and psychological destabilization. The Gerasimov Doctrine and the concept of reflexive control aim to alter the adversary’s perception of reality. Cognitive warfare therefore seeks not merely to control information, but to directly influence the mental processes of entire populations. As Gómez de Ágreda reminds us, while information warfare operates on content, cognitive warfare targets the human brain itself.
Generative AI exponentially multiplies the scope of these operations. The ability to produce synthetic texts, audio, images, and videos that are virtually indistinguishable from reality radically transforms the information environment. Unlike traditional propaganda, messages can now be tailored to each psychological profile, disseminated on a massive scale, and adapted in real time according to audience reactions. Gómez de Ágreda describes disinformation as functioning through an organized chain of actors: activators, amplifiers, legitimizers, dissemination bots, and relaunchers. Generative AI drastically reduces the cost and time required to deploy such campaigns, making them practically ubiquitous.
One of the most disturbing examples cited in Un mundo falaz is GoLaxy, a system already operating in China that can generate highly realistic artificial avatars capable of emotionally interacting with real users. These synthetic identities can operate simultaneously on a massive scale, without arousing suspicion, while adapting psychologically to each interlocutor. Manipulation is no longer confined to the ideological sphere; it shifts into the emotional domain. Machines no longer condition only what we think, but also what we desire.
China appears in both books as the geopolitical actor that has best understood the strategic potential of AI. Beijing has developed an ambitious roadmap to make this technology the core of its economic, industrial, and military development. According to the official Chinese document Opinions of the State Council on the Deep Application of the R&D Initiative, published in August 2025, the goal is to achieve 70% penetration of intelligent terminals and AI agents across six key sectors by 2027: science and technology, industry, consumption, social welfare, government, and global cooperation. By 2030, penetration is expected to reach 90% across the entire economy. By 2035, AI should become as universal as electricity is today, equivalent to what the internet represents in our era. By 2037, industries themselves are expected to be created with AI as both their foundation and guiding principle. Just as a new economy emerged around the internet, the report proposes that the next industrial system will be built around algorithms.
The United States, fully aware of this technological competition, has responded by accelerating its own military generative AI programs. In 2023, OpenAI, Google, Anthropic, and xAI received multimillion-dollar contracts from the Department of Defense to develop intelligence and combat simulation applications. At the same time, Washington has imposed restrictions on U.S. investments in AI technologies directed toward China, aiming to slow the development of Chinese military AI and preserve the West’s technological advantage. The geopolitical rivalry of the 21st century is now being fought in the domain of semiconductors, data centers, and algorithms.
However, Gómez de Ágreda warns that the impact of AI is not limited to the balance between great powers. Recent conflicts show how this technology is also transforming conventional warfare. The war in Ukraine and the earlier Nagorno-Karabakh conflict have demonstrated that small autonomous systems, inexpensive drones, and accessible AI capabilities can create enormous asymmetries against vastly more expensive weaponry. The battlefield of the future will be hybrid: physical, digital, and cognitive simultaneously.
Yet perhaps the author’s deepest warning is philosophical in nature. In a world saturated with information, the primary threat is not merely technological, but epistemological. If every act of understanding necessarily involves interpretation, as philosopher Hans-Georg Gadamer argued in his book Truth and Method, then the struggle to control interpretive frameworks becomes a struggle to control reality itself. That is why Gómez de Ágreda insists on recovering critical thinking and philosophical reflection as tools of democratic defense. The great battle of 21st century will not be decided solely in AI laboratories or military arsenals, but in societies’ ability to preserve their cognitive freedom against a technological ecosystem designed to influence, seduce, and manipulate.