The security of genAI models is iffy and takes a back seat to other issues, but with developers increasingly using genAI for code, it needs to become a priority.
Well, that didn't last long. Generative AI has existed just a few short years, but already we seem to be alternating between bouts of disillusionment and euphoria. More worrying, however, is not where we are on Gartner's Hype Cycle, but how genAI has already been weaponized, sometimes by accident and sometimes quite intentionally. It's normal for new technologies to overlook security as they rise to prominence, but for genAI, security shortcomings may erode the trust the technology needs for widespread production use.
Security through obscurity
Early in a technology's existence, other concerns like performance or convenience may trump security. For years we in the open source world were far too cavalier about security, trusting in smart-sounding phrases such as "given enough eyeballs, all bugs are shallow" when, in fact, few "eyeballs" actively look at source code. Even though it's true that open source processes tend toward security even if open source code doesn't, we took security as a birthright when it was far more likely that much open source software was secure simply because no one had bothered to exploit it yet.
That comfortable myth was shattered by Heartbleed in 2014. Since then, there's been a steady drumbeat of supply chain attacks against Linux and other prominent open source software, making open source security, not licensing, the must-solve issue for developers. In fact, by one recent estimate, open source malware is up 200% since 2023 and will continue to rise as developers embed open source packages into their projects. As the report authors note, "Open source malware thrives in ecosystems with low entry barriers, no author verification, high usage, and diverse users."
Worsening that situation, developers are increasingly saving time by using AI to author bug reports. Such "low-quality, spammy, and LLM [large language model]-hallucinated security reports," as Python's Seth Larson calls them, overload project maintainers with time-wasting garbage, making it harder to keep projects secure. AI is also responsible for introducing bugs into software, as Symbiotic Security CEO Jerome Robert details. "GenAI platforms, such as [GitHub] Copilot, learn from code posted to sites like GitHub and have the potential to pick up some bad habits along the way" because "security is a secondary objective (if at all)." GenAI, in other words, is highly impressionable and will regurgitate the same bugs (or racist commentary) that it picks up from its source material.
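To make that concrete, here is a minimal, hypothetical sketch in Python, not drawn from any actual Copilot output, and with table and column names invented for the example. It shows the kind of insecure habit that pervades public repositories and that a code assistant trained on them could plausibly reproduce: building a SQL query by concatenating user input rather than using a parameterized query.

```python
import sqlite3

def find_user_insecure(conn: sqlite3.Connection, username: str):
    # The habit an assistant can absorb from public repos: user input
    # concatenated straight into SQL, which enables SQL injection
    # (try username = "x' OR '1'='1").
    query = "SELECT id, email FROM users WHERE name = '" + username + "'"
    return conn.execute(query).fetchall()

def find_user_parameterized(conn: sqlite3.Connection, username: str):
    # The safer habit: a parameterized query, so the driver treats the
    # input as data rather than as executable SQL.
    return conn.execute(
        "SELECT id, email FROM users WHERE name = ?", (username,)
    ).fetchall()
```

Both functions return the same rows for benign input; only the second holds up when the input is hostile, which is exactly the distinction a model trained on "whatever compiles" tends to miss.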
What, me worry?
None of this matters so long as we're just using generative AI to wow people on X with yet another demo of "I can't believe AI can create a video I'd never pay to watch." But as genAI is increasingly used to build all the software we use... well, security matters. A lot.
Unfortunately, it doesn't yet matter to OpenAI and the other companies building large language models. According to the newly released AI Safety Index, which grades Meta, OpenAI, Anthropic, and others on risk and safety, industry LLMs are, as a group, on track to flunk out of their freshman year in AI college. The best-performing company, Anthropic, earned a C. As Stuart Russell, one of the report's authors and a UC Berkeley professor, opines, "Although there is a lot of activity at AI companies that goes under the heading of 'safety,' it is not yet very effective." Further, he says, "None of the current activity provides any kind of quantitative guarantee of safety; nor does it seem possible to provide such guarantees given the current approach to AI via giant black boxes trained on unimaginably vast quantities of data." Not overly encouraging, right?
Meanwhile, genAI is still searching for customers, and one area where it's seeing widespread adoption is software development. Developers increasingly default to tools like GitHub Copilot for code completion, but what if such tools have been poisoned with malicious code? This is a rising threat, and one resistant to detection. It's only going to get worse as developers come to depend on these tools.
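As a hedged illustration of one cheap guardrail (a sketch, not a vetted tool; the package names and threshold are assumptions for the example), a team could at least flag suggested dependencies whose names sit suspiciously close to well-known packages, a classic typosquatting tell that a poisoned or hallucinated completion might exploit.

```python
from difflib import SequenceMatcher

# A small allowlist of packages the team actually relies on (illustrative only).
KNOWN_PACKAGES = {"requests", "numpy", "pandas", "cryptography", "urllib3"}

def looks_like_typosquat(candidate: str, threshold: float = 0.85) -> bool:
    """Return True if `candidate` closely resembles, but is not exactly,
    a well-known package name."""
    if candidate in KNOWN_PACKAGES:
        return False
    return any(
        SequenceMatcher(None, candidate, known).ratio() >= threshold
        for known in KNOWN_PACKAGES
    )

# A plausible-looking suggestion that deserves a second look before installing.
print(looks_like_typosquat("requessts"))  # True
print(looks_like_typosquat("requests"))   # False
```

It's a crude heuristic, not a defense against a genuinely poisoned model, but it captures the broader point: the output of code assistants needs the same scrutiny we now (belatedly) apply to open source dependencies.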
And yet, there's also cause for hope. As happened with open source, LLM security will likely improve as enterprises demand it. Today, the pressure to improve the accuracy and utility of LLMs crowds out security as a first-order concern, but we're already seeing unease over genAI security hamper adoption. Enterprises need to demand that genAI vendors deliver stronger security rather than letting them coast by on hype.


