Top 3 Myths About Securing AI Applications

AI & ML Security

May 20, 2025

5 mins

Despite growing adoption, many AI applications are built on false assumptions about model safety, prompt injection, and system behavior. Below this post unpacks the top myths we encounter in the field andwhy real-world threat pressure demands a different approach.

“𝗧𝗵𝗲 𝗺𝗼𝗱𝗲𝗹 𝗶𝘀 𝘀𝗲𝗰𝘂𝗿𝗲, 𝘀𝗼 𝘁𝗵𝗲 𝗮𝗽𝗽 𝗶𝘀 𝘀𝗲𝗰𝘂𝗿𝗲.”

Not even close.

Even a perfectly fine-tuned LLM can be misused in insecure workflows; prompt injection, tool overreach, vector poisoning, and downstream abuse don’t care how safe your base model is. In some cases the larger the model the more easily it can be coaxed into performing undesired behaviour.

“𝗣𝗿𝗼𝗺𝗽𝘁 𝗶𝗻𝗷𝗲𝗰𝘁𝗶𝗼𝗻 𝗶𝘀 𝘀𝗼𝗹𝘃𝗲𝗱.”

It’s not.

Regex filters and system prompts aren’t silver bullets.

Attackers chain context, leverage encodings, embed triggers, poison memory and bypass naive controls in ways many teams haven’t even threat modelled yet. A recent paper from only a few weeks ago found multiple bypass techniques which worked across all tested guardrails. (“Bypassing Prompt Injection and Jailbreak Detection in LLM Guardrails” - https://lnkd.in/g-prgNCM) - One of the most successful uses using emoji variation selectors (aka emoji smuggling) 😲.

“𝗜𝘁’𝘀 𝗷𝘂𝘀𝘁 𝗮𝗻𝗼𝘁𝗵𝗲𝗿 𝗺𝗶𝗰𝗿𝗼𝘀𝗲𝗿𝘃𝗶𝗰𝗲.”

If only.

Traditional authN/Z patterns and input/output validation break down when your app includes a non-deterministic reasoning engine that can interpret context, rephrase inputs, and initiate tool use. AI apps just don’t behave like REST APIs under pressure and can often surprise.

GenAI introduces a new category of dynamic non-deterministic cyber risk, requiring full-stack, continuous, AI-specific security testing.

At 𝗔𝗽𝗽𝘀𝘂𝗿𝗲𝗻𝘁 𝗖𝘆𝗯𝗲𝗿 𝗦𝗲𝗰𝘂𝗿𝗶𝘁𝘆, we're working with teams to address these myths to help 𝗯𝘂𝗶𝗹𝗱 𝗿𝗲𝘀𝗶𝗹𝗶𝗲𝗻𝘁 𝗮𝗽𝗽𝗹𝗶𝗰𝗮𝘁𝗶𝗼𝗻𝘀 𝗿𝗼𝗼𝘁𝗲𝗱 𝗶𝗻 𝗿𝗲𝗮𝗹-𝘄𝗼𝗿𝗹𝗱 𝘁𝗵𝗿𝗲𝗮𝘁 𝗽𝗿𝗲𝘀𝘀𝘂𝗿𝗲, not hopeful or incomplete assumptions.

Has your organization started integrating adversarial thinking into AI application deployment yet?

‍

Jamie Baxter

Principal at Appsurent