AutoScout24 is Europe’s main automotive market platform that connects consumers and sellers of latest and used vehicles, bikes, and business autos throughout a number of European nations. Their long-term imaginative and prescient is to construct a Bot Manufacturing unit, a centralized framework for creating and deploying synthetic intelligence (AI) brokers that may carry out duties and make selections inside workflows, to considerably enhance operational effectivity throughout their group.
From disparate experiments to a standardized framework
As generative AI brokers (methods that may motive, plan, and act) turn out to be extra highly effective, the chance to enhance inner productiveness for AutoScout24 was clear. This led to numerous engineering groups experimenting with the know-how. As AI innovation accelerated throughout AutoScout24, they acknowledged a chance to pioneer a standardized method for AI growth. Whereas AutoScout24 had efficiently experimented with numerous instruments and frameworks on Amazon Net Providers (AWS), they envisioned making a unified, enterprise-grade framework that might allow quicker innovation. Their aim was to determine a paved path that might make it simpler for groups throughout the group to construct safe, scalable, and maintainable AI brokers. The AutoScout24 AI Platform Engineering crew partnered with the AWS Prototype and Cloud Engineering (PACE) crew in a three-week AI bootcamp. The aim was to maneuver from fragmented experiments to a coherent technique by making a reusable blueprint, a Bot Manufacturing unit, to standardize how future AI brokers are constructed and operated inside their firm.
The problem: figuring out a high-impact use case
To floor the Bot Manufacturing unit blueprint in a tangible enterprise case, the crew focused a major operational value: inner developer assist. The issue was well-defined. AutoScout24 AI Platform engineers had been spending as much as 30% of their time on repetitive duties like answering questions, granting entry to instruments, and finding documentation. This assist tax lowered total productiveness. It diverted expert engineers from high-priority characteristic growth and compelled different builders to attend for routine requests to be accomplished. An automatic assist bot was a super first use case as a result of it wanted to carry out two core agent capabilities:
- Data retrieval: Answering “how-to” questions by looking inner documentation, a functionality often called Retrieval Augmented Era (RAG).
- Motion execution: Performing duties in different methods, equivalent to assigning a GitHub Copilot license, which requires safe API integration, or “device use.”
By constructing a bot that might do each, the crew may validate the blueprint whereas delivering speedy enterprise worth.
Architectural overview
On this submit, we discover the structure that AutoScout24 used to construct their standardized AI growth framework, enabling speedy deployment of safe and scalable AI brokers.
The structure is designed with a easy, decoupled stream to ensure the system is each resilient and easy to take care of. The diagram offers a simplified view targeted on the core generative-AI workflow. In a manufacturing setting, extra AWS providers equivalent to AWS Identification and Entry Administration (IAM), Amazon CloudWatch, AWS X-Ray, AWS CloudTrail, AWS Net Software Firewall (WAF), and AWS Key Administration Service (KMS) may very well be built-in to boost safety, observability, and operational governance.
Right here is how a request flows via the system:
- Person interplay through Slack: A developer posts a message in a assist channel, for instance, “@SupportBot, can I get a GitHub Copilot license?“
- Safe ingress through Amazon API Gateway & AWS Lambda: Slack sends the occasion to an Amazon API Gateway endpoint, which triggers an AWS Lambda operate. This operate performs a necessary safety test, verifying the request’s cryptographic signature to verify it’s authentically from Slack.
- Decoupling through Amazon Easy Queue Service (SQS): The verified request is positioned onto an Amazon SQS First-In, First-Out (FIFO) queue. This decouples the front-end from the agent, enhancing resilience. Utilizing a FIFO queue with the message’s thread timestamp because the MessageGroupId makes positive that replies inside a single dialog are processed so as, which is vital for sustaining coherent conversations.
- Agent execution through Amazon Bedrock AgentCore: The SQS queue triggers a Lambda operate when messages arrive, which prompts the agent working within the AgentCore Runtime. AgentCore manages the operational duties, together with orchestrating calls to the inspiration mannequin and the agent’s instruments. The Orchestrator Agent’s logic, constructed with Strands Brokers, analyzes the consumer’s immediate and determines the right specialised agent to invoke—both the Data Base Agent for a query or the GitHub Agent for an motion request.
A vital implementation element is how the system leverages AgentCore’s full session isolation. To take care of conversational context, the system generates a singular, deterministic sessionId for every Slack thread by combining the channel ID and the thread’s timestamp. This sessionId is handed with each agent invocation inside that thread. Interactions in a thread share this identical sessionId, so the agent treats them as one steady dialog. In the meantime, interactions in different threads get totally different sessionIds, holding their contexts separate. In impact, every dialog runs in an remoted session: AgentCore spins up separate assets per sessionId, so context and state don’t leak between threads. In follow, because of this if a developer sends a number of messages in a single Slack thread, the agent remembers the sooner elements of that dialog. Every thread’s historical past is preserved robotically by AgentCore.
This session administration technique can be very important for observability. Based mostly on a singular sessionId, the interplay will be traced utilizing AWS X-Ray, which affords perception into the stream – from the Slack message arriving at API Gateway to the message being enqueued in SQS. It follows the orchestrator’s processing, the decision to the inspiration mannequin, subsequent device invocations (equivalent to a knowledge-base lookup or a GitHub API name), and at last the response again to Slack.
Metadata and timing assist point out the stream of every step to grasp the place time is spent. If a step fails or is sluggish (for instance, a timeout on an exterior API name), X-Ray pinpoints which step brought on the difficulty. That is invaluable for diagnosing issues shortly and constructing confidence within the system’s habits.
The answer: A reusable blueprint powered by AWS
The Bot Manufacturing unit structure designed by the AutoScout24 and AWS groups is event-driven, serverless, and constructed on a basis of managed AWS providers. This method offers a resilient and scalable sample that may be tailored for brand new use circumstances.
The answer builds on Amazon Bedrock and its built-in capabilities:
- Amazon Bedrock offers entry to high-performing basis fashions (FMs), which act because the reasoning engine for the agent.
- Amazon Bedrock Data Bases permits the RAG functionality, permitting the agent to connect with AutoScout24’s inner documentation and retrieve info to reply questions precisely.
- Amazon Bedrock AgentCore is a key element of the operational aspect of the blueprint. It offers the absolutely managed, serverless runtime setting to deploy, function, and scale the brokers.
This answer offers a major benefit for AutoScout24. As a substitute of constructing foundational infrastructure for session administration, safety, and observability, they use AgentCore’s purpose-built providers. This enables the crew to give attention to the agent’s enterprise logic quite than the underlying infrastructure. AgentCore additionally offers built-in safety and isolation options. Every agent invocation runs in its personal remoted container, serving to to forestall knowledge leakage between classes. Brokers are assigned particular IAM roles to limit their AWS permissions (following the precept of least privilege). Credentials or tokens wanted by agent instruments (equivalent to a GitHub API key) are saved securely in AWS Secrets and techniques Supervisor and accessed at runtime. These options give the crew a safe setting for working brokers with minimal customized infrastructure.
The agent itself was constructed utilizing the Strands Brokers SDK, an open-source framework that simplifies defining an agent’s logic, instruments, and habits in Python. This mixture proves efficient: Strands to construct the agent, and AgentCore to securely run it at scale. The crew adopted a complicated “agents-as-tools” design sample, the place a central orchestrator Agent acts as the principle controller. This orchestrator doesn’t include the logic for each potential job. As a substitute, it intelligently delegates requests to specialised, single-purpose brokers. For the assist bot, this included a Data Base agent for dealing with informational queries and a GitHub agent for executing actions like assigning licenses. This modular design makes it easy to increase the system with new capabilities, equivalent to including a PR evaluation agent with out re-architecting your entire pipeline. Operating these brokers on Amazon Bedrock additional enhances flexibility, because the crew can select from a broad vary of basis fashions. Extra highly effective fashions will be utilized to advanced reasoning duties, whereas lighter, cost-efficient fashions are well-suited for routine employee brokers equivalent to GitHub license requests or operational workflows. This potential to combine and match fashions permits Autoscout24 to steadiness value, efficiency, and accuracy throughout their agent structure.
Orchestrator agent: constructed with Strands SDK
Utilizing the Strands Brokers SDK helped the crew to outline the orchestrator agent with concise, declarative code. The framework makes use of a model-driven method, the place the developer focuses on defining the agent’s directions and instruments, and the inspiration mannequin handles the reasoning and planning. The orchestrator agent will be expressed in only a few dozen traces of Python. The instance snippet under (simplified for readability, not meant for direct use) reveals how the agent is configured with a mannequin, a system immediate, and an inventory of instruments (which on this structure signify the specialised brokers):
One other instance is the GitHub Copilot license agent. It’s applied as a Strands device operate. The next snippet reveals how the crew outlined it utilizing the @device decorator. This operate creates a GitHubCopilotSeatAgent, passes the consumer’s request (a GitHub username) to it, and returns the end result:
Key advantages of this method embody clear separation of issues. The developer writes declarative code targeted on the agent’s goal. The advanced infrastructure logic, together with scaling, session administration, and safe execution, is dealt with by Amazon Bedrock AgentCore. This abstraction permits speedy growth and allowed AutoScout24 to maneuver from prototype to manufacturing extra shortly. The instruments listing successfully makes different brokers callable capabilities, permitting the orchestrator to delegate duties without having to know their inner implementation.
The affect: A validated blueprint for enterprise AI
The Bot Manufacturing unit undertaking delivers outcomes that prolonged past the preliminary prototype. It creates speedy enterprise worth and establishes a strategic basis for future AI innovation at AutoScout24.The important thing outcomes had been:
- A production-ready assist bot: The crew deployed a purposeful Slack bot that’s actively decreasing the handbook assist load on the AutoScout24 AI Platform Engineering Group, addressing the 30% of time beforehand spent on repetitive duties.
- A reusable Bot Manufacturing unit blueprint: The undertaking produces a validated, reusable architectural sample. Now, groups at AutoScout24 can construct a brand new agent by beginning with this confirmed template (Slack -> API Gateway -> SQS -> AgentCore). This considerably accelerates innovation by permitting groups to give attention to their distinctive enterprise logic, not on reinventing the infrastructure. This modular design additionally prepares them for extra superior multi-agent collaboration, probably utilizing requirements just like the Agent-to-Agent (A2A) protocol as their wants evolve.
- Enabling broader AI growth: By abstracting away the infrastructure complexity, the Bot Manufacturing unit empowers extra individuals to construct AI options. A website skilled in safety or knowledge analytics can now create a brand new device or specialised agent and “plug it in” to the manufacturing unit without having to be an skilled in distributed methods.
Conclusion: A brand new mannequin for enterprise brokers
AutoScout24’s partnership with AWS turned fragmented generative AI experiments right into a scalable, standardized framework. By adopting Amazon Bedrock AgentCore, the crew moved their assist bot from prototype to manufacturing, whereas specializing in their Bot Manufacturing unit imaginative and prescient. AgentCore manages session state and scaling, so engineers can give attention to high-value enterprise logic as a substitute of infrastructure. The end result is greater than a assist bot: it’s a reusable basis for constructing enterprise brokers. With AgentCore, AutoScout24 can transfer from prototype to manufacturing effectively, setting a mannequin for a way organizations can standardize generative AI growth on AWS. To begin constructing enterprise brokers with Amazon Bedrock, discover the next assets:
Concerning the authors
Andrew Shved is a Senior AWS Prototyping Architect who leads groups and clients in constructing and delivery Generative AI–pushed options, from early prototypes to manufacturing on AWS.
Muhammad Uzair Aslam is a tenured Technical Program Supervisor on the AWS Prototyping crew, the place he works intently with clients to speed up their cloud and AI journeys. He thrives on diving deep into technical particulars and turning complexity into impactful, value-driven options.
Arslan Mehboob is a Platform Engineer and AWS-certified options architect with deep experience in cloud infrastructure, scalable methods, and software program engineering. He at the moment builds resilient cloud platforms and is obsessed with AI and rising applied sciences.
Vadim Shiianov is a Information Scientist specializing in machine studying and AI-driven methods for real-world enterprise purposes. He works on designing and deploying ML and Generative AI options that translate advanced knowledge into measurable affect. He’s obsessed with rising applied sciences and constructing sensible, scalable methods round them.

