We spoke with Dr. Swami Sivasubramanian, Vice President of Knowledge and AI, shortly after AWS re:Invent 2024 to listen to his impressions—and to get insights on how the most recent AWS improvements assist meet the real-world wants of consumers as they construct and scale transformative generative AI functions.
Q: What made this re:Invent completely different?
Swami Sivasubramanian: The theme I spoke about in my re:Invent keynote was easy however highly effective—convergence. I imagine that we’re at an inflection level not like another within the evolution of AI. We’re seeing a outstanding convergence of knowledge, analytics, and generative AI. It’s a mixture that permits next-level generative AI functions which are way more succesful. And it lets our clients transfer sooner in a very important manner, getting extra worth, extra rapidly. Corporations like Rocket Mortgage are constructing on an AI-driven platform powered by Amazon Bedrock to create AI brokers and automate duties—working to provide their staff entry to generative AI with no-code instruments. Canva makes use of AWS to energy 1.2 million requests a day and sees 450 new designs created each second. There’s additionally a human aspect to convergence, as folks throughout organizations are working collectively in new methods, requiring a deeper stage of collaboration between teams, like science and engineering groups. And this isn’t only a one-time collaboration. It’s an ongoing course of.
Folks’s expectations for functions and buyer experiences are altering once more with generative AI. More and more, I believe generative AI inference goes to be a core constructing block for each software. To appreciate this future, organizations want greater than only a chatbot or a single highly effective massive language mannequin (LLM). At re:Invent, we made some thrilling bulletins about the way forward for generative AI, after all. However we additionally launched a outstanding portfolio of recent merchandise, capabilities, and options that may assist our clients handle generative AI at scale—making it simpler to regulate prices, construct belief, improve productiveness, and ship ROI.
Q: Are there key improvements that construct on the expertise and classes realized at Amazon in adopting generative AI? How are you bringing these capabilities to your clients
Swami Sivasubramanian: Sure, our announcement of Amazon Nova, a brand new era of basis fashions (FMs), has state-of-the-art intelligence throughout a variety of duties and industry-leading worth efficiency. Amazon Nova fashions increase the rising number of the broadest and most succesful FMs in Amazon Bedrock for enterprise clients. The particular capabilities of Amazon Nova Micro, Lite, and Professional exhibit distinctive intelligence, capabilities, and pace—and carry out fairly competitively in opposition to the most effective fashions of their respective classes. Amazon Nova Canvas, our state-of-the-art picture era mannequin, creates skilled grade photographs from textual content and picture inputs, democratizing entry to production-grade visible content material for promoting, coaching, social media, and extra. Lastly, Amazon Nova Reel affords state-of-the-art video era that enables clients to create high-quality video from textual content or photographs. With about 1,000 generative AI functions in movement inside Amazon, teams like Amazon Adverts are using Amazon Nova to take away boundaries for sellers and advertisers, enabling new ranges of creativity and innovation. New capabilities like picture and video era are serving to Amazon Adverts clients promote extra merchandise of their catalogs, and experiment with new methods like keyword-level inventive to extend engagement and drive gross sales.

However there’s extra forward, and right here’s the place an essential shift is going on. We’re engaged on an much more succesful any-to-any mannequin the place you may present textual content, photographs, audio, and video as enter and the mannequin can generate outputs in any of those modalities. And we predict this multi-modal method is how fashions are going to evolve, transferring forward the place one mannequin can settle for any form of enter and generate any form of output. Over time, I believe that is what state-of-the-art fashions will appear like.
Q: Talking of bulletins like Amazon Nova, you’ve been a key innovator in AI for a few years. What continues to encourage you?
Swami Sivasubramanian: It’s fascinating to consider what LLMs are able to. What conjures up me most although is how can we assist our clients unblock the challenges they’re dealing with and understand that potential. Contemplate hallucinations. As extremely succesful as right this moment’s fashions are, they nonetheless generally tend to get issues mistaken sometimes. It’s a problem that a lot of our clients wrestle with when integrating generative AI into their companies and transferring to manufacturing. We explored the issue and requested ourselves if we might do extra to assist. We appeared inward, and leveraged Automated Reasoning, an innovation that Amazon has been utilizing as a behind-the-scenes expertise in a lot of our providers like id and entry administration.
I like to consider this case as yin and yang. Automated Reasoning is all about certainty and having the ability to mathematically show that one thing is right. Generative AI is all about creativity and open-ended responses. Although they may look like opposites, they’re really complementary—with Automated Reasoning finishing and strengthening generative AI. We’ve discovered that Automated Reasoning works very well when you’ve got an enormous floor space of an issue, a corpus of data about that drawback space, and when it’s crucial that you simply get the proper reply—which makes Automated Reasoning an excellent match for addressing hallucinations.
At re:Invent, we introduced Amazon Bedrock Guardrails Automated Reasoning checks—the first and only generative AI safeguard that helps stop factual errors attributable to hallucinations. All through the use of logically correct and verifiable reasoning that explains why generative AI responses are right. I believe that it’s an innovation that may have important impression throughout organizations and industries, serving to construct belief and speed up generative AI adoption.
Q: Controlling prices is essential to all organizations, massive and small, significantly as they take generative AI functions into manufacturing. How do the bulletins at re:Invent reply this want?
Swami Sivasubramanian: Like our clients, right here at Amazon we’re rising our funding in generative AI improvement, with a number of tasks in course of—all requiring well timed entry to accelerated compute sources. However allocating optimum compute capability to every mission can create a provide/demand problem. To handle this problem, we created an inside service that helped Amazon drive utilization of compute sources to greater than 90% throughout all our tasks. This service enabled us to easy out demand throughout tasks and obtain increased capability utilization, dashing improvement.
As with Automated Reasoning, we realized that our clients would additionally profit from these capabilities. So, at re:Invent, I introduced the brand new job governance functionality in Amazon SageMaker HyperPod, which helps our clients optimize compute useful resource utilization and scale back time to market by as much as 40%. With this functionality, customers can dynamically run duties throughout the end-to-end FM workflow— accelerating time to marketplace for AI improvements whereas avoiding price overruns attributable to underutilized compute sources.
Our clients additionally inform me that the trade-off between price and accuracy for fashions is actual. We’re answering this want by making it super-easy to guage fashions on Amazon Bedrock, so that they don’t should spend months researching and making comparisons. We’re additionally decreasing prices with game-changing capabilities such Amazon Bedrock Mannequin Distillation, which pairs fashions for decrease prices; Amazon Bedrock Clever Immediate Routing, which manages prompts extra effectively, at scale; and immediate caching, which reduces repeated processing with out compromising on accuracy.
Q: Larger productiveness is without doubt one of the core guarantees of generative AI. How is AWS serving to staff in any respect ranges be extra productive?
Swami Sivasubramanian: I wish to level out that utilizing generative AI turns into irresistible when it makes staff 10 occasions extra productive. In brief, not an incremental improve, however a serious leap in productiveness. And we’re serving to staff get there. For instance, Amazon Q Developer is remodeling code improvement by caring for the time-consuming chores that builders don’t wish to cope with, like software program upgrades. And it additionally helps them transfer a lot sooner by automating code critiques and coping with mainframe modernization. Contemplate Novacomp, a number one IT firm in Latin America, which leveraged Amazon Q Developer to improve a mission with over 10,000 strains of Java code in simply 50 minutes, a job that might have sometimes taken an estimated 3 weeks. The corporate additionally simplified on a regular basis duties for builders, decreasing its technical debt by 60% on common.
On the enterprise aspect, Amazon Q Enterprise is bridging the hole between unstructured and structured information, recognizing that almost all companies want to attract from a mixture of information. With Amazon Q in QuickSight, non-technical customers can leverage pure language to construct, uncover, and share significant insights in seconds. Now they will entry databases and information warehouses, in addition to unstructured enterprise information, like emails, experiences, charts, graphs, and pictures.
And searching forward, we introduced advanced agentic capabilities for Amazon Q Enterprise, coming in 2025, which can use brokers to automate advanced duties that stretch throughout a number of groups and functions. Brokers give generative AI functions next-level capabilities, and we’re bringing them to our clients through Amazon Q Enterprise, in addition to Amazon Bedrock multi-agent collaboration, which improves profitable job completion by 40% over in style options. This main enchancment interprets to extra correct and human-like outcomes in use circumstances like automating buyer assist, analyzing monetary information for threat administration, or optimizing supply-chain logistics.
It’s all a part of how we’re enabling larger productiveness right this moment, with much more on the horizon.
Q: To get staff and clients adopting generative AI and benefiting from that elevated productiveness, it must be trusted. What steps is AWS taking to assist construct that belief?
Swami Sivasubramanian: I believe that lack of belief is an enormous impediment to transferring from proof of idea to manufacturing. Enterprise leaders are about to hit go they usually hesitate as a result of they don’t wish to lose the belief of their clients. As generative AI continues to drive innovation throughout industries and our day by day life, the necessity for accountable AI has turn out to be more and more acute. And we’re serving to meet that want with improvements like Amazon Bedrock Automated Reasoning, which I discussed earlier, that works to forestall hallucinations—and will increase belief. We additionally introduced new LLM-as-a-judge capabilities with Amazon Bedrock Mannequin Analysis so now you can carry out assessments and consider different fashions with humanlike high quality at a fraction of the associated fee and time of working human evaluations. These evaluations assess a number of high quality dimensions, together with correctness, helpfulness, and accountable AI standards similar to reply refusal and harmfulness.
I also needs to point out that AWS just lately turned the primary main cloud supplier to announce ISO/IEC 42001 accredited certification for AI providers, overlaying Amazon Bedrock, Amazon Q Enterprise, Amazon Textract, and Amazon Transcribe. This worldwide administration system customary outlines necessities and controls for organizations to advertise the accountable improvement and use of AI methods. Technical requirements like ISO/IEC 42001 are important as a result of they supply a much-needed widespread framework for accountable AI improvement and deployment.
Q: Knowledge stays central to constructing extra personalised experiences relevant to your small business. How do the re:Invent launches assist AWS clients get their information prepared for generative AI?
Swami Sivasubramanian: Generative AI isn’t going to be helpful for organizations except it could actually seamlessly entry and deeply perceive the group’s information. With these insights, our clients can create personalized experiences, similar to extremely personalised customer support brokers that may assist service representatives resolve points sooner. For AWS clients, getting information prepared for generative AI isn’t only a technical problem—it’s a strategic crucial. Proprietary, high-quality information is the important thing differentiator in remodeling generic AI into highly effective, business-specific functions. To organize for this AI-driven future, we’re serving to our clients construct a sturdy, cloud-based information basis, with built-in safety and privateness. That’s the spine of AI readiness.
With the next generation of Amazon SageMaker introduced at re:Invent, we’re introducing an built-in expertise to entry, govern, and act on all of your information by bringing collectively extensively adopted AWS information, analytics, and AI capabilities. Collaborate and construct sooner from a unified studio utilizing acquainted AWS instruments for mannequin improvement, generative AI, information processing, and SQL analytics—with Amazon Q Developer helping you alongside the way in which. Entry all of your information whether or not it’s saved in information lakes, information warehouses, third-party or federated information sources. And transfer with confidence and belief, because of built-in governance to handle enterprise safety wants.

At re:Invent, we additionally launched key Amazon Bedrock capabilities that assist our clients maximize the worth of their information. Amazon Bedrock Data Bases now affords the one managed, out-of-the-box Retrieval Augmented Era (RAG) resolution, which permits our clients to natively question their structured information the place it resides, accelerating improvement. Help for GraphRAG generates extra related responses by modeling and storing relationships between information. And Amazon Bedrock Knowledge Automation transforms unstructured, multimodal information into structured information for generative AI—mechanically extracting, remodeling, and producing usable information from multimodal content material, at scale. These capabilities and extra assist our clients leverage their information to create highly effective, insightful generative AI functions.
Q: What did you’re taking away out of your buyer conversations at re:Invent?
Swami Sivasubramanian: I proceed to be amazed and impressed by our clients and the essential work they’re doing. We proceed to supply our clients the selection and specialization they should energy their distinctive use circumstances. With Amazon Bedrock Market, clients now have entry to greater than 100 in style, rising, and specialised fashions.
At re:Invent, I heard quite a bit concerning the new effectivity and transformative experiences clients are creating. I additionally heard about improvements which are altering folks’s lives. Like Actual Sciences, a molecular diagnostic firm, which developed an AI-powered resolution utilizing Amazon Bedrock to speed up genetic testing and evaluation by 50%. Behind that metric there’s an actual human worth—enabling earlier most cancers detection and personalised remedy planning. And that’s only one story amongst hundreds, as our clients attain increased and construct sooner, attaining spectacular outcomes that change industries and enhance lives.
I get excited after I take into consideration how we might help educate the subsequent wave of innovators constructing these experiences. With the launch of the brand new Schooling Fairness Initiative, Amazon is committing as much as $100 million in cloud expertise and technical sources to assist present, devoted studying organizations attain extra learners by creating new and modern digital studying options. That’s really inspiring to me.
In actual fact, the tempo of change, the outstanding improvements we launched at re:Invent, and the keenness of our clients all jogged my memory of the early days of AWS, when something appeared doable. And now, it nonetheless is.
Concerning the creator
Swami Sivasubramanian is VP, AWS AI & Knowledge. On this position, Swami oversees all AWS Database, Analytics, and AI & Machine Studying providers. His group’s mission is to assist organizations put their information to work with an entire, end-to-end information resolution to retailer, entry, analyze, and visualize, and predict.

