Omdia Analysts' Perspectives

View All Series Articles

Opinion

Amazon's innovative generative AI moves at re:Invent 2023

Analyst Mike Leone takes a comprehensive look at Amazon's generative AI announcements at this year's re:Invent, including the Q assistant, Bedrock enhancements and chip updates.

Mike Leone

By

Mike Leone, Principal Analyst
Omdia
Intelligence and advice powered by decades of global expertise and comprehensive coverage of the tech markets.

Published: 12 Dec 2023

AWS re:Invent marked the culmination of 2023, wrapping up a year dominated by generative AI. As usual, AWS took over the Las Vegas strip with events, sessions and keynotes across most of the hotel properties.

The event featured over 250 enterprise tech announcements, spanning data management, analytics, storage, app development, governance and security. Notably, the pervasive theme was the prevalence of generative AI.

Overall, AWS' focus on simplicity, flexibility and stakeholder empowerment was palpable. Rather than overwhelming attendees with a bewildering array of 100 new services, AWS' approach this year was simpler and more calculated. It was a more mature AWS than we've seen in the past.

Although the event catered to builders, it spotlighted the offloading of technical complexities. AWS positioned itself to start taking on some of the heavy lifting, enabling users to accomplish more with data faster and more reliably than before. Additionally, several announcements focused on performance improvements and optimizations designed to deliver cost savings while enabling customers to store, access, process and analyze data faster and more confidently.

It's remarkable that AWS, which some have perceived as behind competitors from a generative AI standpoint, has not only caught up but is introducing innovative differentiators that put the competition on notice. Collectively, I was impressed with all the announcements, integrations and the ambitious roadmap at this year's re:Invent. It was a week of drinking from a firehose, and there are no signs of a slowdown in 2024.

Continuing the path of delivering a zero-ETL future

AWS recognizes that customers need to break down data silos, and it's a big reason why the company is pushing for a future that eliminates complex, manual data integration efforts by moving away from the extract, transform and load (ETL) model.

Last year, AWS made its first zero-ETL announcements. This year saw an expansion of four new zero-ETL integrations. These included three connections to Amazon Redshift from Aurora PostgreSQL, RDS for MySQL and DynamoDB, as well as another from DynamoDB to OpenSearch.

AWS appears committed to continued investment in making it easier to connect data and removing the challenges that come with building and maintaining data pipelines. By taking on that burden, AWS enables customers to reclaim time for gaining new insights and driving innovation.

Simplifying data cataloging and discovery with generative AI and Amazon DataZone

In last year's re:Invent recap, I highlighted Amazon DataZone as the biggest announcement of the event. This year, AWS announced a preview of new automation capabilities within DataZone that use generative AI to decrease the time needed to catalog data. This is great news for customers who struggle with not only cataloging but also providing comprehensive business context for their data.

The forthcoming capability eventually aims to use a large language model (LLM) in Amazon Bedrock to generate detailed descriptions of data and the underlying schema, then suggest ways to analyze it or tie it into a specific use case. And while customers have latched on to DataZone for its ability to deliver trusted data, these new capabilities are poised to gain rapid traction, addressing customers' consistent struggle to deliver contextual data across business functions.

Delivering a comprehensive generative AI stack fueled by Amazon Bedrock

AWS dedicated considerable time to discussing the company's approach to supporting generative AI with a comprehensive infrastructure stack. The foundation is an infrastructure layer for model training and inference, followed by a middle layer that enables businesses to incorporate generative AI into their processes and products in a flexible and reliable way. The top layer focuses on building modern apps powered by the underlying layers.

Many of the AWS services that can be found across the three-layer stack have been released. But this announcement was more about framing for me. In a competitive landscape focused on delivering a complete platform, AWS' approach positions the company in the same conversation. If anything, this messaging puts AWS on the same competitive level, which was necessary given the company's slow rollout of generative AI services at the beginning of the year.

New chips and capabilities accelerate building, training and deploying generative AI

AWS announced two next-generation chips: Graviton4 and Trainium2. Graviton4 is a fourth-generation ARM-based processor, which AWS claims is 30% faster than Graviton3 with 50% more cores. Trainium2 is a second-generation training chip that AWS says is four times faster than its predecessor and can enable customers to achieve 65 exaflops of performance. While these figures might seem excessive, these advancements are predicted to reduce the training time for the largest LLMs from months to weeks.

Of course, there were new announcements in partnership with Nvidia. When it comes to hardware-specific announcements, it's really about accessibility of hardware and giving customers the flexibility to choose what works best for their particular use cases.

This flexibility extends to enabling customers interested in building and customizing their own models to create unique experiences with new capabilities in Amazon SageMaker. These capabilities are focused on helping customers accelerate model building, training and deployment. Specifically, AWS says that SageMaker HyperPods help reduce the time required to train foundation models by up to 40% and make it far more efficient to manage massive, distributed clusters. AWS also introduced some inference optimizations aimed at reducing model deployment costs by an average of 50%, achieved by delivering multiple models within the same instance type.

New Amazon Bedrock integrations and capabilities

AWS announced a number of new Bedrock capabilities -- far too many to cover in this article. However, a few highlights stood out to me.

The first was Guardrails for Bedrock, which aims to help organizations leverage generative AI in a responsible way. This feature is the first step in empowering customers to easily configure content filtering or redact certain information from responses. Another standout is Knowledge Bases for Bedrock, a feature that securely connects models in Bedrock to company data using retrieval-augmented generation.

Finally, Agents for Bedrock also aims to help accelerate generative AI development by using the reasoning capabilities of foundation models to break down user-requested tasks into multiple steps. This provides customers with improved control over the orchestration of responses as well as better visibility into the chain-of-thought reasoning process.

A new generative AI assistant: Amazon Q

With AI assistants all the rage, Amazon announced Amazon Q, a new generative AI assistant set to launch across the entire AWS ecosystem of services. I would argue this was the biggest announcement of the event because it places AWS alongside the company's top competitors, who also have intelligent, natural language assistants. Amazon Q is slated to eventually integrate with the full AWS portfolio, including Redshift, AWS Glue, Amazon Connect and Amazon QuickSight, the latter of which has featured a version of Q dedicated to business intelligence tasks for some time.

Customers can engage in conversations with Amazon Q to solve problems, generate content and take actions. The assistant can understand company information, code and systems, and can personalize interactions based on the user's role and permissions. It is also built with a strong emphasis on security and privacy. Amazon Q is poised to eventually provide guidance to all teams and stakeholders interacting across the AWS ecosystem of services, including developers, line-of-business leaders and data specialists.

Mike Leone is a principal analyst at TechTarget's Enterprise Strategy Group, where he covers data, analytics and AI.

Enterprise Strategy Group is a division of TechTarget. Its analysts have business relationships with technology vendors.

Next Steps

AWS customers grapple with generative AI shortfalls

New dev tools at AWS re:Invent shape the future of cloud

Dig Deeper on Machine learning platforms

Search Business Analytics

Why ethical use of data is so important to enterprises
Enterprises that don't use data ethically have a lot to lose. To maintain their businesses' trustworthiness and value, executives...
Domo adds App Catalyst to platform to aid AI development
By combining natural language code generation with enterprise-grade security and governance, the vendor aims to help customers ...
The future of business intelligence: 10 top trends in 2026
Here are 10 key trends affecting the current state and future direction of BI initiatives that analytics leaders should be aware ...

Search CIO

Inside a CIO's mind: Mastering time and knowing the business
CIO Sean McCormack explains how he balances strategy, vendors and frontline engagement -- and why his to-do list lives on his ...
CIOs are feeling the pressure of the AI leadership gap
In this Q&A, Wendy Lynch, founder of Analytic Translator, discusses how CIOs need to close a leadership gap to overcome the huge ...
Why companies should be sustainable and how IT can help
Pressure is mounting for the business sector to address its environmental footprint and become more sustainable. Here's a look at...

Search Data Management

Databricks launches PostgreSQL Lakebase to aid AI developers
Resulting from the $1B acquisition of Neon, the database built for AI workloads -- including separate compute and storage -- is ...
Pentaho update aids data integration, semantic modeling
The vendor's latest platform update aims to speed, simplify and better govern workloads to help customers build a trusted ...
Snowflake launches new AI tools, unveils OpenAI partnership
New features such as an agent-powered code generator and automated semantic modeling simplify developing cutting-edge ...

Search ERP

Who's really governing enterprise systems: IT or leaders?
Across ERP, HR software and mobile platforms, governance decisions are being set earlier, often before organizations realize ...
C-suite should make AI data management the 2026 ERP priority
Aligning data lakehouses with those of ERP vendors and data partners is important, but it won't be enough without silo-busting ...
8 ERP security best practices for modern ERP environments
As supply chain attacks continue, ERP security requires strong authentication, regular patching, monitoring and incident response...

Close