Getty Images/iStockphoto

Tip

Use LLMs for data analysis to improve business operations

Data professionals can use LLMs for data and predictive analytics work. Still, the analysis of large amounts of textual and visual data requires human oversight to succeed.

Large language models can augment data analysis, crunching more information and identifying deeper insights than data professionals can on their own.

An LLM performs natural language processing tasks. It is large because its core neural network has billions of parameters that it adjusts as it learns. The parameters are the essential components that define the model's skills. The more parameters an LLM has, the better it functions.

Data analysis focuses on understanding the meaning of data sets. It ranges from simple arithmetic, such as assuming the sales per quarter per store, to more complex statistics, such as calculating mean and median sales per store with adjustments for regional and seasonal variations. Data analysis can include quantitative assessments of qualitative inputs, such as assessing what percentage of spoken comments in a public meeting contained hostile or threatening language.

LLMs are not ready off the shelf to perform the role of a data analytics tool and answer detailed questions about the meanings of data sets accurately and consistently. Automated functions require training on the correct data sets to generate the most accurate results. Analysts must ensure that outputs are secure, accurate and ethically sound. An LLM is certainly an unreliable tool for non-analysts, and even trained analysts should use one cautiously.

How LLMs improve data analysis

It is possible for LLMs to perform analyses on files of structured numerical data. They can calculate statistics, look for trends and identify anomalies. Data analysts should only use LLMs for this type of data, however, if they limit the tool to look only at data in specific files and confine answers to material within those files.

Text analysis

An even better use of LLMs is to take advantage of their facility with language. Data analysts can use an LLM to accelerate text analysis and -- if it is multimodal and can interpret spoken language -- oral inputs. LLMs can transcribe spoken word inputs, translate languages and analyze the results by doing the following:

  • Highlighting categories of words.
  • Looking for commonalities among comments such as similar language, or references to the same people or things.
  • Providing semantic scoring of inputs based on defined categories of words. For example, being angry, despairing, engaged or credulous.
  • Pointing out contextual information associated with use of specific words or images.

Visual media analysis

If an LLM is trained with the ability to parse visual media, it can analyze the content of pictures, charts and videos. LLMs can follow straightforward prompts, such as looking for a specific kind of object -- say, how many hats are in a picture. They can also identify subtle elements, such as the prevalence of different color palettes across TikTok videos with a given hashtag.

LLMs can help analysts combine inherently unstructured data with structured data sets by converting free text, audio or video media into specific numerical data. Trained multimedia LLMs can also generate visualizations of data sets, ranging from traditional line or bar charts to word clouds and heat maps.

Predictive analytics. LLMs can allow analysts to analyze non-textual data, and, critically, to integrate those results with the analysis of standard numerical data. The combination broadens the reach of predictive analytics by enabling it to spot more trends. For example, an LLM would be able to identify patterns across media platforms more easily than human analysts could.

LLMs can tease out words in the data, understand the context of words and assign sets of words to themes. Using that information, analysts can then adjust the LLM model training for subsequent predictive analytics operations.

Examples of data analysis with LLMs

Analysts can use LLMs to provide insights and improve business operations in several ways.

Identify actions based on customer feedback

LLMs can help analysts understand patterns and trends in customer sentiment about specific products, services, store locations or staff. Analysts can use LLMs to analyze data from fixed-choice surveys, emails to sales teams and support desks, support and sales chat logs, social media postings and content from podcasts or product review videos. Companies can use the results to improve their operations and address shortcomings in their service or product portfolios.

Data analysts will not see LLMs replace their jobs anytime soon.

Identify business development opportunities

Analysts using LLMs can harvest data from sources including competitor websites, relevant social media channels and their own support channels. The information can help identify new product or service opportunities, open new stores or websites, move into new locations or form new partnerships with other companies.

Identify potential threats

Governments and corporations can analyze the content of material posted in public. The information can help augment data from other sources to spot potential security threats.

Will LLMs replace data analysts?

Data analysts will not see LLMs replace their jobs anytime soon.

Data professionals must learn more about what LLMs can do and how to keep the models honest. In the meantime, LLMs will be able to assist analysts, but not replace them. Organizations need analysts to craft prompts carefully and verify the accuracy of all outputs -- that is, until developers can consistently produce LLMs that don't make up data, present made-up data as real or make up conclusions not driven by the data.

John Burke is CTO and principal research analyst with Nemertes Research. With nearly two decades of technology experience, he has worked at all levels of IT, including end-user support specialist, programmer, system administrator, database specialist, network administrator, network architect and systems architect. His focus areas include AI, cloud, networking, infrastructure, automation and cybersecurity.

Dig Deeper on Business intelligence technology