Data management strategies
Data is a corporate asset that helps make more-informed business decisions, improve marketing campaigns, optimize business operations and reduce costs. Learn data management strategies to ensure IT systems run business applications and provide analytical information that drives decision-making and strategic planning by corporate executives, business managers and end users.
Top Stories
-
News
30 Oct 2024
Alation browser-based extension aims to streamline analysis
The vendor's extension for Google Chrome embeds its tools in users' browsers to reduce time wasted toggling between applications and improve efficiency and speed to insight. Continue Reading
By- Eric Avidon, Senior News Writer
-
News
29 Oct 2024
Dremio updates data catalog to support all deployment types
Recognizing that some customers need to keep data on-premises, the vendor has extended the availability of its Data Catalog for Apache Iceberg beyond its cloud-based platform. Continue Reading
By- Eric Avidon, Senior News Writer
-
Feature
05 Apr 2019
USAA adds data engineering skills to speed data science work
When the United Services Automobile Association's data science team wasn't getting data in the right format, the team lead realized the USAA needed more data engineers. Continue Reading
-
News
04 Apr 2019
Tools manage performance for big data cloud applications
Tools such as Unravel and Pepperdata offer a way to measure performance of big data cloud applications, which may aid companies with on-premises configuration issues. Continue Reading
By -
Tip
29 Mar 2019
5 things to know about deploying big data systems in data containers
Planning for security and container APIs, and watching out for infrastructure sprawls are some issues to be aware of before deploying big data in containers. Continue Reading
-
News
25 Mar 2019
Facebook alumni forge own paths to big data analytics tools
Startups Interana and Rockset differ in their approaches to providing new query capabilities on fast-arriving big data. Both are led by technologists who started at Facebook. Continue Reading
By -
Feature
27 Feb 2019
8 tips to improve the data curation process
A data curation and modeling strategy can ensure accuracy and enhance governance. Experts offer eight best practices for curating data. First, start at the source. Continue Reading
-
Reference
27 Feb 2019
states of digital data
A state of digital data is a way to describe the current functionality of a data file. There are three major states of data: data at rest, data in motion and data in use. Continue Reading
-
Feature
25 Feb 2019
Explore Hadoop distributions to manage big data
Discover the uses of Hadoop distributions and the first steps in evaluating these products, as well as how the merger of rivals Cloudera and Hortonworks affects the market. Continue Reading
By- David Loshin, Knowledge Integrity Inc.
-
News
14 Feb 2019
Originators form group to boost Presto SQL query engine
The Presto engine arose as an alternative to Hive for big data queries. Now, the Presto Software Foundation has formed to promote the SQL query software's virtues. Continue Reading
By -
News
01 Feb 2019
Cloud data management, security top of mind for government
Federal government data officers grapple with cloud data management, weighing lower cost and efficiencies against security threats and vendor lock-in. Continue Reading
By -
News
15 Jan 2019
Cloudera and Hortonworks combo to push CDP, machine learning
Two wunderkinds of Hadoop have formalized their merger. Cloudera and Hortonworks say they will place special focus on AI as they chart the stand-alone vendor's future. Continue Reading
By -
Podcast
19 Dec 2018
Open source support was central to 2018 data deals
Mergers and acquisitions unsettled the big data status quo in 2018. Open source support made these couplings a bit different than those of the past, Talking Data podcasters said. Continue Reading
By -
Tip
20 Nov 2018
Trifacta data prep tool helps blend disparate data sources
Handling diverse data sources usually consumes precious developer time. That led healthcare CRM company SymphonyRM to hand the data prep task to business analysts. Continue Reading
By -
Tip
31 Oct 2018
How to build a master data index: Static vs. dynamic indexing
Expert David Loshin explores the differences between static and dynamic indexing in master data management systems, and which queries each approach can support. Continue Reading
By- David Loshin, Knowledge Integrity Inc.
-
Tip
31 Oct 2018
How deterministic and probabilistic matching work
Expert David Loshin explores the benefits and challenges of the two classes of record matching in master data management systems: deterministic matching vs. probabilistic matching. Continue Reading
By- David Loshin, Knowledge Integrity Inc.
-
Feature
11 Oct 2018
Cloud buoys data microservices -- for on-premises systems, too
Data in a microservices architecture is percolating anew. This news analysis looks at IBM Cloud Private for Data and other means to harmonize data in public and private locations. Continue Reading
By -
News
04 Oct 2018
Cloudera-Hortonworks merger narrows Hadoop users' options
Hadoop users will have fewer choices as big data rivals Cloudera and Hortonworks unite. But the new company may be more competitive with AWS and Google. Continue Reading
By -
Tip
27 Sep 2018
The gradual evolution of master data management software
Master data management began with a bang, then hit roadblocks due to complexity. Now, MDM is shifting toward more pragmatic projects tied to data governance. Continue Reading
By- Andy Hayler, Information Difference
-
Opinion
21 Sep 2018
5 trends driving the big data evolution
The speedy evolution of big data technologies is connected to five trends, including practical applications of machine learning and cheap, abundantly available compute resources. Continue Reading
By- Mike Matchett, Small World Big Data
-
News
13 Sep 2018
Containers key for Hortonworks alliance on big data hybrid
Hortonworks is joining with Red Hat and IBM to work together on a hybrid big data architecture format that will run using containers both in the cloud and on premises. Continue Reading
By -
News
07 Sep 2018
Big data tooling rolls with the changing seas of analytics
Hadoop data tooling is expanding. A view holds that Hadoop is moving from alternate data warehousing to a full-fledged big data analytics offering. Continue Reading
By -
Definition
06 Sep 2018
customer data integration (CDI)
Customer data integration (CDI) is the process of defining, consolidating and managing customer information across an organization's business units and systems to achieve a "single version of the truth" for customer data. Continue Reading
By- Jacqueline Biscobing, Senior Managing Editor, News
-
News
08 Aug 2018
Confluent Platform 5.0 aims to mainstream Kafka streaming
Confluent Platform updates seek to bring data streaming with Apache Kafka to a wider audience. A new GUI and user-defined functions are part of the 5.0 release. Continue Reading
By -
Podcast
02 Jul 2018
Hadoop data governance services surface in wake of GDPR
GDPR influence is touching a Hadoop big data world that was immune to many privacy considerations until now. This podcast features the rise of Hadoop data governance for data lakes. Continue Reading
By -
Tip
28 Jun 2018
Four first steps for customer data management
Forrester's Mike Gualtieri details how to develop a unified plan to manage customer data that gives business users what they need to manage CRM programs. Continue Reading
By- Mike Gualtieri, Forrester Research
-
News
25 Jun 2018
Hadoop data lake architecture tests IT on data integration
Hortonworks users talk about building Hadoop data lakes to support new applications -- and the challenges their teams face on ingesting and refining data for end users. Continue Reading
By -
News
18 Jun 2018
Hortonworks cloud options grow via Google, Microsoft, IBM
Hortonworks now supports Google Cloud Storage and has also broadened cloud deals with Microsoft and IBM, aiming to increase cloud uses of its big data platform. Continue Reading
By -
Feature
07 Jun 2018
Google Cloud data lake fuels cloud payment processing flow
To create a cloud payment processing system, Global Payments first had to deploy a data lake in the Google Cloud. Getting quick user feedback was another early step. Continue Reading
By -
Tip
05 Jun 2018
Why Spark DataFrame, lazy evaluation models outpace MapReduce
Learn how the Spark DataFrame execution plan works and why its lazy evaluation model helps the processing engine to avoid the performance issues inherent in Hadoop MapReduce. Continue Reading
By- David Loshin, Knowledge Integrity Inc.
-
Podcast
01 Jun 2018
Starburst finds new worlds to conquer with SQL query engine
Relational databases may have hit a wall of late, but the SQL query engine seems poised for wider growth. Starburst, a retro startup of sorts, is among those looking to take it wider still. Continue Reading
By - 09 Apr 2018
- 09 Apr 2018
- 09 Apr 2018
-
Feature
09 Apr 2018
IT teams take big data security issues into their own hands
Data security needs to be addressed upfront in deployments of big data systems -- and users are likely to find they have to build some security capabilities themselves. Continue Reading
By- Craig Stedman, Industry Editor
-
Podcast
29 Mar 2018
Kubernetes container orchestration gets big data star turn
The new thing in big data is Kubernetes container orchestration. While it's still early, there are signs of activity, which are cited in this edition of the Talking Data podcast. Continue Reading
By -
Feature
21 Mar 2018
Kubernetes gains momentum in big data implementation process
Big data vendors and users are looking to Kubernetes-managed containers to help accelerate deployments and enable more flexible use of computing resources. Continue Reading
By- Craig Stedman, Industry Editor
-
News
15 Mar 2018
Streaming tool from StreamSets eyes data in motion for GDPR
StreamSets software for inspecting big data brings governance to data in motion. Such capabilities may find more use as the European Union's GDPR deadline looms. Continue Reading
By -
Definition
21 Feb 2018
TensorFlow
TensorFlow is an open source framework developed by Google researchers to run machine learning, deep learning and other statistical and predictive analytics workloads. Continue Reading
By -
Opinion
02 Feb 2018
Data managers should study up on GPU deep learning
As GPU deep learning becomes more common, data managers will have to navigate several new layers of complexity in their quest to build or buy suitable data infrastructure. Continue Reading
By -
Tip
01 Feb 2018
Three ways to turn old files into Hadoop data sets in a data lake
Hadoop data lakes offer a new home for legacy data that still has analytical value. But there are different ways to convert the data for use in Hadoop depending on your analytics needs. Continue Reading
By- David Loshin, Knowledge Integrity Inc.
-
Definition
30 Jan 2018
Hadoop data lake
A Hadoop data lake is a data management platform comprising one or more Hadoop clusters. Continue Reading
By -
Podcast
23 Jan 2018
How AI and IoT will influence data management in 2018
AI and IoT will alter the data management landscape in 2018, according to analyst James Kobielus. AI will need regular updates, and DevOps will become more prevalent as a result. Continue Reading
By -
News
03 Jan 2018
Apache Hadoop 3.0 goes GA, adds hooks for cloud and GPUs
Is this the post-Hadoop era? Not in the eyes of Hadoop 3.0 backers, who see the latest update to the big data framework succeeding in machine learning applications and cloud systems. Continue Reading
By -
Feature
26 Sep 2017
Bank sees ETL processes evolve from IT to front-line users
ETL jobs -- once the sole province of IT -- take on a new form as data wrangling and self-service gain greater traction with business users of analytics. Continue Reading
By -
Feature
21 Aug 2017
Data management processes take on a new tenor in analyst's view
Changes in data management processes -- including self-service data preparation, data lakes and real-time analytics -- create a new landscape in companies, according to analyst Matt Aslett. Continue Reading
By -
Quiz
11 Jul 2017
Quiz: What's your IQ on data lake implementation and management?
A data lake can provide lots of analytics value to an organization. This quiz can tell you if you know what it takes to deploy and manage one; it also contains info to help boost your knowledge. Continue Reading
By- Tréa Lavery, Editorial Assistant
-
Feature
24 Oct 2016
Reltio master data management architecture rides graph analytics wave
An onslaught of unstructured data, social and otherwise may change the nature of master data management platforms. Reltio Cloud may stand as an indicator. Continue Reading
By -
News
15 Sep 2016
HPE sells Vertica analytics, thanks to the growth of open source software
HPE is paring down its software holdings, including analytical database software in the Vertica line and other big data tools. A sale to Micro Focus is due to close next year, leaving users in some limbo for now. Continue Reading
By -
Feature
26 Apr 2016
Inside the MapR Hadoop distribution for managing big data
The MapR Hadoop distribution replaces HDFS with its proprietary file system, MapR-FS, which is designed to provide more efficient management of data, reliability and ease of use. Continue Reading
By -
Feature
29 Oct 2015
Three ways to build a big data system
In a book excerpt, author Dale Neef outlines and compares different approaches organizations can take when trying to bring a big data system into their IT environments. Continue Reading
By- Dale Neef
-
Feature
13 Aug 2015
How to identify master data in a multi-domain MDM program
In an excerpt from their book on managing multi-domain master data management programs, Mark Allen and Dalton Cervo explain how to identify MDM domains and your master data. Continue Reading
By- Mark Allen and Dalton Cervo
-
Quiz
11 Jun 2015
Apache Spark architecture quiz: Do you know Spark?
Apache Spark is an open source processing engine designed for use in big data applications. Take this quiz to see if you're up to speed on Spark's history and features. Continue Reading
-
01 May 2015
Key criteria for deciding if a relational DBMS meets your IT needs
For many organizations, a relational database management system will get the job done; others need to keep looking to fulfill their needs. Continue Reading
- 01 May 2015
-
Answer
19 Feb 2015
Five steps to implementing an MDM program
Instituting a master data management program involves discovery, analysis, construction, implementation and sustainment processes, according to MDM expert Anne Marie Smith. Continue Reading
By- Anne Marie Smith, Ph.D., Alabama Yankee Systems, LLC
-
Feature
27 Sep 2013
Big data, fast: Avoiding Hadoop performance bottlenecks
A variety of performance issues can bog down Hadoop clusters. But there are ways to sidestep the pitfalls and keep your big data environment humming. Continue Reading
By -
Feature
15 Aug 2013
Gartner describes the building blocks for a strong MDM program
MDM teams should consider these seven building blocks, outlined by Saul Judah, research director on the information management team at Gartner Inc. Continue Reading
By -
Definition
30 Apr 2012
master data
Master data is the core data that is essential to operations in a specific business or business unit. Continue Reading
-
Tutorial
14 Feb 2011
MDM and SOA: An introduction to SOA and the benefits of SOA
Learn about MDM and SOA and how they relate. Read an introduction to SOA and learn about the benefits of SOA in this master data management and data governance book excerpt. Continue Reading
By- Alex Berson and Larry Dubov
-
Tutorial
14 Feb 2011
An introduction to enterprise architecture framework and MDM patterns
Get an introduction to enterprise architecture framework and MDM enterprise architecture patterns. Learn about MDM framework concepts and a definition of SOA. Continue Reading
By- Alex Berson and Larry Dubov
-
Tutorial
06 Jul 2010
Buyer’s Guide: Choosing and understanding MDM software
Read up on master data and master data management, plus get expert advice on choosing MDM software and tools and what to look for in an MDM product or service. Continue Reading
-
Feature
04 Feb 2010
Making business transaction processing and applications work
Learn about making business transaction processing work in your company, find tips on transaction processing applications and get transaction processing definitions. Continue Reading
-
Podcast
14 Dec 2009
Tips for a master data management (MDM) requirements gathering process
Embarking on a master data management (MDM) project? Get an expert's tips for the MDM requirements gathering process and ensure MDM implementation success. Continue Reading
-
Answer
15 Sep 2009
Six criteria for master data management (MDM) tool evaluation
Starting to evaluate MDM tools and software? Know what six areas you should look at to make sure you choose the best vendor and tool for your business needs. Continue Reading
By -
Answer
15 Sep 2009
Is it better to have a centralized or decentralized master data structure?
Should your master data be centralized or decentralized? Get an expert's take, plus find out how different MDM architecture styles affect your master data and MDM hub. Continue Reading
By -
Feature
02 Jul 2009
Top 13 master data management (MDM) buzzwords and definitions
Get the top master data management (MDM) terms, definitions and concepts and learn how MDM can improve your enterprise data. Also get links to useful MDM tutorials, training, video, podcasts and articles. Continue Reading
-
Answer
28 May 2009
Four must-have master data management project skills
There are four important master data management project skills needed for MDM projects and development efforts. Discover what they are and learn about MDM job roles and responsibilities in this tip. Continue Reading
By -
Feature
21 May 2009
The importance of managing data assets
Get an overview of data management and data lifecycle, learn how important managing data and data assets is and learn how to control data in this information management tutorial. Continue Reading
-
Answer
31 Mar 2009
What are the advantages/disadvantages of database abstraction layers?
Discover the advantages and disadvantages of using database abstraction layers, as opposed to database-specific coding, and learn why abstraction layers offer maximum flexibility. Continue Reading
By -
Feature
26 Jan 2009
Designing an MDM project plan
Master data management (MDM) projects require enterprise buy-in and participation in order to be successful. In this chapter, learn how to identify the people in the organization that can benefit from MDM and how to assemble an MDM project team. Discover how to establish and communicate the MDM business case, learn tips for collecting and prioritizing data requirements, developing a plan for enterprise data integration and designing a migration plan for the participating applications. Learn how to tackle these key steps and how to develop a successful MDM project plan in this chapter. Setting the stage for assembling the right team for an MDM program involves determining the best opportunities to socialize the internal value across the enterprise landscape. As this chapter has shown, this socialization is performed along a number of avenues: Communicating the value of presenting a centralized master data asset to the business clients, application owners, and operations staff Identifying the key stakeholders that will participate in the MDM program Carefully articulating within a project charter what will be achieved by the MDM program and how the corresponding milestones will be reached Providing a means for coordinating the participants to reduce conflict and encourage consensus Carefully detailing roles and responsibilities Analyzing the data requirements to establish MDM feasibility ■ ■ ■ ■ ■ ■ 2.7 Summary Continue Reading
-
Answer
14 Oct 2008
What's the difference between SOA and Web services?
How are Web services and service oriented architecture (SOA) different? Find out in this expert response. Continue Reading
By -
Definition
18 Sep 2008
CRUD cycle (Create, Read, Update and Delete Cycle)
The CRUD cycle describes the elemental functions of a persistent database in a computer. Continue Reading
-
Feature
13 Mar 2008
IT project management: The cost estimating process
Learn about the cost estimating process in IT project management by reading examples and definitions of cost estimation terms, in this free chapter download. Continue Reading
-
Feature
28 Jan 2008
Personal information management: History and details
Learn about personal information management and why managing personal information is especially important in today's data inundated world. Continue Reading
-
News
22 Jan 2008
Active data warehousing explained and examined
Active data warehousing brings business intelligence to more operational decision-making processes. Find out how, and why, active data warehouses enable competitive advantages. Continue Reading
By- Mark Whitehorn, University of Dundee
-
Quiz
18 Dec 2007
Enterprise data integration quiz
Take this enterprise data integration quiz to test your knowledge of data integration terms, trends and technologies. Continue Reading
-
Feature
01 May 2007
MDM-CDI project implementation guidelines
This chapter excerpt deals with the practical aspects of implementing MDM-CDI solutions as complex, multidisciplinary, enterprise-wide projects or programs. Implementing MDM-CDI solutions requires an approach that may be even broader than a single project, and should be managed as an initiative-level, enterprise-wide program. Continue Reading
-
Feature
20 Apr 2007
Data governance: Information ownership policies and roles explained
Who owns the data? This chapter defines information ownership boundaries and best practices, and outlines the specific roles and responsibilities of all involved in the battle for data governance. Continue Reading
- Answer 05 Sep 2003
-
Answer
21 Jun 2002
Difference between BI in the public sector and private sector
Find out the major difference between business intelligence (BI) in the public sector and BI in the private sector and get an expert's take on trends for those using BI in the public sector. Continue Reading
- Answer 20 Mar 2002