Talend Unveils Open Source Clean Data App -- ADTmag

Talend Unveils Open Source Clean Data App

By Will Kraft
August 26, 2008

Talend last week announced an open source software package to help clean data records. The San Diego-based company's new Data Quality product identifies and helps to remedy so-called "dirty data" in database management systems.

Dirty data is typically seen with nicknames and shortened street addresses in fields, which can lead to duplicate records. Talend's solution updates the data with standardized information gathered from the U.S. Postal Service and other sources.

The Data Quality product allows companies to easily distinguish between "Peggy," "Peg," "Marge" and "Meg" (all variations of "Margaret") when they reference the same person. It can match "William Smith at 15 Main Street" with "Billy Smith at 15 Main Str." Such inconsistencies have resulted in lost or redundant mailings in the past.

The software includes data profiling, cleansing and enrichment functions. Data profiling allows a company to track data degradation over time. With data cleansing, the software corrects "bad" data by cross-checking against other databases and reference data.

The data enrichment feature associates additional information with the data, which can be used to help target mailings to a specific demographic. The additional information might include latitude and longitude, census data, and credit scores.

The Data Quality product will be available in September as an individual product or as an extension to the Talend Integration Suite, which is the company's data integration service.

Earlier this summer, the company also announced a new open source data profiler called Talend Open Profiler. More information about Talend's data integration solutions can be found here.

About the Author

Will Kraft is a Web designer, technical consultant and freelance writer. He can be reached at [email protected]. Also, check out his blog at http://www.willkraftblog.com.

Featured

AppTrends

Email Address*Country*

Please type the letters/numbers you see above.

Upcoming Training Events

0 AM

Live! 360 2-Day Hands-On Seminar: AI-Powered .NET Development with Claude & Claude Code
July 9-10, 2026

VSLive! 4-Day Hands-On Training Seminar: Immersive .NET Full Stack Training with CoPilot: 4-Day Hands-On Experience
July 14-17, 2026

Visual Studio Live! @ Microsoft HQ
July 27-31, 2026

Visual Studio Live! @ San Diego
September 14-18, 2026

The AI Pivot
September 25, 2026

Live! 360 6-Week Training & Certification Course: Mastering the Microsoft AI Framework: Building Enterprise-Ready AI Agents with Microsoft Foundry
October 6–November 10, 2026

VSLive! 6-Week Training & Certification Course: Blazor Developer Accelerator: Hands-On Skills for Real-World .NET Teams
October 7 – November 11, 2026

Live! 360 Orlando
November 15-20, 2026

Artificial Intelligence Live! Orlando
November 15-20, 2026

AI Enterprise Architecture Live! Orlando
November 15-20, 2026

Cybersecurity & Ransomware Live! Orlando
November 15-20, 2026

Data Platform Live! Orlando
November 15-20, 2026

Visual Studio Live! Orlando
November 15-20, 2026

Live! 360 2-Day Hands-On Seminar: AI-Powered .NET Development with Claude & Claude Code
December 8-9, 2026

VSLive! 4-Day Hands-On Training Seminar: Immersive .NET Full Stack Training with CoPilot: 4-Day Hands-On Experience
December 15-18, 2026

Visual Studio Live! Las Vegas
March 22-26, 2027

Visual Studio Live! @ Microsoft HQ
August 2-6, 2027

Free White Papers

More Tech Library