pickabook books with huge discounts for everyone
pickabook books with huge discounts for everyone
Visit our new collection website www.collectionsforschool.co.uk
     
Email: Subscribe to news & offers:
Need assistance? Log In/Register


Item Details
Title: AN INTRODUCTION TO DUPLICATE DETECTION
By: Felix Naumann, Melanie Herschel, M. Tamer Ozsu
Format: Paperback

List price: £30.50


We currently do not stock this item, please contact the publisher directly for further information.

ISBN 10: 1608452204
ISBN 13: 9781608452200
Publisher: MORGAN & CLAYPOOL PUBLISHERS
Pub. date: 12 March, 2010
Series: Synthesis Lectures on Data Management
Pages: 87
Description: Automatically detecting duplicates is difficult. Duplicate representations are usually not identical but slightly differ in their values, and in principle all pairs of records should be compared, which is unfeasible for large volumes of data. This volume examines how to overcome these difficulties.
Synopsis: With the ever increasing volume of data, data quality problems abound. Multiple, yet different representations of the same real-world objects in data, duplicates, are one of the most intriguing data quality problems. The effects of such duplicates are detrimental; for instance, bank customers can obtain duplicate identities, inventory levels are monitored incorrectly, catalogs are mailed multiple times to the same household, etc. Automatically detecting duplicates is difficult: First, duplicate representations are usually not identical but slightly differ in their values. Second, in principle all pairs of records should be compared, which is infeasible for large volumes of data. This lecture examines closely the two main components to overcome these difficulties: (i) Similarity measures are used to automatically identify duplicates when comparing two records. Well-chosen similarity measures improve the effectiveness of duplicate detection. (ii) Algorithms are developed to perform on very large volumes of data in search for duplicates. Well-designed algorithms improve the efficiency of duplicate detection. Finally, we discuss methods to evaluate the success of duplicate detection.
Illustrations: black & white illustrations
Publication: US
Imprint: Morgan & Claypool Publishers
Returns: Non-returnable
Some other items by this author:
ACCESS CONTROL IN DATA MANAGEMENT SYSTEMS (PB)
ADVANCED METASEARCH ENGINE TECHNOLOGY (PB)
ADVANCES IN MULTIMEDIA INFORMATION SYSTEMS (PB)
ADVANCES IN OBJECT-ORIENTED DATABASE SYSTEMS (PB)
AN INTRODUCTION TO DUPLICATE DETECTION
CONCEPTUAL MODELING (PB)
CURRENT TRENDS IN DATA MANAGEMENT TECHNOLOGY (PB)
DATA INTEGRATION IN THE LIFE SCIENCES (PB)
DATA PROFILING
DATA PROFILING
DATA PROFILING (HB)
DATA STREAM MANAGEMENT
DATA STREAM MANAGEMENT (PB)
DATA STREAM MANAGEMENT (PB)
DATABASE ON DEMAND (PB)
DECLARATIVE NETWORKING (PB)
DISTRIBUTED AND PARALLEL DATABASE OBJECT MANAGEMENT (HB)
DISTRIBUTED AND PARALLEL DATABASE OBJECT MANAGEMENT (PB)
ENCYCLOPEDIA OF DATABASE SYSTEMS
ENCYCLOPEDIA OF DATABASE SYSTEMS
ENCYCLOPEDIA OF DATABASE SYSTEMS (HB)
ENCYCLOPEDIA OF DATABASE SYSTEMS (PB)
FUNDAMENTALS OF OBJECT DATABASES (PB)
FUNDAMENTALS OF PHYSICAL DESIGN AND QUERY COMPILATION (PB)
INTRODUCTION TO DUPLICATE DETECTION
KEYWORD SEARCH IN DATABASES (PB)
MANAGING EVENT INFORMATION (PB)
PRINCIPLES OF DISTRIBUTED DATABASE SYSTEMS (HB)
PRINCIPLES OF DISTRIBUTED DATABASE SYSTEMS (PB)
PRIVACY-PRESERVING DATA PUBLISHING (PB)
PROBABILISTIC RANKING TECHNIQUES IN RELATIONAL DATABASES (PB)
QUALITY-DRIVEN QUERY ANSWERING FOR INTEGRATED INFORMATION SYSTEMS (PB)
SYNTHESIS SERIES IN COMPUTER AND INFORMATION SCIENCE (HB)
UNCERTAIN SCHEMA MATCHING (PB)
UNCERTAIN SCHEMA MATCHING (PB)
WEB PAGE RECOMMENDATION MODELS (PB)
WEB-AGE INFORMATION MANAGEMENT (PB)
WORKFLOW MANAGEMENT SYSTEMS AND INTEROPERABILITY (HB)



Information provided by www.pickabook.co.uk
SHOPPING BASKET
  
Your basket is empty
  Total Items: 0
 

NEW
World’s Worst Superheroes GET READY FOR SOME SUPERSIZED FUN!
add to basket





New
No Cheese, Please! A fun picture book for children with food allergies - full of friendship and super-cute characters!Little Mo the mouse is having a birthday party.
add to basket

New
My Brother Is a Superhero Luke is massively annoyed about this, but when Zack is kidnapped by his arch-nemesis, Luke and his friends have only five days to find him and save the world...
add to basket


Picture Book
Animal Actions: Snap Like a Crab
By:
The first title in a new preschool series from Guilherme Karsten.
add to basket