(Usage hints for this presentation)
Winter Term 2023/2024
Dr. Jens Lechtenbörger (License Information)
lechten
britney spears
corrected by Google
Kimball, Dealing with Dirty Data, DBMS, 1996
TargetCode | TargetName |
---|---|
0 | unknown |
1 | female |
2 | male |
3 | diverse |
Lookup | Data Source | Source | TargetCode |
---|---|---|---|
S1 | unknown | 0 | |
S1 | female | 1 | |
S1 | male | 2 | |
S1 | diverse | 3 | |
S2 | f | 1 | |
S2 | m | 2 | |
S2 | ? | 0 | |
S2 | NULL | 0 | |
S3 | 1 | 2 | |
S3 | 2 | 1 | |
… | … | … |
ID | Name | Surname | Birthdate | |
---|---|---|---|---|
1 | John | Smith | 03/17/1974 | smith@abc.it |
2 | Edward | Monroe | 02/03/1967 | NULL (does not exist) |
3 | Anthony | White | 01/01/1936 | NULL (existing but unknown) |
4 | Marianne | Collins | 11/20/1955 | NULL (not known if existing) |
(Source: [SMB05])
Source files are available on GitLab (check out embedded submodules) under free licenses. Icons of custom controls are by @fontawesome, released under CC BY 4.0.
Except where otherwise noted, the work “Data Quality”, © 2005-2021, 2023 Jens Lechtenbörger and © 2006-2019 Gottfried Vossen, is published under the Creative Commons license CC BY-SA 4.0.
In particular, trademark rights are not licensed under this license. Thus, rights concerning third party logos (e.g., on the title slide) and other (trade-) marks (e.g., “Creative Commons” itself) remain with their respective holders.