Data Labeling
Data labeling is the process of attaching categorization metadata to the individual instances of data that it describes.
- labels can take any form that is enduring, understandable, and consistent
- labels should be evident and communicate the pertinent concepts without necessarily disclosing the data they describe
- labels may indicate:
  - data owner in terms of role
  - data classification level
  - data category
  - date of creation
  - date of scheduled destruction/disposal
  - confidentiality level
  - handling directions
  - dissemination/distribution instructions
  - access limitations
  - source
  - jurisdiction
  - applicable regulation
- labels are often used as part of data management tools, where they allow lifecycle controls and support data loss prevention (DLP) functions
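
As a rough illustration of how the points above might fit together, the sketch below models a label as a small record and shows two checks a data management tool could run against it: one lifecycle control and one DLP-style rule. The names here (DataLabel, LEVELS, is_due_for_disposal, may_leave_organization), the four classification levels, and the sample values are all assumptions made for this example and do not come from any particular product or standard.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass(frozen=True)
class DataLabel:
    """Hypothetical label record; field names are illustrative only."""
    owner_role: str                    # data owner, expressed as a role
    classification: str                # one of LEVELS below
    category: str
    created: date
    dispose_on: Optional[date] = None  # scheduled destruction/disposal
    handling: str = ""
    jurisdiction: str = ""
    regulation: str = ""


# Assumed classification levels, ordered from least to most sensitive.
LEVELS = ["public", "internal", "confidential", "restricted"]


def is_due_for_disposal(label: DataLabel, today: date) -> bool:
    """Lifecycle control: True once the scheduled disposal date is reached."""
    return label.dispose_on is not None and today >= label.dispose_on


def may_leave_organization(label: DataLabel, max_level: str = "internal") -> bool:
    """DLP-style rule: permit egress only at or below max_level."""
    return LEVELS.index(label.classification) <= LEVELS.index(max_level)


# Sample label; every value is made up for the example.
label = DataLabel(
    owner_role="HR Director",
    classification="confidential",
    category="personnel records",
    created=date(2024, 1, 15),
    dispose_on=date(2031, 1, 15),
    handling="encrypt at rest and in transit",
    jurisdiction="EU",
    regulation="GDPR",
)

print(is_due_for_disposal(label, date(2025, 6, 1)))  # False
print(may_leave_organization(label))                 # False
```

Note that the label carries only metadata, so a tool can make both decisions without reading the data the label describes.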