Data Labeling


Data labeling is the process of applying data categorization metadata to instances of that data.

  • can take any form that is enduring, understandable, and consistent
  • labels should be evident and communicate the pertinent concepts without necessarily disclosing the data they describe
  • labels may indicate:
    • data owner in terms of role
    • data classification level
    • data category
    • date of creation
    • date of scheduled destruction/disposal
    • confidentiality level
    • handling directions
    • dissemination/distribution instructions
    • access limitations
    • source
    • jurisdiction
    • applicable regulation
  • labels are often used as part of data management tools
    • to allow lifecycle controls and support DLP functions