Data Masking


Data masking is a de-identification method where generic or placeholder labels are substituted for real data while preserving the structure or format of the original data.

  • can mean that all or part of the contents of a database field are redacted by substituting all character strings with “x”, for example
  • field might be partially redacted
    • to preserve metadata for analysis purposes
    • e.g., telephone number
      • dialing prefix might be retained, but the subscriber number is redacted
  • can use techniques to preserve the original format of the field