Check for Personally-Identifiable InformationΒΆ

The curator should identify any variables that can directly or indirectly identify subjects.

See also

For information about confidentiality and disclosure risk, see

  1. For a data file, select the Check Personally-Identifiable Information task.

  2. The review page will show the following information:

    1. Instructions indicating how to perform this review.
    2. A list of all variables.
    3. A link to download the data file for manual modification.
  3. If you suspect the file may contain personally-identifiable information, take the following steps.

    1. Download the data file
    2. Remove or anonymize the appropriate information.
    3. Upload the new version of the file.


    Removing PII from the data file involves either using statistical software (e.g., Stata, R) to edit or write new code, or running a program/script that deletes or otherwise transforms these variables and writes out a new revised version of the data file, and adding the resulting data file to the catalog record (as well as any new code file).

  4. You should always manually review all variables to verify that no potentially personally-identifiable information is present.

  5. Once you are satisfied that the file does not contain personally-identifiable information, enter any desired comments and marks the review as complete.