# PII Scanner (Postgres)

We're thrilled to announce the release of our PII Scanner after three months of dedicated effort. This powerful tool scans your PostgreSQL database using three advanced methods: Meta Scan, Data Scan, and NLP Scan.

Our solution outperforms existing alternatives by at least 5X. Using NLP techniques such as spaCy, it can accurately detect PII entities such as first names and last names, which traditional regex methods often miss.

**How to generate an HTML report?**

Select Option 4, as shown below, and enter values for the prompted inputs.

NOTE: Watch the short demo video at [https://youtu.be/HtP8N0Op-V4](https://youtu.be/HtP8N0Op-V4)

<figure><img src="https://3013412235-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2Fj94nGPKUS9dlzKveRYHc%2Fuploads%2Fn0SD75dLT1eQECZpJuT7%2Fimage.png?alt=media&#x26;token=9ad0f57f-2cd2-476f-9b22-3641f76afc00" alt=""><figcaption></figcaption></figure>

**Sample HTML Report**

<figure><img src="https://3013412235-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2Fj94nGPKUS9dlzKveRYHc%2Fuploads%2F6F3gfkLH7IdD9o6Qu6WS%2Fimage.png?alt=media&#x26;token=f779bb6a-dfff-4256-b943-27e1fd59a00b" alt=""><figcaption></figcaption></figure>

**Metascan method**

This method matches on column names, not data. For instance, a column named 'first\_name' or 'SSN' is flagged as a match. Note: This approach checks only column names, never the actual data.
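The column-name matching described above can be sketched roughly as follows. This is a minimal illustration, not the scanner's actual implementation: the `COLUMN_NAME_PATTERNS` table and the `meta_scan` function are hypothetical names, and the patterns are simplified assumptions.

```python
import re

# Hypothetical keyword patterns (assumption): the scanner's real rules
# are not documented on this page.
COLUMN_NAME_PATTERNS = {
    "Name": re.compile(r"(first|last|full)_?name", re.IGNORECASE),
    "SSN": re.compile(r"(^|_)ssn($|_)|social_?security", re.IGNORECASE),
    "Email Address": re.compile(r"e_?mail", re.IGNORECASE),
}


def meta_scan(columns):
    """Return (column, entity) pairs for column names that look like PII.

    Only the names are inspected; no row data is read.
    """
    matches = []
    for column in columns:
        for entity, pattern in COLUMN_NAME_PATTERNS.items():
            if pattern.search(column):
                matches.append((column, entity))
                break  # report the first matching entity per column
    return matches


# In PostgreSQL, column names could be listed from
# information_schema.columns; a plain list stands in for that here.
print(meta_scan(["id", "first_name", "SSN", "created_at"]))
```

Because only names are inspected, a scan of this kind is cheap, but it cannot find PII stored in a column with a non-descriptive name.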

**Datascan method (default method)**

This method matches on data. It selects up to 10,000 records per table to identify possible matches.
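A data-based scan with a per-table sample cap might look like the sketch below. It is an illustration under stated assumptions: `VALUE_PATTERNS`, `data_scan`, and the simplified regexes are hypothetical, and rows are passed in as dictionaries rather than fetched from a live database.

```python
import re
from itertools import islice

SAMPLE_LIMIT = 10_000  # per-table cap, taken from the description above

# Hypothetical, simplified value patterns (assumption): these are not
# the regexes the scanner ships with.
VALUE_PATTERNS = {
    "Email Address": re.compile(r"^[\w.+-]+@[\w-]+(\.[\w-]+)+$"),
    "SSN": re.compile(r"^\d{3}-\d{2}-\d{4}$"),
    "IP Address (IPv4)": re.compile(r"^(\d{1,3}\.){3}\d{1,3}$"),
}


def data_scan(rows, limit=SAMPLE_LIMIT):
    """Count pattern hits per (column, entity) over at most `limit` rows.

    `rows` is any iterable of dicts mapping column name to value.
    """
    hits = {}
    for row in islice(rows, limit):
        for column, value in row.items():
            if value is None:
                continue  # NULLs cannot contain PII
            text = str(value)
            for entity, pattern in VALUE_PATTERNS.items():
                if pattern.match(text):
                    key = (column, entity)
                    hits[key] = hits.get(key, 0) + 1
    return hits


sample = [
    {"id": 1, "contact": "ana@example.com", "tax_ref": "123-45-6789"},
    {"id": 2, "contact": "not an email", "tax_ref": None},
]
print(data_scan(sample))
```

Capping the sample keeps scan time bounded on large tables, at the cost of possibly missing PII that appears only in rows outside the sample.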

**spaCy method**

This method uses spaCy, an NLP library, to detect PII entities such as first names and last names, which traditional regex methods often miss.

**How does our solution outperform other existing alternatives for this feature?**

Our solution supports a wider range of entities than other available options. We also employ advanced techniques like spaCy for matching last names, first names, and more. Additionally, we provide an HTML report, making it easier and faster to identify issues.

**Supported entities**

* Name
* Email Address
* Username
* SSN
* PO Box
* IP Address (IPv4)
* MAC Address
* OAuth Token
* Location
* Nationality
* Gender
* Bank Account Number
* PAN Number
* Aadhaar Card Number
* ITIN
* Driving License Number
* Passport Number
* NHS Number
* Password
* Phone
* Credit Card
* Birth Date
* Zip Code

**References used for some regexes**

* US driving license: [https://docs.trellix.com/bundle/data-loss-prevention-11.10.x-classification-definitions-reference-guide/page/GUID-CA4A41FB-B897-4910-809E-ED33DEF9CE77.html](https://docs.trellix.com/bundle/data-loss-prevention-11.10.x-classification-definitions-reference-guide/page/GUID-CA4A41FB-B897-4910-809E-ED33DEF9CE77.html) and [https://success.skyhighsecurity.com/Skyhigh\_Data\_Loss\_Prevention/Data\_Identifiers/U.S.\_Driver's\_License\_Numbers](https://success.skyhighsecurity.com/Skyhigh_Data_Loss_Prevention/Data_Identifiers/U.S._Driver's_License_Numbers)
* India driving license (regex source): [https://www.geeksforgeeks.org/how-to-validate-indian-driving-license-number-using-regular-expression/](https://www.geeksforgeeks.org/how-to-validate-indian-driving-license-number-using-regular-expression/)
* Some regexes come from pdscan, piicatcher, and other sources.


