Dear Community & PatentsView-Team,
I have a short request to the Pre-Grant Publications Data Download Tables, because I am interested in data to all filed USPTO patent applications, so the pre-granted applications.
But if I match the PatentsView pre-grant dataset application.tsv (5,570,731 observation) with the original USPTO Patent Examination Research Dataset (Public PAIR) from 2019, I see that around 10 Million applications are missing in your dataset (as matching variable I used the application number). Can someone tell my why these applications are missing in the Pre-Grant Datasets? Or in other words which conditions had to be fulfilled by the applications to be part of the PatentsView-dataset?
And a second question: Why are there duplicated application-numbers in the PatentsView-dataset?
Thanks a lot in advance for your answers and help!
Best regards and stay healthy,