I have observed a notable decrease in the number of successful matches among datasets from the Pre-Grant Database when using pgpub_id as the linking key, particularly for filing years after 2014. This discrepancy does not appear when matching datasets from the Granted Database using patent_id as the linking key.
For instance, as shown below, the pg_published_application dataset contains 380,954 applications for filing year 2014 and 380,603 for 2015. However, when it is matched with the pg_assignee_disambiguated dataset, the number of successful matches drops sharply, from 75,180 in 2014 to 38,413 in 2015.
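To make the comparison concrete, here is a minimal sketch, in Python rather than Stata, of the per-year match count described above. It assumes each application row carries a pgpub_id and a filing year, and that each assignee row carries a pgpub_id. The function name and the toy rows are hypothetical and are not PatentsView data. Each application is counted once even if it has several assignee rows, and keys are compared as stripped strings purely as a precaution, since I do not know whether key formatting plays any role here.

```python
from collections import Counter


def match_counts_by_year(applications, assignee_rows):
    """Return {filing_year: (total, matched, match_rate)}.

    applications: iterable of (pgpub_id, filing_year) pairs
    assignee_rows: iterable of pgpub_id values (repeats allowed)
    """
    # Compare keys as stripped strings so stray whitespace or
    # int-vs-str storage does not silently break the match.
    assignee_ids = {str(pid).strip() for pid in assignee_rows}

    totals, matched = Counter(), Counter()
    seen = set()
    for pgpub_id, year in applications:
        key = str(pgpub_id).strip()
        if key in seen:
            continue  # count each application only once
        seen.add(key)
        totals[year] += 1
        if key in assignee_ids:
            matched[year] += 1

    return {
        year: (totals[year], matched[year], matched[year] / totals[year])
        for year in sorted(totals)
    }


# Toy, made-up rows for illustration only.
apps = [
    ("20140000001", 2014),
    ("20140000002", 2014),
    ("20150000001", 2015),
    ("20150000002", 2015),
]
assignees = ["20140000001", "20140000001", "20140000002", "20150000001"]

result = match_counts_by_year(apps, assignees)
for year, (total, hit, rate) in result.items():
    print(year, total, hit, f"{rate:.1%}")
```

Reporting the match rate alongside the raw counts shows whether the drop reflects fewer applications or fewer linked assignee records for a given filing year.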
Below are the Stata code and the resulting output:
Since my goal is to examine the patent application behavior of firms from 2012 to 2019, I would appreciate guidance on how to explain and address the sharp drop in matched records after 2014.
Thank you for your assistance!