Hi,
I am currently working on my master's thesis and came across this package for R, which could potentially save me many hours. I have a list of companies collected from the VentureXpert database. For each of these companies I need the yearly (2010, 2011, ..., 2019) number of patent applications and granted patents and, if possible, the total number of forward patent citations (for that company in that year).
Currently, there are 336 firms observed over 10 years each (i = 336, t = 10).
This will be used to build my panel data set. I am measuring innovative performance, so the three patent measures above are the different dependent variables I am looking into. The panel should look like this:
Firm     Year     PatentApplications      PatentCitations      PatentsGranted
Firm 1   Year A   PatentApplications_1A   PatentCitations_1A   PatentsGranted_1A
Firm 1   Year B   PatentApplications_1B   PatentCitations_1B   PatentsGranted_1B
...      ...      ...                     ...                  ...
Firm i   Year t   PatentApplications_it   PatentCitations_it   PatentsGranted_it
Of course, I also include control variables such as firm size (assets), R&D expenditures, and some industry controls.
My question is whether anyone could help me with the code for collecting the patent data with this package. It seems my alternative is to go to the USPTO database manually, look up every firm for every year, and skip citations. I have only had one course in R (RStudio) and I am studying for an MSc in Business, so my "data science"/programming skills are limited. Previous research has mostly used the NBER (HJT) dataset, but it is only updated until 2006, and one of my independent variables was not yet mature enough at that time, which is why I am looking at the period 2010-2019.
Patent Applications = Number of patents filed per firm per year (2010-2019)
Patents Granted = Number of patents granted per firm per year (2010-2019)
Patent Citations = Number of forward patent citations per firm per year (2010-2019)
--> I do not need individual patent data, only totals per firm per year (total number of patent citations/applications/grants per company per year).
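To make the target shape concrete, here is a rough sketch in base R of the aggregation step I have in mind. It does not use the package at all: it assumes the patent-level records have already been downloaded somehow, and the data frame `patents`, its column names (`firm`, `app_year`, `grant_year`, `fwd_cites`) and all values are made up for illustration. It also assumes that citations are attributed to the year the cited patent was filed, which is only one possible definition.

```r
# Hypothetical patent-level records, one row per patent.
# All column names and values are invented for illustration.
patents <- data.frame(
  firm       = c("Firm 1", "Firm 1", "Firm 1", "Firm 2"),
  app_year   = c(2010L, 2010L, 2011L, 2010L),
  grant_year = c(2011L, 2012L, 2012L, NA),  # NA = not granted (yet)
  fwd_cites  = c(3, 0, 5, 0),               # forward citations received by the patent
  stringsAsFactors = FALSE
)
patents$n <- 1  # helper column: each row counts as one patent

# Full firm-year grid, so firm-years without any patents appear as zeros.
panel <- expand.grid(firm = unique(patents$firm), year = 2010:2019,
                     stringsAsFactors = FALSE)

# Applications and citations, summed per firm and filing year (assumed definition).
apps <- aggregate(cbind(n, fwd_cites) ~ firm + app_year, data = patents, FUN = sum)
names(apps) <- c("firm", "year", "PatentApplications", "PatentCitations")

# Grants, summed per firm and grant year (ungranted patents are dropped first).
granted <- patents[!is.na(patents$grant_year), ]
grants <- aggregate(n ~ firm + grant_year, data = granted, FUN = sum)
names(grants) <- c("firm", "year", "PatentsGranted")

# Left-join the counts onto the grid and replace missing counts with zero.
panel <- merge(panel, apps, by = c("firm", "year"), all.x = TRUE)
panel <- merge(panel, grants, by = c("firm", "year"), all.x = TRUE)
count_cols <- c("PatentApplications", "PatentCitations", "PatentsGranted")
panel[count_cols] <- lapply(panel[count_cols], function(x) ifelse(is.na(x), 0, x))
panel <- panel[order(panel$firm, panel$year), ]

print(head(panel))
```

So what I am really missing is the step before this: getting something like `patents` for my 336 firms out of the package.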
Thanks in advance!