Hi. I'm brand new to this dataset.
I'm trying to sketch out in my mind how one would go about generating the data in the PatentsView files from the weekly/monthly files published by USPTO, and I've run into something that's puzzling me.
When I do ...
head uspatentcitation.tsv
... on the current (12/05/2020) version of the file, I get a row in which patent D809697 cites a 1934 patent, ID 1963218. The citation row even has data on the assignee of the 1934 patent. None of this information is in the corresponding patent grant record for D809697 in ipg180206.xml. Since the patent grant full-text data (https://developer.uspto.gov/product/patent-grant-full-text-dataxml) only goes back to 1976, it seems I would not be able to fully construct this row from the electronic patent grant records alone. Does anyone know how this record would have been constructed in the uspatentcitation file?
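In case it helps anyone reproduce the lookup, here is a minimal Python sketch that pulls every citation row for one citing patent out of a tab-separated file. The column names `patent_id` and `citation_id` are my assumption about the file header, the helper name `citations_for` is just something I made up, and the sample rows are placeholders apart from the two IDs mentioned above.

```python
import csv
import io


def citations_for(tsv_file, patent_id, id_column="patent_id"):
    """Return all rows (as dicts) whose id_column equals patent_id.

    tsv_file is any open text file or file-like object whose first line is
    a tab-separated header. The default column name is an assumption.
    """
    reader = csv.DictReader(tsv_file, delimiter="\t")
    return [row for row in reader if row[id_column] == patent_id]


# Tiny in-memory stand-in for uspatentcitation.tsv.
# Header and rows are hypothetical; only D809697 -> 1963218 comes from the post.
sample = io.StringIO(
    "patent_id\tcitation_id\n"
    "D809697\t1963218\n"
    "D809697\tX000002\n"
    "X000001\tX000003\n"
)

rows = citations_for(sample, "D809697")
cited_ids = [row["citation_id"] for row in rows]
print(cited_ids)
```

To run it against the real file, the `io.StringIO` stand-in would be replaced with something like `open("uspatentcitation.tsv", newline="", encoding="utf-8")`, after adjusting the column names to match the actual header.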
More generally, if anyone has any good suggestions as to how I can quickly get up to speed on the topic of reconstructing the PatentsView files from raw data, I'd be most grateful.
Thanks for any guidance.