3 posts
lucy wang
Last seen: 08/03/2021 - 06:22
Joined: 03/10/2021 - 08:11
Patent id in cpc_current

Hi everyone,

I have a question regarding the data type in cpc_current. I bulk downloaded the cpc_current dataset and found that the data type of the variable patent_id (the linking variable across tables) is float64. However, in other tables (e.g., patent_inventor.tsv.zip), this variable is an object and contains values starting with a letter (e.g., D474886). Does this mean it is impossible to find the classification for patents whose IDs include letters, and if so, why?

Thanks very much in advance for any feedback and comments!

Best,

Lucy

emelluso
Role: administrator
Last seen: 08/02/2021 - 13:19
Joined: 10/21/2020 - 07:51
Hello Lucy, to answer your question about the differing patent_id types: design and plant patents are not included in the CPC reclassifications, so patent IDs with a letter prefix (such as D474886) do not appear in cpc_current. As a result, when you ingest the data, the field comes in as numeric, and you will have to manually set it to character (string) type to match the other files for merging.
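As a rough illustration of that type conversion, here is a minimal pandas sketch. It assumes pandas is the tool in use (the float64 and object dtypes mentioned above suggest so). The tiny in-memory tables and the columns other than patent_id are made up for the example and stand in for the real downloaded files; with real data you would pass the file path to read_csv instead.

```python
import io
import pandas as pd

# Made-up stand-ins for cpc_current and patent_inventor (illustrative only).
cpc_tsv = "patent_id\tsection_id\n3930271\tA\n3930272\tB\n"
inventor_tsv = "patent_id\tinventor_id\n3930271\tinv-1\nD474886\tinv-2\n"

# Option 1: read patent_id as text from the start, in both tables.
cpc = pd.read_csv(io.StringIO(cpc_tsv), sep="\t", dtype={"patent_id": str})
inventors = pd.read_csv(io.StringIO(inventor_tsv), sep="\t", dtype={"patent_id": str})

# Option 2: repair a column that was already parsed as float64.
cpc_numeric = pd.read_csv(io.StringIO(cpc_tsv), sep="\t", dtype={"patent_id": "float64"})
cpc_numeric = cpc_numeric.dropna(subset=["patent_id"])  # NaN cannot be cast to int
# Go through int first so 3930271.0 becomes "3930271" rather than "3930271.0".
cpc_numeric["patent_id"] = cpc_numeric["patent_id"].astype("int64").astype(str)

# Left merge keeps every inventor row; patents absent from the CPC table
# (for example design patents such as D474886) get missing CPC fields.
merged = inventors.merge(cpc, on="patent_id", how="left")
print(merged)
```

Reading the column as a string at load time (Option 1) is probably the safer choice, since it avoids any float round trip and preserves the IDs exactly as they appear in the file.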

Best,

PVTeam

lucy wang
Last seen: 08/03/2021 - 06:22
Joined: 03/10/2021 - 08:11
Thanks very much for your reply! It helps a lot!

Best,

Lucy