Dear PV Team,
Thank you for the amazing work. I have been working on granted patents for a month or so now. When analyzing patent ID coverage across datasets of granted patents, I found that 157,558 patent IDs in the g_cpc_current are not in the g_patents file. But since the g_patents should contain all granted patents, I gather something odd is going on. Here are the first 50 hits (patent IDs in g_cpc_current but not in g_patents):
['11546372', '11460219', '11573335', '11508942', '11575402', '11511557', '11482116', '11543180', '11581590', '11592641', '11549755', '11493410', '11488231', '11559891', '11601117', '11463518', '11559189', '11582645', '11536968', '11517219', '11508394', '11555561', '11603227', '11462631', '11514961', '11516721', '11480090', '11604981', '11495865', '11580468', '11511735', '11501319', '11541708', '11461969', '11535795', '11587796', '11556722', '11501351', '11616925', '11614512', '11605898', '11520727', '11601479', '11473579', '11462059', '11552520', '11476479', '11589536', '11558638', '11492368']
Can someone shed some light on this?