Gender attribution in 30th Oct 2022 file - Unknowns have increased
Hi
I noticed that the gender attribution in the "inventor" data has changed in the latest 30th Oct 2022 dataset, compared to the 29th March 2022 dataset.
1. The number of patents with "unknown" male_flag has significantly increased
Data date--> | 29th March 2022 | 30th Oct 2022 |
patent_id - total count | 8061179 | 8168689 |
male - patent count with male inventor | 7582224 | 7452889 |
female - patent count with female inventor | 1554108 | 1636385 |
unknown - patent count with unknown inventor | 761735 | 1894117 |
2. The overall count of inventors has come down and 'unknown' have increased
Data date--> | 29th March 2022 | 30th Oct 2022 | ||
male_flag - Inventor count | Freq. | Percent | Freq. | Percent |
Female | 507,762 | 12.43 | 461,892 | 12.06 |
Male | 3,127,004 | 76.57 | 2,905,327 | 75.86 |
Unknown | 449,282 | 11 | 462,537 | 12.08 |
Total | 4,084,048 | 100 | 3,829,756 | 100 |
I need to sure whether the 30th Oct is better and reliable over the previous version.
Please advise.
Thanks!
Shashi