First, thanks for making this amazing data available. I saw there was a similar question about this merge posted before but it didn't solve my issue.
I'm having trouble understanding why not the unique patent IDs present in the assignee_disambiguated and inventor_disambiguated are not the same (or one is the subset of the other). Why is this not true?
Trying to merge both, I find that 854 unique patent_ids in the assignee_disambiguated.tsv are not in the inventor_disambiguated.tsv. For recent years they are all statutory invention registration types. But for the inventor_disambiguated there are circa. 9.2 million unique patent_ids with no match in the assignee_disambiguated data. Does this happen because many patents have no assignee and do not show up in the assignee_disambiguated data?
Many thanks in advance,