5 posts
lucy wang
Last seen: 08/09/2021 - 09:45
Joined: 03/10/2021 - 11:11
what is the location_id in patent_assignee and patent_inventor file?

Hi, I am very confused about the location_id in the patent_inventor and patent_assignee datasets. According to the directory, location_id is the unique location id generated by the disambiguation algorithm. So I tried to match a random location_id from the patent_assignee dataset (e.g. ad38f64e-f7dd-4268-9215-971dfdf071fe with patent_id = 6795487) against the id in the location file, which shares the same definition, to see which state it represents. But I couldn't find any match.

Also, does the location_id in the patent_inventor dataset denote the location of the patent, while the one in the patent_assignee dataset denotes the location of the assignee? I tried to cross-validate the example above using this url: url = 'https://www.patentsview.org/api/patents/query?q={"_eq":{"patent_id":"6795487"}}&f=["patent_number","assignee_id","inventor_id","assignee_location_id","inventor_location_id"]'. But this time the returned result is assignee_location_id = '52.4738|-8.1619', which is different from "ad38f64e-f7dd-4268-9215-971dfdf071fe". Am I missing anything here? Any help and advice is much appreciated, thanks!

 

Best,

Lucy

Russ
Last seen: 12/06/2024 - 09:12
Joined: 11/14/2017 - 22:15
PINNED POST

Hi Lucy,

The first pinned post under Data Quality explains how the raw_ tables are linked. I think the same applies to the non-raw files. It's a three-way join to get from a patent to its inventors' locations. See https://patentsview.org/forum/8/topic/173
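A three-way join of this kind might look roughly like the sketch below. The table and column names (rawinventor.rawlocation_id, rawlocation.id, rawlocation.location_id, location.id) are assumptions for illustration, and the rows are made up; check the linked thread and the data dictionary for the actual keys before relying on it.

```python
import sqlite3

# In-memory database with tiny, made-up tables.
# All table and column names here are ASSUMED for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE rawinventor (patent_id TEXT, inventor_id TEXT, rawlocation_id TEXT);
CREATE TABLE rawlocation (id TEXT, location_id TEXT);
CREATE TABLE location (id TEXT, city TEXT, state TEXT, country TEXT);

INSERT INTO rawinventor VALUES
    ('P1', 'inv-1', 'rawloc-1'),
    ('P1', 'inv-2', 'rawloc-2');
INSERT INTO rawlocation VALUES
    ('rawloc-1', 'loc-A'),
    ('rawloc-2', 'loc-B');
INSERT INTO location VALUES
    ('loc-A', 'City A', 'ST', 'US'),
    ('loc-B', 'City B', NULL, 'IE');
""")

# patent -> raw inventor rows -> raw location rows -> disambiguated location
rows = conn.execute(
    """
    SELECT ri.patent_id, ri.inventor_id, l.city, l.state, l.country
    FROM rawinventor AS ri
    JOIN rawlocation AS rl ON rl.id = ri.rawlocation_id
    JOIN location    AS l  ON l.id  = rl.location_id
    WHERE ri.patent_id = ?
    ORDER BY ri.inventor_id
    """,
    ("P1",),
).fetchall()

for row in rows:
    print(row)
```

The point is only that the id stored next to the inventor is not looked up in the location file directly; it goes through the intermediate table first.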

Note that the forum's URLs have changed over time. In the thread above you can wind up in older posts that use the original URLs. Those links no longer work but can be manually adjusted. For example:
then: http://www.patentsview.org/community/forum/7/topic/107
 now: https://patentsview.org/forum/7/topic/107

The same thing applies if you use the Search PatentsView Community Site and land in an older post.

Russ

lucy wang
Last seen: 08/09/2021 - 09:45
Joined: 03/10/2021 - 11:11
FOLLOW-UP QUESTION

Hi Russ,

Thanks very much for the reply! A follow-up question: why do people still need access to the raw tables, given that we already have the non-raw tables? The only reason I can see is that the locations of inventors and assignees may only be obtainable from the rawinventor and rawassignee datasets. Are there any other reasons for turning to the raw tables? Thanks again!

Russ
Last seen: 12/06/2024 - 09:12
Joined: 11/14/2017 - 22:15
POSSIBLE FOLLOW-UP ANSWER

Hi Lucy,

The raw data is what was in the patent. The data load tries to disambiguate the inventors and locations, and the non-raw files hold the result. There are cases where the disambiguation makes mistakes. Someone might want to see the data as it was originally, or maybe try to come up with a different disambiguation algorithm. Here's an example where multiple names are considered the same person: {"inventor_id":"3933684-2"}. If you add patent_number to the f field, some but not all of the patents that come back belong to my brother.
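For anyone who wants to try that query, here is a minimal sketch of how the URL could be built. It assumes the patents endpoint quoted earlier in this thread still accepts q and f as JSON-encoded parameters; the helper name build_query_url is hypothetical, and no request is actually sent.

```python
import json
from urllib.parse import parse_qs, urlencode, urlsplit

# Endpoint as quoted earlier in this thread; it may have changed since.
BASE_URL = "https://www.patentsview.org/api/patents/query"


def build_query_url(criteria, fields, base_url=BASE_URL):
    """Return a GET URL with q (criteria) and f (fields) as JSON strings.

    Hypothetical helper: it only builds the URL and performs no network call.
    """
    params = {"q": json.dumps(criteria), "f": json.dumps(fields)}
    return base_url + "?" + urlencode(params)


# The inventor_id from the post above, with patent_number added to f.
url = build_query_url(
    {"inventor_id": "3933684-2"},
    ["inventor_id", "patent_number"],
)
print(url)

# Decode the query string again to confirm the parameters round-trip.
decoded = parse_qs(urlsplit(url).query)
print(json.loads(decoded["q"][0]))
print(json.loads(decoded["f"][0]))
```

Letting urlencode handle the braces and quotes avoids having to escape the JSON by hand when pasting the URL somewhere.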

Russ

lucy wang
Last seen: 08/09/2021 - 09:45
Joined: 03/10/2021 - 11:11
GOT IT, THANKS! LUCY

Got it, thanks!

 

Lucy