Skip to main content
 
 
 
IN THIS SECTION
2 posts
Sanjana
Last seen: 09/05/2023 - 05:28
Joined: 06/28/2021 - 07:28
Interpreting inventor_sequence = 0?

The pg_inventor dataset has almost 1139852 observations where the inventor_sequence is 0. I was wondering what this means?

The "Pre-Grant Data Download Dictionary" defines inventor_sequence as the "order in which the inventor appears in patent file". Intuitively, this  would mean that inventors with a inventor_sequence of 0 do not appear at all, but that information would be redundant. I would be grateful to hear from anyone who knows more about this.

Thank you!

PVTeam
Role: moderator
Last seen: 04/24/2024 - 12:31
Joined: 10/17/2017 - 10:47
INTERPRETING INVENTOR_SEQUENCE = 0

Hi Sanjana

Our data tables that contain a sequence represent the order of records for a patent or publication using one of two formats:

Some tables begin with 0 indicating the first listed record, 1 indicating the second, and so forth. This is the standard for most coding languages, such as python.

Some other tables, including the pregrant raw assignee table and the granted applicant table, begin their sequence values with 1 indicating the first listed inventor, 2 indicating the second, and so forth. This is typical of a smaller number of coding and statistical languages, such as R, and as you note, is more intuitive.

in most cases these different sequence baselines reflect differences in the Patent Office’s source data.

a small number of our tables currently include a shift between these two strategies; In the currently published pre-grant inventor table, records are indexed from 1 up through 2020-06-25, and indexed from 0 beginning with 2020-07-02. 

we are currently ensuring table-internal consistency for our next update, and are considering establishing cross-table uniformity in the future. 

Beginning with our next data release, we plan to explicitly state the starting sequence value for each table in our granted and pregrant data dictionaries.

Best,
PVTeam