- About
- Methods & Sources
- Community
Pre-Grant Data Download Dictionary
The pre-grant publications data includes all publications released by the USPTO for download from 2001 through the most recent data update.
pg_granted_pgpubs_crosswalk | Table mapping patent application's to their corresponding granted patent numbers | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | document number | 20100273145 | all | bigint(16) |
patent_id | patent number | 9134302 | all | varchar(20) |
application_id | application number | 8479995 | all | varchar(16) |
current_pgpub_id_flag | Flag indicating whether this row contains the most recent pgpub_id for a given application (TRUE = This pgpub_id refers to the most recent publication of this application; FALSE = This application has been re-published or corrected in a more recent publication) | TRUE | all | boolean |
current_patent_id_flag |
Flag indicating whether this row contains the most recent patent_id for a given application (TRUE = This patent_id refers to the most recent patent granted to this application; FALSE = This patent has been withdrawn and re-issued under a more recent patent_id) |
FALSE | all | boolean |
pg_applicant_not_disambiguated | US party and applicant information as it appears in the source XML files | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | document number | 20200075150 | all | bigint(16) |
applicant_sequence | order in which the party/applicant appears in the source XML file; sequence for this table begins at {1}. | 3 | all | int(11) |
applicant_authority | The type of interest the applicant has in the intellectual property rights of a patent or application. May be equal to inventor, legal-representative, party-of-interest, obligated-assignee, assignee, or null. | legal-representative | 2005-present | varchar(30) |
raw_applicant_name_first | first name of the party/applicant, if an individual | Janet | all | varchar(256) |
raw_applicant_name_last | last name of the party/applicant, if an individual | McCallister | all | varchar(256) |
raw_applicant_organization | organization of the party/applicant, if applicable | FUJIFILM Corporation | all | varchar(256) |
applicant_type | type of the party/applicant | applicant | all | varchar(64) |
applicant_designation | designation of the party/applicant | us-only | all | varchar(32) |
rawlocation_id | parties' unique location id | 0000c51a-0d76-11eb-a344-121df0c29c1e | all | varchar(128) |
pg_assignee_disambiguated | Disambiguated assignee data. Methodology description can be found here | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | document number | 20080163497 | all | bigint(16) |
assignee_sequence | order in which assignee appears in patent file; sequence for this table begins at {1}. | 1 | all | int(11) |
assignee_id | unique assignee ID generated by the disambiguation algorithm | 000009fd-0268-46a4-b493-c8e5f7672331 | all | varchar(64) |
disambig_assignee_individual_name_first | first name, if assignee is individual | Douglas Norman | all | varchar(256) |
disambig_assignee_individual_name_last | last name, if assignee is individual | Hartz | all | varchar(256) |
disambig_assignee_organization | organization name if assignee is organization | BIC-Violex SA | all | varchar(256) |
assignee_type | assignee type (1- Unassigned, 2 - US Company or Corporation, 3 - Foreign Company or Corporation, 4 - US Individual, 5 - Foreign Individual, 6 - US Federal Government, 7 - Foreign Government, 8 - US County Government, 9 - US State Government. Note: A "1" appearing before any of these codes signifies part interest) | 4 | 2002 and after | int(11) |
location_id | assignee's location. If the assignee's sequence number is 1, then his/her location is the "location" of the related patent | 78b9e85a-3bc6-11eb-a3cd-121df0c29c1e | all | varchar(128) |
pg_assignee_not_disambiguated | Raw assignee information as it appears in the source XML files | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | document number | 20080163497 | all | bigint(16) |
assignee_sequence | order in which assignee appears in patent file; sequence for this table begins at {1}. | 1 | all | int(11) |
assignee_id | unique assignee ID generated by the disambiguation algorithm | 000009fd-0268-46a4-b493-c8e5f7672331 | all | varchar(64) |
raw_assignee_individual_name_first | first name, if assignee is individual | Douglas Norman | all | varchar(256) |
raw_assignee_individual_name_last | last name, if assignee is individual | Hartz | all | varchar(256) |
raw_assignee_organization | organization name if assignee is organization | BIC-Violex SA | all | varchar(256) |
assignee_type | assignee type (1- Unassigned, 2 - US Company or Corporation, 3 - Foreign Company or Corporation, 4 - US Individual, 5 - Foreign Individual, 6 - US Federal Government, 7 - Foreign Government, 8 - US County Government, 9 - US State Government. Note: A "1" appearing before any of these codes signifies part interest) | 4 | 2002 and after | int(11) |
rawlocation_id | assignee's location. If the assignee's sequence number is 1, then his/her location is the "location" of the related patent | 78b9e85a-3bc6-11eb-a3cd-121df0c29c1e | all | varchar(128) |
pg_brf_sum_text_{year} | Text of application summary for a given year | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | document number | 20050011018 | all | bigint(16) |
summary_text | text of the summary itself | The present disclosure is directed .. | all | mediumtext |
pg_claims_{year} | Full text of application claims, including dependency and sequence | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | document number | 20050010988 | all | bigint(16) |
claim_sequence | order in which claims appear in patent file; sequence for this table begins at {1}. | 2 | all | int(11) |
claim_text | claim text | 2 . A garment according to claim 1 , wherein: … | all | mediumtext |
dependent | sequence number of claim this is dependent on. NULL if independent | claim 1 | all | varchar(512) |
claim_number | contains claim number of claim formatted as 0-prefixed 5 digit number | 00002 | all | varchar(128) |
pg_cpc_at_issue | CPC classification at the time the application was submitted | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | document number | 20170047568 | all | bigint(16) |
cpc_sequence | order in which the CPC classifications occur in the raw XML file; sequence for this table begins at {1}. | 14 | all | int(11) |
cpc_version_indicator | date corresponding to the version under which the classification was made | 2013-01-01 | all | date |
cpc_section | CPC section (A = Human Necessities, B = Performing Operations; Transporting, C = Chemistry; Metallurgy, D = Textiles; Paper, E = Fixed Constructions, F = Mechanical Engineering; Lighting; Heating; Weapons; Blasting Engines or Pumps, G = Physics, H = Electricity, Y = General Tagging of New Technological Developments) | B | all | varchar(16) |
cpc_class | CPC section id plus CPC class value | B65 | all | varchar(16) |
cpc_subclass | CPC group id: https://www.uspto.gov/web/patents/classification/cpc/html/cpc.html | B65H | all | varchar(16) |
cpc_group | CPC subgroup id: https://www.uspto.gov/web/patents/classification/cpc/html/cpc.html | B65H35/06 | all | varchar(16) |
cpc_type | Category of CPC classification "Inventional" or "Additional" | Inventional | all | varchar(16) |
cpc_action_date | Date corresponding to the approval of the CPC classification | 2017-02-16 | all | date |
pg_cpc_current | Current CPC classification of the application | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | document number | 20190113923 | all | bigint(16) |
cpc_sequence | order in which the CPC classifications occur in the raw XML file; sequence for this table begins at 0. | 27 | all | int(11) |
cpc_version_indicator | date corresponding to the version under which the classification was made | 2013-01-01 | all | date |
cpc_section | CPC section (A = Human Necessities, B = Performing Operations; Transporting, C = Chemistry; Metallurgy, D = Textiles; Paper, E = Fixed Constructions, F = Mechanical Engineering; Lighting; Heating; Weapons; Blasting Engines or Pumps, G = Physics, H = Electricity, Y = General Tagging of New Technological Developments) | Y | all | varchar(16) |
cpc_class | CPC section id plus CPC class value | Y02 | all | varchar(16) |
cpc_subclass | CPC group id: https://www.uspto.gov/web/patents/classification/cpc/html/cpc.html | Y02T | all | varchar(16) |
cpc_group | CPC subgroup id: https://www.uspto.gov/web/patents/classification/cpc/html/cpc.html | Y02T10/7291 | all | varchar(16) |
cpc_type | Category of CPC classification "Inventional" or "Additional" | additional | all | varchar(16) |
pg_cpc_title | CPC group classification at issue of the patent | |||
Data Element Name | Definition | Example | Years Present | Type |
cpc_subclass | CPC group id | A01B | all | varchar(20) |
cpc_subclass_title | description of CPC group | SOIL WORKING IN AGRICULTURE OR FORESTRY; | all | varchar(512) |
cpc_group | CPC subgroup id | A01B1/00 | all | varchar(20) |
cpc_group_title | description of CPC subgroup | Hand tools | all | mediumtext |
cpc_class | CPC subsection id | A01 | all | varchar(20) |
cpc_class_title | description of CPC subsection | AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING | all | varchar(512) |
pg_detail_desc_text_{year} | Text of detailed description of application for a given year | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | document number | 20080000006 | all | bigint(16) |
description_text | full text of the detailed description | This view also illustrates optional shoulder… | all | longtext |
description_length | length of the text field | 72825 | all | bigint(16) |
pg_draw_desc_text_{year} | Text of description of the application's drawings | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | document number | 20150007372 | all | bigint(16) |
draw_desc_sequence | order in which drawing descriptions appear in application file, often the same as the figure id; sequence for this table begins at {1}. | 15 | all | int(11) |
draw_desc_text | text of the drawing descriptions | FIG. 22 is a simplified enlarged view… | all | mediumtext |
pg_foreign_priority | Information about an earlier patent filing in a foreign country which gives the claim priority | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | document number | 20110296671 | all | bigint(16) |
priority_claim_sequence | order in which priority claims appear in publication; sequence for this table begins at {1} | 1 | all | int(11) |
priority_claim_kind | type of priority claim (international, national, regional) | national | 2005 and after | varchar(20) |
foreign_application_id | foreign patent application number | 93114223 | all | varchar(256) |
filing_date | date on which the foreign document was filed | 2004-05-20 | 2002 and after | date |
foreign_country_filed | country in which the foreign document was filed | TW | all | varchar(256) |
pg_gov_interest | Mapping of patent numbers to raw government interest text | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | document number | 20100273145 | all | bigint(16) |
gi_statement | raw government interest text | The invention herein described was made in the course of work under a grant or award from the U.S. Army. | all | mediumtext |
pg_gov_interest_contracts | Mapping of Federal contract award numbers to patent numbers | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | patent number | 3931132 | all | varchar(20) |
contract_award_number | federal contract award number | 83-568 73 | all | varchar(255) |
pg_gov_interest_org | Federal agencies with government interests in patents | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | patent number | 8636942 | all | varchar(255) |
gi_organization_id | unique organization id | 1 | all | int(11) |
fedagency_name | name of Federal department, agency, bureau, etc. | Department of Veterans Affairs | all | varchar(255) |
level_one | position of the focal agency in a hierarchical set of relationships with the parent agency (e.g., DHHS) at level_one and child agencies at lower levels (e.g., NIH at level_two and NIDA at level_three). government_organization.name is equal to one and only one of the values in level_one, level_two, and level_three | Department of Veterans Affairs | all | varchar(255) |
level_two | position of the focal agency in a hierarchical set of relationships with the parent agency (e.g., DHHS) at level_one and child agencies at lower levels (e.g., NIH at level_two and NIDA at level_three). government_organization.name is equal to one and only one of the values in level_one, level_two, and level_three | U.S. Census Bureau | all | varchar(255) |
level_three | position of the focal agency in a hierarchical set of relationships with the parent agency (e.g., DHHS) at level_one and child agencies at lower levels (e.g., NIH at level_two and NIDA at level_three). government_organization.name is equal to one and only one of the values in level_one, level_two, and level_three | Advanced Technology Program | all | varchar(255) |
pg_inventor_disambiguated | Disambiguated inventor data. Methodology description can be found here | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | document number | 20150287775 | all | bigint(16) |
inventor_sequence | order in which the inventor appears in patent file; sequence for this table begins at {0}. | 2 | all | int(11) |
inventor_id | unique inventor id generated by the disambiguation algorithm | fl:n_ln:benichou-1 | all | varchar(64) |
disambig_inventor_name_first | first name of the inventor | Shigeharu | all | varchar(256) |
disambig_inventor_name_last | last name of the inventor | Yamagami | all | varchar(256) |
gender_code | inventor attributed gender (F = female, M = male, U = no gender attributed) | U | all | varchar(1) |
location_id | inventor's location. If the inventor's sequence number is 0, then his/her location is the "location" of the related patent | 00000249-bdce-11ea-8a73-121df0c29c1e | all | varchar(128) |
pg_inventor_not_disambiguated | Raw inventor information as it appears in the source XML files | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | document number | 20150287775 | all | bigint(16) |
inventor_sequence | order in which the inventor appears in patent file; sequence for this table begins at {0}. | 2 | all | int(11) |
inventor_id | unique inventor id generated by the disambiguation algorithm | fl:n_ln:benichou-1 | all | varchar(64) |
raw_inventor_name_first | first name of the inventor | Shigeharu | all | varchar(256) |
raw_inventor_name_last | last name of the inventor | Yamagami | all | varchar(256) |
deceased_flag | flag determining whether the inventor is deceased or not | FALSE | all | boolean |
rawlocation_id | inventor's location. If the inventor's sequence number is 0, then his/her location is the "location" of the related patent | 00000249-bdce-11ea-8a73-121df0c29c1e | all | varchar(128) |
pg_ipc_at_issue | International Patent Classification | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | patent number | 20080166814 | all | varchar(20) |
ipc_sequence | order in which IPC class appears in patent file; sequence for this table begins at {1}. | 2 | all | int(11) |
classification_level | IPC classification level (A = advanced level) | A | 2006 and after | varchar(256) |
section | IPC section (A = Human Necessities, B = Performing Operations; Transporting, C = Chemistry; Metallurgy, D = Textiles; Paper, E = Fixed Constructions, F = Mechanical Engineering; Lighting; Heating; Weapons; Blasting, G = Physics, H = Electricity) | G | all | varchar(256) |
ipc_class | IPC class | 01 | all | varchar(256) |
subclass | IPC subclass | N | all | varchar(256) |
main_group | IPC group | 33 | all | varchar(256) |
subgroup | IPC subgroup | 00 | all | varchar(256) |
classification_value | IPC classification value ( I = invention information; N = non-invention information) | I | 2006 and after | varchar(256) |
classification_status | IPC classification status ( B = Basic or Original) | B | 2006 and after | varchar(256) |
classification_data_source | IPC classification data source ( H = Human - Generated; M = Machine - Generated; G = Generated via Software) | H | all | varchar(256) |
action_date | issue date of the patent grant | 2008-07-10 | 2006 and after | date |
ipc_version_indicator | IPC version indicator | 2006-01-01 | 2006 and after | date |
pg_location_disambiguated | Disambiguated location data for inventors, assignees, and applicants | |||
Data Element Name | Definition | Example | Years Present | Type |
location_id | unique location id generated by the disambiguation algorithm | 0009wn7out97 | all | varchar(128) |
disambig_city | city associated with a location_id | Stewart | all | varchar(128) |
disambig_state | state associated with a location_id | MN | all | varchar(20) |
disambig_country | country associated with a location_id | US | all | varchar(16) |
latitude | latitude for location_id | 44.7247 | all | float |
longitude | longitude for location_id | -94.4858 | all | float |
county | county associated with location_id | Mcleod | all | varchar(60) |
state_fips | state FIPS code associated with a location_id | 27 | all | varchar(2) |
county_fips | county FIPS code associated with a location_id | 85 | all | varchar(6) |
pg_location_not_disambiguated | Raw location information as it appears in the source XML files | |||
Data Element Name | Definition | Example | Years Present | Type |
rawlocation_id | unique id | 000035ewn5fb3oerojqwcw4hg | all | varchar(128) |
location_id | unique location id generated by the disambiguation algorithm | q6d41gon4fey | all | varchar(128) |
raw_city | city associated with a location_id | Colorado Springs | all | varchar(128) |
raw_state | state associated with a location_id | CO | all | varchar(20) |
raw_country | country associated with a location_id | US | all | varchar(10) |
pg_pct_data | PCT or regional filing data | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | document number | 20080163436 | all | varchar(20) |
published_or_filed_date | date published or filed | 2005-11-21 | all | date |
pct_371_date | PCT Section 371(c)(1)(2)(4)date, date when pct application was filed | 2007-07-16 | all | date |
pct_102_date | 35 U.S.C. § 102(e) date for pct applications | 2015-08-11 | 2001 and after | date |
filed_country | country filed in | WO | 2002 and after | varchar(256) |
application_kind | kind of application (numerical for filed, alphabetical for published) | 00 | 2002 and after | varchar(20) |
pct_doc_number | pct document number | PCT/EP2005/056103 | all | varchar(128) |
pct_doc_type | whether the document has been published or just filed | pct_application | 2002 and after | varchar(20) |
pg_persistent_assignee | Disambiguated Assignee ID across data updates | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | document number | 20060263593 | all | bigint(16) |
disamb_assignee_id_xxxxxxxx |
the disambiguated assignee ids associated with the raw assignee on a given pgpub document with the given sequence during the data updates matching the date stamps in the field names. Data on this table goes back to the 20201229 update. | 000009fd-0268-46a4-b493-c8e5f7672331 | all | varchar(64) |
assignee_sequence | order in which assignee appears in application file; sequence for this table begins at {1}. | 2 | all | int(11) |
pg_persistent_inventor | Disambiguated Inventor ID across data updates | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | document number | 20060263593 | all | bigint(16) |
disamb_inventor_id_xxxxxxxx | the disambiguated inventor ids associated with the raw inventor on a given pgpub document with the given sequence during the data updates matching the date stamps in the field names. Data on this table goes back to the 20210330 update. | fl:n_ln:benichou-1 | all | varchar(64) |
inventor_sequence | order in which inventor appears in application file; sequence for this table begins at {0}. | 3 | all | int(11) |
pg_published_application | Data concerning pre-granted patent applications | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | document number | 20100071101 | all | bigint(16) |
application_id | unique application identifying number | 12283848 | all | varchar(16) |
filing_date | date of application filing | 2008-09-15 | all | date |
patent_type | category of application ('utility' or 'plant') | plant | all | varchar(20) |
filing_type | type of publication filing | us-republication-corrected | all | varchar(128) |
published_date | date of publication | 2014-07-31 | all | date |
wipo_kind | kind code of publication: https://www.uspto.gov/learning-and-resources/support-centers/electronic-business-center/kind-codes-included-uspto-patent | A9 | all | varchar(8) |
series_code | unique application identifying number | 12 | all | int(11) |
application_title | title of application | Japanese maple plant named 'King's Blood' | all | mediumtext |
rule_47_flag | flag for inventor who was unable to be contacted at filing of application | FALSE | all | varchar(8) |
filename | name of XML file from which the data was parsed | ipa100318.xml | all | varchar(16) |
pg_rel_app_text | Description of related patent applications | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | document number | 20200053967 | all | bigint(16) |
rel_app_text | text of the description of the related application in question | This application claims the benefit … | all | mediumtext |
pg_published_application_abstract | Data concerning abstracts of pre-granted patent applications | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | document number | 20100071101 | all | bigint(16) |
application_abstract | abstract of application | A new and distinct cultivar of Japanese … | all | mediumtext |
pg_us_rel_doc |
US Related Documents for patent applications | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | patent number | 9683848 | 2005 and after | varchar(20) |
related_doc_type | defines the type of documentation | continuation-in-part | 2005 and after | varchar(64) |
related_doc_kind | kind of document (text) | parent document | 2005 and after | varchar(64) |
related_doc_number | related document number | 13336060 | 2005 and after | varchar(64) |
published_country | country where related document was published | US | 2005 and after | varchar(20) |
related_doc_published_date | publication date of the related document | 12/23/2011 | 2005 and after | date |
related_doc_sequence | order in which the related document appears in patent file; sequence for this table begins at {1}. | 36 | 2005 and after | int(11) |
pg_uspc_at_issue | USPTO patent classification at patent issue date | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | document number | 20060263593 | all (incomplete for 2002-2004) | bigint(16) |
uspc_sequence | order in which uspc class appears in patent file; sequence for this table begins at {1}. | 7 | all (incomplete for 2002-2004) | int(11) |
uspc_mainclass_id | uspc mainclass at issue | 313 | all (incomplete for 2002-2004) | varchar(16) |
uspc_mainclass_title | description of uspc mainclass | PRESSES | all (incomplete for 2002-2004) | varchar(256) |
uspc_subclass_id | uspc subclass at issue | 313/116000 | all (incomplete for 2002-2004) | varchar(16) |
uspc_subclass_title | description of uspc subclass | BINDING264 | all (incomplete for 2002-2004) | varchar(512) |
pg_wipo_technology | WIPO technology classification of the patent | |||
Data Element Name | Definition | Example | Years Present | Type |
pgpub_id | document number | 3930271 | all | varchar(20) |
wipo_field_sequence | order in which WIPO technology field appears on patent; sequence for this table begins at {0}. | 0 | all | bigint(20) |
wipo_field_id | WIPO technology field id as derived from crosswalk http://www.wipo.int/export/sites/www/ipstats/en/statistics/patents/xls/ipc_technology.xls | 33 | all | double |
wipo_sector_title | WIPO technology sector title | Electrical engineering | all | varchar(60) |
wipo_field_title | WIPO technology field title | Electrical machinery, apparatus, energy | all | varchar(255) |