Skip to main content
 
 
 
IN THIS SECTION

Pre-Grant Data Download Dictionary

The pre-grant publications data includes all publications released by the USPTO for download from 2001 through the most recent data update.

pg_granted_pgpubs_crosswalk Table mapping patent application's to their corresponding granted patent numbers
Data Element Name Definition Example Years Present Type
pgpub_id document number 20100273145 all bigint(16)
patent_id patent number 9134302 all varchar(20)
application_id application number 8479995 all varchar(16)
current_pgpub_id_flag Flag indicating whether this row contains the most recent pgpub_id for a given application (TRUE = This pgpub_id refers to the most recent publication of this application; FALSE = This application has been re-published or corrected in a more recent publication) TRUE all boolean
current_patent_id_flag

Flag indicating whether this row contains the most recent patent_id for a given application (TRUE = This patent_id refers to the most recent patent granted to this application; FALSE = This patent has been withdrawn and re-issued under a more recent patent_id)

FALSE all boolean
 
pg_applicant_not_disambiguated US party and applicant information as it appears in the source XML files
Data Element Name Definition Example Years Present Type
pgpub_id document number 20200075150 all bigint(16)
applicant_sequence order in which the party/applicant appears in the source XML file; sequence for this table begins at {1}. 3 all int(11)
applicant_authority The type of interest the applicant has in the intellectual property rights of a patent or application. May be equal to inventor, legal-representative, party-of-interest, obligated-assignee, assignee, or null. legal-representative 2005-present varchar(30)
raw_applicant_name_first first name of the party/applicant, if an individual Janet all varchar(256)
raw_applicant_name_last last name of the party/applicant, if an individual McCallister all varchar(256)
raw_applicant_organization organization of the party/applicant, if applicable FUJIFILM Corporation all varchar(256)
applicant_type type of the party/applicant applicant all varchar(64)
applicant_designation designation of the party/applicant us-only all varchar(32)
rawlocation_id parties' unique location id 0000c51a-0d76-11eb-a344-121df0c29c1e all varchar(128)
 
pg_assignee_disambiguated Disambiguated assignee data. Methodology description can be found here
Data Element Name Definition Example Years Present Type
pgpub_id document number 20080163497 all bigint(16)
assignee_sequence order in which assignee appears in patent file; sequence for this table begins at {1}. 1 all int(11)
assignee_id unique assignee ID generated by the disambiguation algorithm 000009fd-0268-46a4-b493-c8e5f7672331 all varchar(64)
disambig_assignee_individual_name_first first name, if assignee is individual Douglas Norman all varchar(256)
disambig_assignee_individual_name_last last name, if assignee is individual Hartz all varchar(256)
disambig_assignee_organization organization name if assignee is organization BIC-Violex SA all varchar(256)
assignee_type assignee type (1- Unassigned, 2 - US Company or Corporation, 3 - Foreign Company or Corporation, 4 - US Individual, 5 - Foreign Individual, 6 - US Federal Government, 7 - Foreign Government, 8 - US County Government, 9 - US State Government. Note: A "1" appearing before any of these codes signifies part interest) 4 2002 and after int(11)
location_id assignee's location. If the assignee's sequence number is 1, then his/her location is the "location" of the related patent 78b9e85a-3bc6-11eb-a3cd-121df0c29c1e all varchar(128)
 
pg_assignee_not_disambiguated Raw assignee information as it appears in the source XML files
Data Element Name Definition Example Years Present Type
pgpub_id document number 20080163497 all bigint(16)
assignee_sequence order in which assignee appears in patent file; sequence for this table begins at {1}. 1 all int(11)
assignee_id unique assignee ID generated by the disambiguation algorithm 000009fd-0268-46a4-b493-c8e5f7672331 all varchar(64)
raw_assignee_individual_name_first first name, if assignee is individual Douglas Norman all varchar(256)
raw_assignee_individual_name_last last name, if assignee is individual Hartz all varchar(256)
raw_assignee_organization organization name if assignee is organization BIC-Violex SA all varchar(256)
assignee_type assignee type (1- Unassigned, 2 - US Company or Corporation, 3 - Foreign Company or Corporation, 4 - US Individual, 5 - Foreign Individual, 6 - US Federal Government, 7 - Foreign Government, 8 - US County Government, 9 - US State Government. Note: A "1" appearing before any of these codes signifies part interest) 4 2002 and after int(11)
rawlocation_id assignee's location. If the assignee's sequence number is 1, then his/her location is the "location" of the related patent 78b9e85a-3bc6-11eb-a3cd-121df0c29c1e all varchar(128)
 
pg_brf_sum_text_{year} Text of application summary for a given year
Data Element Name Definition Example Years Present Type
pgpub_id document number 20050011018 all bigint(16)
summary_text text of the summary itself The present disclosure is directed .. all mediumtext
 
pg_claims_{year} Full text of application claims, including dependency and sequence
Data Element Name Definition Example Years Present Type
pgpub_id document number 20050010988 all bigint(16)
claim_sequence order in which claims appear in patent file; sequence for this table begins at {1}. 2 all int(11)
claim_text claim text 2 . A garment according to claim 1 , wherein: … all mediumtext
dependent sequence number of claim this is dependent on. NULL if independent claim 1 all varchar(512)
claim_number contains claim number of claim formatted as 0-prefixed 5 digit number 00002 all varchar(128)
 
pg_cpc_at_issue CPC classification at the time the application was submitted
Data Element Name Definition Example Years Present Type
pgpub_id document number 20170047568 all bigint(16)
cpc_sequence order in which the CPC classifications occur in the raw XML file; sequence for this table begins at {1}. 14 all int(11)
cpc_version_indicator date corresponding to the version under which the classification was made 2013-01-01 all date
cpc_section CPC section  (A = Human Necessities, B = Performing Operations; Transporting, C = Chemistry; Metallurgy, D = Textiles; Paper, E = Fixed Constructions, F = Mechanical Engineering; Lighting; Heating; Weapons; Blasting Engines or Pumps, G = Physics, H = Electricity, Y = General Tagging of New Technological Developments) B all varchar(16)
cpc_class CPC section id plus CPC class value B65 all varchar(16)
cpc_subclass CPC group id: https://www.uspto.gov/web/patents/classification/cpc/html/cpc.html B65H all varchar(16)
cpc_group CPC subgroup id: https://www.uspto.gov/web/patents/classification/cpc/html/cpc.html B65H35/06 all varchar(16)
cpc_type Category of CPC classification "Inventional" or "Additional" Inventional all varchar(16)
cpc_action_date Date corresponding to the approval of the CPC classification 2017-02-16 all date
 
pg_cpc_current Current CPC classification of the application
Data Element Name Definition Example Years Present Type
pgpub_id document number 20190113923 all bigint(16)
cpc_sequence order in which the CPC classifications occur in the raw XML file; sequence for this table begins at 0. 27 all int(11)
cpc_version_indicator date corresponding to the version under which the classification was made 2013-01-01 all date
cpc_section CPC section  (A = Human Necessities, B = Performing Operations; Transporting, C = Chemistry; Metallurgy, D = Textiles; Paper, E = Fixed Constructions, F = Mechanical Engineering; Lighting; Heating; Weapons; Blasting Engines or Pumps, G = Physics, H = Electricity, Y = General Tagging of New Technological Developments) Y all varchar(16)
cpc_class CPC section id plus CPC class value Y02 all varchar(16)
cpc_subclass CPC group id: https://www.uspto.gov/web/patents/classification/cpc/html/cpc.html Y02T all varchar(16)
cpc_group CPC subgroup id: https://www.uspto.gov/web/patents/classification/cpc/html/cpc.html Y02T10/7291 all varchar(16)
cpc_type Category of CPC classification "Inventional" or "Additional" additional all varchar(16)
 
pg_cpc_title CPC group classification at issue of the patent
Data Element Name Definition Example Years Present Type
cpc_subclass CPC group id A01B all varchar(20)
cpc_subclass_title description of CPC group SOIL WORKING IN AGRICULTURE OR FORESTRY; all varchar(512)
cpc_group CPC subgroup id A01B1/00 all varchar(20)
cpc_group_title description of CPC subgroup Hand tools all mediumtext
cpc_class CPC subsection id A01 all varchar(20)
cpc_class_title description of CPC subsection AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING all varchar(512)
 
pg_detail_desc_text_{year} Text of detailed description of application for a given year
Data Element Name Definition Example Years Present Type
pgpub_id document number 20080000006 all bigint(16)
description_text full text of the detailed description This view also illustrates optional shoulder… all longtext
description_length length of the text field 72825 all bigint(16)
 
pg_draw_desc_text_{year} Text of description of the application's drawings
Data Element Name Definition Example Years Present Type
pgpub_id document number 20150007372 all bigint(16)
draw_desc_sequence order in which drawing descriptions appear in application file, often the same as the figure id; sequence for this table begins at {1}. 15 all int(11)
draw_desc_text text of the drawing descriptions FIG. 22 is a simplified enlarged view… all mediumtext
 
pg_foreign_priority Information about an earlier patent filing in a foreign country which gives the claim priority 
Data Element Name Definition Example Years Present Type
pgpub_id document number 20110296671 all bigint(16)
priority_claim_sequence order in which priority claims appear in publication; sequence for this table begins at {1} 1 all int(11)
priority_claim_kind type of priority claim (international, national, regional) national 2005 and after varchar(20)
foreign_application_id foreign patent application number 93114223 all varchar(256)
filing_date date on which the foreign document was filed 2004-05-20 2002 and after date
foreign_country_filed country in which the foreign document was filed TW all varchar(256)
 
pg_gov_interest Mapping of patent numbers to raw government interest text
Data Element Name Definition Example Years Present Type
pgpub_id document number 20100273145 all bigint(16)
gi_statement raw government interest text The invention herein described was made in the course of work under a grant or award from the U.S. Army. all mediumtext
 
pg_gov_interest_contracts Mapping of Federal contract award numbers to patent numbers
Data Element Name Definition Example Years Present Type
pgpub_id patent number 3931132 all varchar(20)
contract_award_number federal contract award number 83-568 73 all varchar(255)
 
pg_gov_interest_org Federal agencies with government interests in patents
Data Element Name Definition Example Years Present Type
pgpub_id patent number 8636942 all varchar(255)
gi_organization_id unique organization id 1 all int(11)
fedagency_name name of Federal department, agency, bureau, etc. Department of Veterans Affairs all varchar(255)
level_one position of the focal agency in a hierarchical set of relationships with the parent agency (e.g., DHHS) at level_one and child agencies at lower levels (e.g., NIH at level_two and NIDA at level_three). government_organization.name is equal to one and only one of the values in level_one, level_two, and level_three Department of Veterans Affairs all varchar(255)
level_two position of the focal agency in a hierarchical set of relationships with the parent agency (e.g., DHHS) at level_one and child agencies at lower levels (e.g., NIH at level_two and NIDA at level_three). government_organization.name is equal to one and only one of the values in level_one, level_two, and level_three U.S. Census Bureau all varchar(255)
level_three position of the focal agency in a hierarchical set of relationships with the parent agency (e.g., DHHS) at level_one and child agencies at lower levels (e.g., NIH at level_two and NIDA at level_three). government_organization.name is equal to one and only one of the values in level_one, level_two, and level_three Advanced Technology Program all varchar(255)
 
pg_inventor_disambiguated Disambiguated inventor data. Methodology description can be found here
Data Element Name Definition Example Years Present Type
pgpub_id document number 20150287775 all bigint(16)
inventor_sequence order in which the inventor appears in patent file; sequence for this table begins at {0}. 2 all int(11)
inventor_id unique inventor id generated by the disambiguation algorithm fl:n_ln:benichou-1 all varchar(64)
disambig_inventor_name_first first name of the inventor Shigeharu all varchar(256)
disambig_inventor_name_last last name of the inventor Yamagami all varchar(256)
gender_code inventor attributed gender (F = female, M = male, U = no gender attributed) U all varchar(1)
location_id inventor's location. If the inventor's sequence number is 0, then his/her location is the "location" of the related patent 00000249-bdce-11ea-8a73-121df0c29c1e all varchar(128)
 
pg_inventor_not_disambiguated Raw inventor information as it appears in the source XML files
Data Element Name Definition Example Years Present Type
pgpub_id document number 20150287775 all bigint(16)
inventor_sequence order in which the inventor appears in patent file; sequence for this table begins at {0}. 2 all int(11)
inventor_id unique inventor id generated by the disambiguation algorithm fl:n_ln:benichou-1 all varchar(64)
raw_inventor_name_first first name of the inventor Shigeharu all varchar(256)
raw_inventor_name_last last name of the inventor Yamagami all varchar(256)
deceased_flag flag determining whether the inventor is deceased or not FALSE all boolean
rawlocation_id inventor's location. If the inventor's sequence number is 0, then his/her location is the "location" of the related patent 00000249-bdce-11ea-8a73-121df0c29c1e all varchar(128)
 
pg_ipc_at_issue International Patent Classification
Data Element Name Definition Example Years Present Type
pgpub_id patent number 20080166814 all varchar(20)
ipc_sequence order in which IPC class appears in patent file; sequence for this table begins at {1}. 2 all int(11)
classification_level IPC classification level (A = advanced level)  A 2006 and after varchar(256)
section IPC section (A = Human Necessities, B = Performing Operations; Transporting, C = Chemistry; Metallurgy, D = Textiles; Paper, E = Fixed Constructions, F = Mechanical Engineering; Lighting; Heating; Weapons; Blasting, G = Physics, H = Electricity) G all varchar(256)
ipc_class IPC class 01 all varchar(256)
subclass IPC subclass N all varchar(256)
main_group IPC group 33 all varchar(256)
subgroup IPC subgroup 00 all varchar(256)
classification_value IPC classification value ( I = invention information; N = non-invention information) I 2006 and after varchar(256)
classification_status IPC classification status ( B = Basic or Original) B 2006 and after varchar(256)
classification_data_source IPC classification data source ( H = Human - Generated; M = Machine - Generated; G = Generated via Software) H all varchar(256)
action_date issue date of the patent grant 2008-07-10 2006 and after date
ipc_version_indicator IPC version indicator 2006-01-01 2006 and after date
 
pg_location_disambiguated Disambiguated location data for inventors, assignees, and applicants
Data Element Name Definition Example Years Present Type
location_id unique location id generated by the disambiguation algorithm 0009wn7out97 all varchar(128)
disambig_city city associated with a location_id Stewart all varchar(128)
disambig_state state associated with a location_id MN all varchar(20)
disambig_country country associated with a location_id US all varchar(16)
latitude latitude for location_id 44.7247 all float
longitude longitude for location_id -94.4858 all float
county county associated with location_id Mcleod all varchar(60)
state_fips state FIPS code associated with a location_id 27 all varchar(2)
county_fips county FIPS code associated with a location_id 85 all varchar(6)
 
pg_location_not_disambiguated Raw location information as it appears in the source XML files
Data Element Name Definition Example Years Present Type
rawlocation_id unique id 000035ewn5fb3oerojqwcw4hg all varchar(128)
location_id unique location id generated by the disambiguation algorithm q6d41gon4fey all varchar(128)
raw_city city associated with a location_id Colorado Springs all varchar(128)
raw_state state associated with a location_id CO all varchar(20)
raw_country country associated with a location_id US all varchar(10)
 
pg_pct_data PCT or regional filing data
Data Element Name Definition Example Years Present Type
pgpub_id document number 20080163436 all varchar(20)
published_or_filed_date date published or filed 2005-11-21 all date
pct_371_date PCT Section 371(c)(1)(2)(4)date, date when pct application was filed 2007-07-16 all date
pct_102_date 35 U.S.C. § 102(e) date for pct applications 2015-08-11 2001 and after date
filed_country country filed in WO 2002 and after varchar(256)
application_kind kind of application (numerical for filed, alphabetical for published) 00 2002 and after varchar(20)
pct_doc_number pct document number PCT/EP2005/056103 all varchar(128)
pct_doc_type whether the document has been published or just filed pct_application 2002 and after varchar(20)
 
pg_persistent_assignee Disambiguated Assignee ID across data updates
Data Element Name Definition Example Years Present Type
pgpub_id document number 20060263593 all bigint(16)

disamb_assignee_id_xxxxxxxx

the disambiguated assignee ids associated with the raw assignee on a given pgpub document with the given sequence during the data updates matching the date stamps in the field names. Data on this table goes back to the 20201229 update. 000009fd-0268-46a4-b493-c8e5f7672331 all varchar(64)
assignee_sequence order in which assignee appears in application file; sequence for this table begins at {1}. 2 all int(11)
 
pg_persistent_inventor Disambiguated Inventor ID across data updates
Data Element Name Definition Example Years Present Type
pgpub_id document number 20060263593 all bigint(16)
disamb_inventor_id_xxxxxxxx the disambiguated inventor ids associated with the raw inventor on a given pgpub document with the given sequence during the data updates matching the date stamps in the field names. Data on this table goes back to the 20210330 update. fl:n_ln:benichou-1 all varchar(64)
inventor_sequence order in which inventor appears in application file; sequence for this table begins at {0}. 3 all int(11)
 
pg_published_application Data concerning pre-granted patent applications
Data Element Name Definition Example Years Present Type
pgpub_id document number 20100071101 all bigint(16)
application_id unique application identifying number 12283848 all varchar(16)
filing_date date of application filing 2008-09-15 all date 
patent_type category of application ('utility' or 'plant') plant all varchar(20)
filing_type type of publication filing us-republication-corrected all varchar(128)
published_date date of publication 2014-07-31 all date
wipo_kind kind code of publication: https://www.uspto.gov/learning-and-resources/support-centers/electronic-business-center/kind-codes-included-uspto-patent A9 all varchar(8)
series_code unique application identifying number 12 all int(11)
application_title title of application Japanese maple plant named 'King's Blood' all mediumtext
application_abstract abstract of application  A new and distinct cultivar of Japanese … all mediumtext
rule_47_flag flag for inventor who was unable to be contacted at filing of application FALSE all varchar(8)
filename name of XML file from which the data was parsed ipa100318.xml all varchar(16)
 
pg_rel_app_text Description of related patent applications
Data Element Name Definition Example Years Present Type
pgpub_id document number 20200053967 all bigint(16)
rel_app_text text of the description of the related application in question This application claims the benefit … all mediumtext
 

pg_us_rel_doc

US Related Documents for patent applications
Data Element Name Definition Example Years Present Type
pgpub_id patent number 9683848 2005 and after varchar(20)
related_doc_type defines the type of documentation continuation-in-part 2005 and after varchar(64)
related_doc_kind kind of document (text) parent document 2005 and after varchar(64)
related_doc_number related document number 13336060 2005 and after varchar(64)
published_country country where related document was published US 2005 and after varchar(20)
related_doc_published_date publication date of the related document 12/23/2011 2005 and after date
related_doc_sequence order in which the related document appears in patent file; sequence for this table begins at {1}. 36 2005 and after int(11)
 
pg_uspc_at_issue USPTO patent classification at patent issue date
Data Element Name Definition Example Years Present Type
pgpub_id document number 20060263593 all (incomplete for 2002-2004) bigint(16)
uspc_sequence order in which uspc class appears in patent file; sequence for this table begins at {1}. 7 all (incomplete for 2002-2004) int(11)
uspc_mainclass_id uspc mainclass at issue 313 all (incomplete for 2002-2004) varchar(16)
uspc_mainclass_title description of uspc mainclass PRESSES all (incomplete for 2002-2004) varchar(256)
uspc_subclass_id uspc subclass at issue 313/116000 all (incomplete for 2002-2004) varchar(16)
uspc_subclass_title description of uspc subclass BINDING264 all (incomplete for 2002-2004) varchar(512)
 
pg_wipo_technology WIPO technology classification of the patent
Data Element Name Definition Example Years Present Type
pgpub_id document number 3930271 all varchar(20)
wipo_field_sequence order in which WIPO technology field appears on patent; sequence for this table begins at {0}. 0 all bigint(20)
wipo_field_id WIPO technology field id as derived from crosswalk http://www.wipo.int/export/sites/www/ipstats/en/statistics/patents/xls/ipc_technology.xls 33 all double
wipo_sector_title WIPO technology sector title Electrical engineering all varchar(60)
wipo_field_title WIPO technology field title Electrical machinery, apparatus, energy all varchar(255)