
Pre-Grant Data Download Dictionary

The pre-grant publications data includes all publications released by the USPTO for download from 2005 through the most recent data update.

granted_patent_crosswalk Table mapping patent applications to their corresponding granted patent numbers
Data Element Name Definition Example Years Present Type
id unique id  c0f8689d-e944-11ea-a344-121df0c29c1e all varchar(128)
document_number document number 20100273145 all bigint(16)
patent_number patent number 9134302 all varchar(20)
application_number application number 8479995 all varchar(16)
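
A minimal sketch of how this crosswalk might be used to look up the granted patent for a published application. It assumes the table is delivered as a tab-delimited file with a header row naming the columns above; the delimiter, the in-memory sample, and the helper name `load_crosswalk` are illustrative assumptions and are not specified by this dictionary.

```python
import csv
import io

# One-row sample built from the example values in the table above.
# Assumption: the download is tab-delimited with a header row.
SAMPLE_TSV = (
    "id\tdocument_number\tpatent_number\tapplication_number\n"
    "c0f8689d-e944-11ea-a344-121df0c29c1e\t20100273145\t9134302\t8479995\n"
)


def load_crosswalk(handle):
    """Return a dict mapping document_number (int) to patent_number (str)."""
    reader = csv.DictReader(handle, delimiter="\t")
    # patent_number is a varchar column, so it is kept as a string.
    return {int(row["document_number"]): row["patent_number"] for row in reader}


crosswalk = load_crosswalk(io.StringIO(SAMPLE_TSV))
print(crosswalk.get(20100273145))
```

Applications that have not been granted would simply be absent from the mapping, so `dict.get` returns `None` for them.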
 
application Data concerning pre-grant patent applications
Data Element Name Definition Example Years Present Type
id unique id 0008dcaa-ccf9-11ea-ba95-121df0c29c1e all varchar(128)
document_number document number 20100071101 all bigint(16)
type category of application ('utility' or 'plant') plant all varchar(20)
application_number unique application identifying number 12283848 all varchar(16)
date date of application filing 2008-09-15 all date
country country this application was filed in US all varchar(128)
series_code series code of the application number (its leading two digits) 12 all int(11)
invention_title title of invention Japanese maple plant named 'King's Blood' all mediumtext
invention_abstract abstract of invention  A new and distinct cultivar of Japanese … all mediumtext
rule_47_flag flag for inventor who was unable to be contacted at filing of application FALSE all varchar(8)
filename name of XML file from which the data was parsed ipa100318.xml all varchar(16)
 
brf_sum_text_{year} Text of application summary for a given year
Data Element Name Definition Example Years Present Type
id unique id 141fe23f-d1b8-11ea-ba95-121df0c29c1e all varchar(512)
document_number document number 20050011018 all bigint(16)
text text of the summary itself The present disclosure is directed .. all mediumtext
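
The `{year}` suffix means each year's text lives in its own table. The sketch below derives a table name from a document number. It relies on two assumptions that this dictionary does not state: that the first four digits of the 11-digit document number are the publication year, and that the table suffix is that same year. The helper name `year_table` is hypothetical.

```python
def year_table(base, document_number):
    """Build a year-suffixed table name such as 'brf_sum_text_2005'.

    Assumption: the leading four digits of the document number give the
    year used in the table suffix.
    """
    digits = str(document_number)
    if len(digits) != 11 or not digits.isdigit():
        raise ValueError(f"expected an 11-digit document number, got {document_number!r}")
    return f"{base}_{digits[:4]}"


print(year_table("brf_sum_text", 20050011018))
```

The same helper would apply to the other year-suffixed tables (claim_{year}, detail_desc_text_{year}, draw_desc_text_{year}) if the assumption holds for them.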
 
claim_{year} Full text of application claims, including dependency and sequence
Data Element Name Definition Example Years Present Type
id unique id 16b5106a-d1b8-11ea-ba95-121df0c29c1e all varchar(512)
document_number document number 20050010988 all bigint(16)
text claim text 2 . A garment according to claim 1 , wherein: … all mediumtext
dependent sequence number of the claim this claim depends on; NULL if the claim is independent 1 all varchar(512)
sequence order in which claims appear in patent file 2 all int(11)
num contains claim number of claim formatted as 0-prefixed 5 digit number 00002 all varchar(128)
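
A short sketch of how the `dependent`, `sequence`, and `num` fields might be combined to rebuild claim dependencies. It assumes that `dependent` holds a single integer when present and that NULL arrives as `None`, an empty string, or the literal text "NULL", depending on how the file was exported. The first sample row is hypothetical; the second uses the example values above.

```python
def claim_number(num):
    """Convert the zero-padded `num` string (e.g. '00002') to an int."""
    return int(num)


def is_independent(dependent):
    """Treat None, '' and 'NULL' as the NULL marker for independent claims."""
    return dependent is None or dependent.strip().upper() in ("", "NULL")


def dependency_map(rows):
    """Map each claim's sequence to the sequence it depends on (None if independent)."""
    result = {}
    for row in rows:
        parent = None if is_independent(row["dependent"]) else int(row["dependent"])
        result[int(row["sequence"])] = parent
    return result


rows = [
    {"sequence": "1", "num": "00001", "dependent": None},  # hypothetical row
    {"sequence": "2", "num": "00002", "dependent": "1"},   # example row above
]
print(dependency_map(rows))
```

Because `dependent` is a varchar column, real data may contain values that are not a single integer; a production version would need to handle those cases explicitly.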
 
cpc CPC classification at the time the application was submitted
Data Element Name Definition Example Years Present Type
id unique id 002b586d-bdb4-11ea-8a73-121df0c29c1e all varchar(512)
document_number document number 20170047568 all bigint(16)
sequence order in which the cpc classifications occur in the raw XML file 14 all int(11)
version date corresponding to the version under which the classification was made 2013-01-01 all date
section_id cpc section (A = Human Necessities, B = Performing Operations; Transporting, C = Chemistry; Metallurgy, D = Textiles; Paper, E = Fixed Constructions, F = Mechanical Engineering; Lighting; Heating; Weapons; Blasting Engines or Pumps, G = Physics, H = Electricity, Y = General Tagging of New Technological Developments) B all varchar(16)
subsection_id cpc section id plus cpc class value B65 all varchar(16)
group_id cpc group id: https://www.uspto.gov/web/patents/classification/cpc/html/cpc.html B65H all varchar(16)
subgroup_id cpc subgroup id: https://www.uspto.gov/web/patents/classification/cpc/html/cpc.html B65H35/06 all varchar(16)
symbol_position “F” defining “first” for the sole or first “invention information” CPC, or “L” defining “later” for any second and succeeding “invention information” CPC and for any “noninvention information” CPC L all varchar(4)
value "I" for inventional, "A" for additional I all varchar(4)
category Category of CPC classification "Inventional" or "Additional" Inventional all varchar(16)
action_date Date corresponding to the approval of the CPC classification 2017-02-16 all date
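
The four identifier columns nest inside one another, as the examples B, B65, B65H, and B65H35/06 show. The sketch below recovers the coarser identifiers from a `subgroup_id`. It assumes the fixed layout seen in the examples (one section letter, two class digits, one subclass letter, then group/subgroup digits); the function name and the validation pattern are illustrative only.

```python
import re

# Assumed layout, based on the example values: letter, 2 digits, letter, digits "/" digits.
CPC_PATTERN = re.compile(r"[A-HY]\d{2}[A-Z]\d+/\d+")


def split_cpc_subgroup(subgroup_id):
    """Split e.g. 'B65H35/06' into the identifiers used in the table above."""
    if not CPC_PATTERN.fullmatch(subgroup_id):
        raise ValueError(f"unexpected CPC subgroup format: {subgroup_id!r}")
    return {
        "section_id": subgroup_id[:1],
        "subsection_id": subgroup_id[:3],
        "group_id": subgroup_id[:4],
        "subgroup_id": subgroup_id,
    }


print(split_cpc_subgroup("B65H35/06"))
```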
 
cpc_current Current CPC classification of the application
Data Element Name Definition Example Years Present Type
id unique id 00000e67-b7d6-11ea-83c3-121df0c29c1e all varchar(128)
document_number document number 20190113923 all bigint(16)
sequence order in which the cpc classifications occur in the raw XML file 27 all int(11)
version date corresponding to the version under which the classification was made 2013-01-01 all date
section_id cpc section (A = Human Necessities, B = Performing Operations; Transporting, C = Chemistry; Metallurgy, D = Textiles; Paper, E = Fixed Constructions, F = Mechanical Engineering; Lighting; Heating; Weapons; Blasting Engines or Pumps, G = Physics, H = Electricity, Y = General Tagging of New Technological Developments) Y all varchar(16)
subsection_id cpc section id plus cpc class value Y02 all varchar(16)
group_id cpc group id: https://www.uspto.gov/web/patents/classification/cpc/html/cpc.html Y02T all varchar(16)
subgroup_id cpc subgroup id: https://www.uspto.gov/web/patents/classification/cpc/html/cpc.html Y02T10/7291 all varchar(16)
symbol_position “F” defining “first” for the sole or first “invention information” CPC, or “L” defining “later” for any second and succeeding “invention information” CPC and for any “noninvention information” CPC L all varchar(4)
value "I" for inventional, "A" for additional A all varchar(4)
category Category of CPC classification "Inventional" or "Additional" additional all varchar(16)
 
detail_desc_text_{year} Text of detailed description of application for a given year
Data Element Name Definition Example Years Present Type
id unique id d5a51b64-cc8d-11ea-ba95-121df0c29c1e all varchar(512)
document_number document number 20080000006 all bigint(16)
text full text of the detailed description This view also illustrates optional shoulder… all longtext
length length of the text field 72825 all bigint(16)
 
draw_desc_text_{year} Text of description of the application's drawings
Data Element Name Definition Example Years Present Type
id unique id d9536334-bd13-11ea-8a73-121df0c29c1e all varchar(512)
document_number document number 20150007372 all bigint(16)
text text of the drawing descriptions FIG. 22 is a simplified enlarged view… all mediumtext
sequence order in which drawing descriptions appear in application file, often the same as the figure id 15 all int(11)
 
foreign_priority Information about an earlier patent filing in a foreign country from which the application claims priority
Data Element Name Definition Example Years Present Type
id unique id  00765c3d-cd58-11ea-ba95-121df0c29c1e all varchar(128)
document_number document number 20110296671 all bigint(16)
country country in which the foreign document was filed TW all varchar(256)
date date on which the foreign document was filed 2004-05-20 2002 and after date
foreign_doc_number foreign patent application number 93114223 all varchar(256)
kind type of priority claim (international, national, regional) national 2005 and after varchar(20)
 
ipcr International Patent Classification
Data Element Name Definition Example Years Present Type
id unique id 00004129-cca5-11ea-ba95-121df0c29c1e all varchar(128)
document_number document number 20080166814 all bigint(16)
sequence order in which ipc class appears in patent file 2 all int(11)
version ipc version indicator 2006-01-01 2006 and after date
class_level ipc classification level (A = advanced level)  A 2006 and after varchar(256)
section ipc section (A = Human Necessities, B = Performing Operations; Transporting, C = Chemistry; Metallurgy, D = Textiles; Paper, E = Fixed Constructions, F = Mechanical Engineering; Lighting; Heating; Weapons; Blasting, G = Physics, H = Electricity) G all varchar(256)
class ipc class 01 all varchar(256)
subclass ipc subclass N all varchar(256)
main_group ipc group 33 all varchar(256)
subgroup ipc subgroup 00 all varchar(256)
symbol_position ipc symbol ( F = first or sole invention information IPC; L = any second or succeeding invention information IPC and any non-invention information IPC) L 2006 and after varchar(256)
class_value ipc classification value ( I = invention information; N = non-invention information) I 2006 and after varchar(256)
action_date issue date of the patent grant 2008-07-10 2006 and after date
class_status ipc classification status ( B = Basic or Original) B 2006 and after varchar(256)
class_data_source ipc classification data source ( H = Human - Generated; M = Machine - Generated; G = Generated via Software) H all varchar(256)
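
Unlike the CPC tables, this table stores each part of the IPC symbol in its own column. A minimal sketch of reassembling them follows; the output uses the conventional printed form with a space before the main group (for example G01N 33/00), which is a formatting choice made here and not something this dictionary prescribes. The function name is hypothetical.

```python
def ipc_symbol(section, ipc_class, subclass, main_group, subgroup):
    """Join the separate ipcr columns into a single symbol string."""
    # Values are kept as strings so leading zeros (e.g. class '01') survive.
    return f"{section}{ipc_class}{subclass} {main_group}/{subgroup}"


# Example values taken from the table above.
print(ipc_symbol("G", "01", "N", "33", "00"))
```

Reading these columns as integers would silently turn `01` into `1` and `00` into `0`, so it is safer to load them as text.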
 
pct_data PCT or regional filing data
Data Element Name Definition Example Years Present Type
id unique id 0032334f-cca5-11ea-ba95-121df0c29c1e all varchar(128)
document_number document number 20080163436 all bigint(16)
pct_doc_number pct document number PCT/EP2005/056103 all varchar(128)
country country filed in WO 2002 and after varchar(256)
date date published or filed 2005-11-21 all date
us_371c124_date PCT Section 371(c)(1)(2)(4) date, date when pct application was filed 2007-07-16 all date
us_371c12_date 35 U.S.C. § 102(e) date for pct applications 2015-08-11 2001 and after date
kind kind of application (numerical for filed, alphabetical for published) 00 2002 and after varchar(20)
doc_type whether the document has been published or just filed pct_application 2002 and after varchar(20)
 
publication Data concerning published applications
Data Element Name Definition Example Years Present Type
id unique id  002e7211-bcfa-11ea-8a73-121df0c29c1e all varchar(128)
document_number document number 20140210614 all bigint(16)
date date of publication 2014-07-31 all date
country country in which application was published US all varchar(128)
kind kind code of publication: https://www.uspto.gov/learning-and-resources/support-centers/electronic-business-center/kind-codes-included-uspto-patent A9 all varchar(8)
filing_type type of publication filing us-republication-corrected all varchar(128)
 
rawassignee Raw assignee information as it appears in the source XML files
Data Element Name Definition Example Years Present Type
id unique id 003ca929-cca5-11ea-ba95-121df0c29c1e all varchar(128)
document_number document number 20080163497 all bigint(16)
assignee_id unique assignee ID generated by the disambiguation algorithm 000009fd-0268-46a4-b493-c8e5f7672331 all varchar(64)
sequence order in which assignee appears in patent file 1 all int(11)
name_first first name, if assignee is individual Douglas Norman all varchar(256)
name_last last name, if assignee is individual Hartz all varchar(256)
organization organization name if assignee is organization BIC-Violex SA all varchar(256)
type assignee type (1- Unassigned, 2 - US Company or Corporation, 3 - Foreign Company or Corporation, 4 - US Individual, 5 - Foreign Individual, 6 - US Federal Government, 7 - Foreign Government, 8 - US County Government, 9 - US State Government. Note: A "1" appearing before any of these codes signifies part interest) 4 2002 and after int(11)
rawlocation_id assignee's location. If the assignee's sequence number is 1, then his/her location is the "location" of the related patent 78b9e85a-3bc6-11eb-a3cd-121df0c29c1e all varchar(128)
city city of the assignee at the time the application was filed Chambersburg all varchar(256)
state state of the assignee at the time the application was filed PA all varchar(256)
country country of the assignee at the time the application was filed US all varchar(256)
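
The `type` code packs two pieces of information: the assignee category and, through an optional leading "1", whether the assignee holds only a part interest. The sketch below decodes it under the assumption that part-interest codes are always two digits (for example 14 for a US individual with part interest); the function name is illustrative.

```python
ASSIGNEE_TYPES = {
    1: "Unassigned",
    2: "US Company or Corporation",
    3: "Foreign Company or Corporation",
    4: "US Individual",
    5: "Foreign Individual",
    6: "US Federal Government",
    7: "Foreign Government",
    8: "US County Government",
    9: "US State Government",
}


def decode_assignee_type(code):
    """Return (label, part_interest) for a rawassignee `type` value."""
    digits = str(int(code))
    # Assumption: a leading "1" on a two-digit code signifies part interest.
    part_interest = len(digits) == 2 and digits[0] == "1"
    base = int(digits[1]) if part_interest else int(digits)
    if base not in ASSIGNEE_TYPES:
        raise ValueError(f"unrecognized assignee type: {code!r}")
    return ASSIGNEE_TYPES[base], part_interest


print(decode_assignee_type(4))
print(decode_assignee_type(14))
```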
 
rawinventor Raw inventor information as it appears in the source XML files
Data Element Name Definition Example Years Present Type
id unique id 00005095-bd4c-11ea-8a73-121df0c29c1e all varchar(128)
document_number document number 20150287775 all bigint(16)
inventor_id unique inventor id generated by the disambiguation algorithm fl:n_ln:benichou-1 all varchar(64)
name_first first name of the inventor Shigeharu all varchar(256)
name_last last name of the inventor Yamagami all varchar(256)
sequence order in which the inventor appears in patent file 2 all int(11)
designation designation of the inventor us-only all varchar(32)
deceased flag determining whether the inventor is deceased or not FALSE all boolean
rawlocation_id inventor's location. If the inventor's sequence number is 0, then his/her location is the "location" of the related patent 00000249-bdce-11ea-8a73-121df0c29c1e all varchar(128)
city city of the inventor at the time the application was filed Blacksburg all varchar(256)
state state of the inventor at the time the application was filed VA all varchar(256)
country country of the inventor at the time the application was filed US all varchar(256)
 
rawlocation Raw location information as it appears in the source XML files
Data Element Name Definition Example Years Present Type
id unique id 00003475-bdce-11ea-8a73-121df0c29c1e all varchar(512)
location_id unique location id generated by the disambiguation algorithm 8bbe6857-791f-11eb-bfee-121df0c29c1e all varchar(128)
city city  Rancho Santa Margarita all varchar(256)
state state CA all varchar(256)
country country US all varchar(256)
 
rel_app_text Description of related patent applications
Data Element Name Definition Example Years Present Type
id unique id  d600a678-bc7a-11ea-8a73-121df0c29c1e all varchar(512)
document_number document number 20200053967 all bigint(16)
text text of the description of the related application in question This application claims the benefit … all mediumtext
 
us_parties US party and applicant information as it appears in the source XML files
Data Element Name Definition Example Years Present Type
id unique id 00391dba-0d76-11eb-a344-121df0c29c1e all varchar(128)
document_number document number 20200075150 all bigint(16)
name_first first name of the party/applicant, if an individual Janet all varchar(256)
name_last last name of the party/applicant, if an individual McCallister all varchar(256)
organization organization of the party/applicant, if applicable FUJIFILM Corporation all varchar(256)
type type of the party/applicant applicant all varchar(64)
designation designation of the party/applicant us-only all varchar(32)
sequence order in which the party/applicant appears in the source XML file 3 all int(11)
rawlocation_id unique location id of the party/applicant 0000c51a-0d76-11eb-a344-121df0c29c1e all varchar(128)
city city of the party/applicant at the time the application was filed Tallahassee all varchar(256)
state state of the party/applicant at the time the application was filed FL all varchar(256)
country country of the party/applicant at the time the application was filed US all varchar(256)
 
uspc USPTO patent classification at patent issue date
Data Element Name Definition Example Years Present Type
id unique id 00008098-d1fb-11ea-ba95-121df0c29c1e all varchar(128)
document_number document number 20060263593 all bigint(16)
mainclass_id uspc mainclass at issue 313 all varchar(16)
subclass_id uspc subclass at issue 313/116000 all varchar(16)
sequence order in which uspc class appears in patent file 7 all int(11)
 
usreldoc US Related Documents for applications
Data Element Name Definition Example Years Present Type
id unique id 002832bb-c5e5-11ea-ba95-121df0c29c1e 2002 and after varchar(128)
document_number document number 20170290264 2002 and after bigint(16)
related_doc_number related document number  14857618 2002 and after varchar(32)
country country where related document was published US 2002 and after varchar(128)
doc_type defines the type of documentation continuation 2002 and after varchar(128)
relkind kind of document (text) parent-doc 2002 and after varchar(128)
date publication date of the related document 2015-09-17 2002 and after date