Data Download Tables

About Bulk Download Database Tables

PatentsView offers publicly accessible patent research data sets with detailed documentation. The database tables can be bulk downloaded as individual tab-delimited files for programmers and researchers who prefer to work with the data in their native environments.
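
As a minimal sketch of working with one of these downloads locally, the Python below streams rows out of a zipped tab-delimited file without unpacking it to disk. The helper name `iter_tsv_rows`, the assumption that each archive holds a single `.tsv` member with a header row, and the UTF-8 encoding are illustrative guesses rather than details confirmed by this page.

```python
import csv
import io
import zipfile


def iter_tsv_rows(zip_path, encoding="utf-8"):
    """Yield each row of the first .tsv member of a zip archive as a dict.

    Assumes (not confirmed by the source page) that the member has a
    header row and tab-separated fields.
    """
    with zipfile.ZipFile(zip_path) as archive:
        # Pick the first member whose name ends in .tsv (assumed layout).
        member = next(n for n in archive.namelist() if n.endswith(".tsv"))
        with archive.open(member) as raw:
            # Decode bytes lazily so large tables are never fully in memory.
            text = io.TextIOWrapper(raw, encoding=encoding, newline="")
            yield from csv.DictReader(text, delimiter="\t")
```

A caller might then loop with `for row in iter_tsv_rows("assignee.tsv.zip"): ...`, where the file name is hypothetical. Streaming matters here because several tables listed below run to multiple GiB once uncompressed.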

Table Name Description # of Rows Origin Data Last Updated
pregrant applications (PGPubs) Information on the published applications (PGPubs)   raw & disamb  
application
zip: 77.5 MiB, tsv: 422.8 MiB
Information on the applications for granted patent 7,903,067 raw January 18, 2022
assignee
zip: 18.8 MiB, tsv: 39.2 MiB
Disambiguated assignee data for granted patents and pre-granted applications 538,617 disamb January 18, 2022
botanic
zip: 635.7 KiB, tsv: 1.3 MiB
Botanic information for plant patents 18,065 raw January 18, 2022
brf_sum_text Brief summary text   raw  
claim Full text of patent claims, including dependency and sequence   raw  
cpc_current
zip: 1.5 GiB, tsv: 4.0 GiB
Current CPC classification data for all patents (applied retrospectively to all patents) 45,263,138 raw (from separate classification files) January 18, 2022
cpc_group
zip: 21.5 KiB, tsv: 67.9 KiB
Lookup table of current CPC groups 673 raw (from separate classification files) January 18, 2022
cpc_subgroup
zip: 5.4 MiB, tsv: 61.1 MiB
Lookup table of current CPC subgroups 261,115 raw (from separate classification files) January 18, 2022
cpc_subsection
zip: 3.2 KiB, tsv: 7.9 KiB
Lookup table of current CPC subsections 136 raw (from separate classification files) August 11, 2021
detail_desc_text Detailed patent description text   raw  
draw_desc_text Drawing description text   raw  
foreign_priority
zip: 126.4 MiB, tsv: 302.1 MiB
Foreign priority data 3,668,091 raw January 18, 2022
figures
zip: 169.6 MiB, tsv: 300.8 MiB
Number of figures and sheets 7,366,137 raw January 18, 2022
foreigncitation
zip: 1.1 GiB, tsv: 2.8 GiB
Citations made to foreign patents by US patents 33,100,597 raw January 18, 2022
government_interest
zip: 5.0 MiB, tsv: 35.5 MiB
Raw government interest statements on all patents (where available) 159,086 raw January 18, 2022
government_organization
zip: 5.9 KiB, tsv: 34.1 KiB
Organization names and related agency hierarchy parsed from the government interest statements on all patents (where available) 297 processed August 11, 2021
inventor
zip: 44.8 MiB, tsv: 205.4 MiB
Disambiguated inventor data for granted patents and pre-granted applications 4,483,943 disamb January 18, 2022
ipcr
zip: 631.2 MiB, tsv: 1.8 GiB
International Patent Classification data for all patents (as of publication date) 19,161,256 raw January 18, 2022
lawyer
zip: 5.9 MiB, tsv: 12.8 MiB
Disambiguated lawyer data 179,697 disamb January 18, 2022
location
zip: 5.2 MiB, tsv: 19.3 MiB
Disambiguated location data, including latitude and longitude for granted patents and pre-granted applications 242,926 disamb January 18, 2022
mainclass
zip: 2.4 KiB, tsv: 7.1 KiB
Lookup table of original USPC main classes (as of patent publication date) 1,239 raw August 11, 2021
mainclass_current
zip: 7.5 KiB, tsv: 21.5 KiB
Lookup table of current USPC main technology classes (applied retrospectively to all patents) 510 raw (from separate classification files) August 11, 2021
nber
zip: 115.3 MiB, tsv: 228.9 MiB
NBER classification data for all patents up to May 2015 5,105,937 raw (from separate classification files) August 11, 2021
nber_category
zip: 222.0 B, tsv: 113.0 B
Lookup table for NBER categories 7 raw (from separate classification files) August 11, 2021
nber_subcategory
zip: 625.0 B, tsv: 928.0 B
Lookup table for NBER subcategories 38 raw (from separate classification files) August 11, 2021
non_inventor_applicant
zip: 251.7 MiB, tsv: 536.9 MiB
Non-inventor applicant information 4,753,137 raw January 18, 2022
otherreference
zip: 3.9 GiB, tsv: 8.1 GiB
Non-patent citations mentioned in patents (e.g. articles, papers, etc.) 47,761,497 raw January 18, 2022
patent
zip: 1.5 GiB, tsv: 5.8 GiB
Data on granted patents 7,905,326 raw January 18, 2022
patent_assignee
zip: 145.7 MiB, tsv: 611.9 MiB
Metadata table for many-to-many relationships 7,304,703 disamb (linking table) January 18, 2022
patent_contractawardnumber
zip: 1.4 MiB, tsv: 4.7 MiB
Contract or award numbers parsed from the government interest statements on all patents (where available) 191,993 processed January 18, 2022
patent_govintorg
zip: 650.6 KiB, tsv: 2.5 MiB
Metadata table with patent-to-organization relationships linked to the government_organization table 195,065 processed January 18, 2022
patent_inventor
zip: 254.7 MiB, tsv: 1.2 GiB
Metadata table for many-to-many relationships 19,378,853 disamb (linking table) January 18, 2022
patent_lawyer
zip: 116.4 MiB, tsv: 388.6 MiB
Metadata table for many-to-many relationships 9,026,892 disamb (linking table) January 18, 2022
pct_data
zip: 58.4 MiB, tsv: 165.8 MiB
PCT data 1,663,951 raw January 18, 2022
persistent_assignee_disambig
zip: 1.5 GiB, tsv: 766.4 MiB
Persistent Assignee Disambiguation (updated to include missing values in the disamb_assignee_id_20201229 column) 7,304,703 raw January 18, 2022
persistent_inventor_disambig
zip: 536.2 MiB, tsv: 2.8 GiB
Persistent Inventor Disambiguation 18,273,330 raw January 18, 2022
rawassignee
zip: 522.2 MiB, tsv: 1009.7 MiB
Raw assignee information as it appears in the source text and XML files 7,251,050 raw January 18, 2022
rawexaminer
zip: 350.8 MiB, tsv: 753.5 MiB
Raw examiner information 10,659,166 raw January 18, 2022
rawinventor
zip: 1.1 GiB, tsv: 2.2 GiB
Raw inventor information as it appears in the source text and XML files 19,378,853 raw January 18, 2022
rawgender
zip: 42.7 MiB, tsv: 188.4 MiB
Gender assignment on disambiguated inventor data through March 30, 2021 (see the Methods Report) 4,305,415 raw April 14, 2020
rawlawyer
zip: 470.2 MiB, tsv: 977.4 MiB
Raw lawyer information as it appears in the source text and XML files 9,051,384 raw January 18, 2022
rawlocation
zip: 1.0 GiB, tsv: 3.1 GiB
Raw location data for inventors and assignees, as it appears in xml and text source files 31,406,248 raw January 18, 2022
rel_app_text
zip: 221.6 MiB, tsv: 954.4 MiB
Related applications text 2,108,214 raw January 18, 2022
subclass
zip: 599.4 KiB, tsv: 2.6 MiB
Lookup table of original USPC subclasses (as of patent publication date) 272,546 raw January 18, 2022
subclass_current
zip: 2.1 MiB, tsv: 7.3 MiB
Lookup table of current USPC subclasses (applied retrospectively to all patents) 168,048 raw (from separate classification files) August 11, 2021
us_term_of_grant
zip: 94.9 MiB, tsv: 224.3 MiB
U.S. term of grant data 3,921,900 raw January 18, 2022
usapplicationcitation
zip: 2.0 GiB, tsv: 6.0 GiB
Citations made to US patent applications by US patents 50,573,718 raw January 18, 2022
uspatentcitation
zip: 4.5 GiB, tsv: 11.6 GiB
Citations made to US granted patents by US patents 121,137,053 raw January 18, 2022
uspc
zip: 490.6 MiB, tsv: 963.7 MiB
USPC classification data for all patents 18,063,760 raw January 18, 2022
uspc_current
zip: 619.0 MiB, tsv: 1.2 GiB
Current USPC classification data for all patents up to May 2015 22,852,958 raw (from separate classification files) August 11, 2021
usreldoc
zip: 409.1 MiB, tsv: 1.2 GiB
U.S. related documents (post-2005 patents only) 12,245,208 raw January 18, 2022
wipo
zip: 29.7 MiB, tsv: 168.7 MiB
WIPO technology fields for all patents 10,551,403 raw (from separate classification files) January 18, 2022
wipo_field
zip: 1.5 KiB, tsv: 3.7 KiB
Lookup table of WIPO technology fields 70 raw (from separate classification files) August 11, 2021
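
The linking tables listed above (patent_assignee, patent_inventor, patent_lawyer) record many-to-many relationships, so connecting patents to a disambiguated entity goes through them. The sketch below shows that lookup pattern on rows already loaded as dictionaries. The column names `patent_id`, `assignee_id`, `id`, and `organization` and the function name `assignees_by_patent` are assumptions made for illustration and should be checked against the actual file headers.

```python
from collections import defaultdict


def assignees_by_patent(link_rows, assignee_rows):
    """Map each patent ID to the assignee names linked to it.

    link_rows: dicts with hypothetical keys "patent_id" and "assignee_id".
    assignee_rows: dicts with hypothetical keys "id" and "organization".
    """
    # Index the entity table by its key for constant-time lookups.
    names = {row["id"]: row["organization"] for row in assignee_rows}
    result = defaultdict(list)
    for link in link_rows:
        name = names.get(link["assignee_id"])
        if name is not None:  # skip links with no matching assignee row
            result[link["patent_id"]].append(name)
    return dict(result)
```

Indexing the smaller entity table first and streaming the larger linking table through it keeps memory use tied to the entity table's size, which suits the row counts shown above.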

The PatentsView database is created from the U.S. Patent and Trademark Office (USPTO) public bulk data releases available at https://developer.uspto.gov/data. These data releases provide information on published patent applications (since 2001) and granted patents (since 1976). The PatentsView database for patents is available for download upon request as a MySQL dump. The PatentsView database for published applications contains pre-grant publications (PGPubs) of patent applications (see MPEP § 1120) and is available in the top row of the table above. At this time, the published applications database does not contain the earliest years (2001-2005) of data.

For more information, visit the Methods and Sources section of the website.

This work was created through a government contract funded by the Office of the Chief Economist at the U.S. Patent and Trademark Office. Users are free to use, share, or adapt the material for any purpose, subject to the terms of the Creative Commons Attribution 4.0 International License.

Attribution should be given to PatentsView (www.patentsview.org) for use, distribution, or derivative works.