Skip to main content
 
 
 
IN THIS SECTION

Data Download Tables

About Bulk Download Database Tables

PatentsView offers publicly accessible patent research data sets with detailed documentation. PatentsView database tables can be bulk downloaded as individual files in a tab- delimited format for programmers and researchers who prefer to work with the data in their native environments.

Table Name Description # of Rows Origin Data Last Updated
pregrant applications (PGPubs) Information on the published applications (PGPubs)   raw & disamb  
application
zip: 75.6 MiB, tsv: 412.9 MiB
Information on the applications for granted patent 7,718,333 raw July 8, 2021
assignee
zip: 18.5 MiB, tsv: 38.7 MiB
Disambiguated assignee data for granted patents and pre-granted applications 530,735 disamb July 8, 2021
botanic
zip: 614.2 KiB, tsv: 1.2 MiB
Botanic information for plant patents 17,477 raw July 8, 2021
brf_sum_text Brief summary text   raw  
claim Full text of patent claims, including dependency and sequence   raw  
cpc_current
zip: 1.4 GiB, tsv: 3.9 GiB
Current CPC classification data for all patents (applied retrospectively to all patents) 43,539,024 raw (from separate classification files) July 8, 2021
cpc_group
zip: 21.5 KiB, tsv: 67.8 KiB
Lookup table of current CPC groups 672 raw (from separate classification files) July 8, 2021
cpc_subgroup
zip: 5.4 MiB, tsv: 61.1 MiB
Lookup table of current CPC subgroups 260,874 raw (from separate classification files) July 8, 2021
cpc_subsection
zip: 3.2 KiB, tsv: 7.9 KiB
Lookup table of current CPC subsections 136 raw (from separate classification files) July 8, 2021
detail_desc_text Detailed patent description text   raw  
draw_desc_text Drawing description text

Reloaded 2020 table, removed duplicates and updated sequences so that all start at zero. April 21, 2021

Tables for years 1976 through 2001 have been reparsed to retain line breaks and address data issues. May 3, 2021
  raw  
foreign_priority
zip: 123.4 MiB, tsv: 295.0 MiB
Foreign priority data 3,586,169 raw July 8, 2021
figures
zip: 165.4 MiB, tsv: 293.3 MiB
Number of figures and sheets 7,186.567 raw July 8, 2021
foreigncitation
zip: 1.0 GiB, tsv: 2.6 GiB
Citations made to foreign patents by US patents 31,732,921 raw July 8, 2021
government_interest
zip: 4.9 MiB, tsv: 34.5 MiB
Raw government interest statements on all patents (where available) 155,351 raw July 8, 2021
government_organization
zip: 5.9 KiB, tsv: 34.1 KiB
Organization names and related agency hierarchy parsed from the government interest statements on all patents (where available) 297 processed July 8, 2021
inventor
zip: 41.0 MiB, tsv: 176.3 MiB
Disambiguated inventor data for granted patents and pre-granted applications 4,412,719 disamb July 8, 2021
ipcr
zip: 599.5 MiB, tsv: 1.7 GiB
International Patent Classification data for all patents (as of publication date)

18,302,110

raw July 8, 2021
lawyer
zip: 5.7 MiB, tsv: 12.2 MiB
Disambiguated lawyer data 177,822 disamb July 8, 2021
location
zip: 9.3 MiB, tsv: 32.7 MiB
Disambiguated location data, including latitude and longitude for granted patents and pre-granted applications 395,138 disamb July 8, 2021
mainclass
zip: 2.4 KiB, tsv: 7.1 KiB
Lookup table of original USPC main classes (as of patent publication date) 1,239 raw July 8, 2021
mainclass_current
zip: 7.5 KiB, tsv: 21.5 KiB
Lookup table of current USPC main technology classes (applied retrospectively to all patents) 510 raw (from separate classification files) July 8, 2021
nber zip: 115.3 MiB, tsv: 228.9 MiB NBER classification data for all patents up to May 2015 5,105,937 raw (from separate classification files) July 8, 2021
nber_category zip: 222.0 B, tsv: 113.0 B Lookup table for NBER categories 7 raw (from separate classification files) July 8, 2021
nber_subcategory
zip: 625.0 B, tsv: 928.0 B
Lookup table for NBER subcategories 38 raw (from separate classification files) July 8, 2021
non_inventor_applicant
zip: 240.5 MiB, tsv: 511.9 MiB
Non-inventor applicant information 4,552,324 raw July 8, 2021
otherreference
zip: 3.7 GiB, tsv: 7.7 GiB
Non-patent citations mentioned in patents (e.g. articles, papers, etc.) 45,677,728 raw July 8, 2021
patent
zip: 1.5 GiB, tsv: 5.7 GiB
Data on granted patents 7,720,592 raw July 8, 2021
patent_assignee
zip: 137.8 MiB, tsv: 577.1 MiB
Metadata table for many-to-many relationships 7,121,431 disamb (linking table) July 8, 2021
patent_contractawardnumber
zip: 1.6 MiB, tsv: 11.4 MiB
Contract or award numbers parsed from the government interest statements on all patents (where available) 188,018 processed July 8, 2021
patent_govintorg
zip: 741.9 KiB, tsv: 9.2 MiB
Metadata table with patent-to-organization relationships linked to the government_organization table 189,095 processed July 8, 2021
patent_inventor
zip: 224.2 MiB, tsv: 1.1 GiB
Metadata table for many-to-many relationships 18,987,127 disamb (linking table) July 8, 2021
patent_lawyer
zip: 120.9 MiB, tsv: 379.3 MiB
Metadata table for many-to-many relationships 8,813,823 disamb (linking table) July 8, 2021
pct_data
zip: 56.0 MiB, tsv: 158.8 MiB
PCT data 1,594,284 raw July 8, 2021
persistent_assignee_disambig
zip: 762.5 MiB, tsv: 1.5 GiB

Persistent Assignee Disambiguation
Updated to include missing values in the disamb_assignee_id_20201229 column field.

7,121,431 raw July 8, 2021
persistent_inventor_disambig
zip: 535.1 MiB, tsv: 2.8 GiB
Persistent Inventor Disambiguation 18,273,330 raw July 8, 2021
rawassignee
zip: 508.0 MiB, tsv: 978.6 MiB
Raw assignee information as it appears in the source text and XML files 7,071,459 raw July 8, 2021
rawexaminer
zip: 342.7 MiB, tsv: 737.0 MiB
Raw examiner information 10,436,193 raw July 8, 2021
rawinventor
zip: 1.1 GiB, tsv: 2.2 GiB
Raw inventor information as it appears in the source text and XML files 18,836,005 raw July 8, 2021
rawgender
zip: 42.7 MiB, tsv: 188.4 MiB
Gender assignment on disambiguated inventor data, performed 12-29-2020. Methods Report

4,305,415

raw April 14, 2020
rawlawyer
zip: 458.9 MiB, tsv: 929.8 MiB
Raw lawyer information as it appears in the source text and XML files 8,814,407 raw July 8, 2021
rawlocation
zip: 996.1 MiB, tsv: 2.7 GiB
Raw location data for inventors and assignees, as it appears in xml and text source files 30,482,996 raw July 8, 2021
rel_app_text
zip: 215.7 MiB, tsv: 913.2 MiB
Related applications text 2,039,709 raw July 8, 2021
subclass
zip: 599.4 KiB, tsv: 2.6 MiB
Lookup table of original USPC subclasses (as of patent publication date) 272,540 raw July 8, 2021
subclass_current
zip: 2.1 MiB, tsv: 7.3 MiB
Lookup table of current USPC subclasses (applied retrospectively to all patents) 168,048 raw (from separate classification files) July 8, 2021
us_term_of_grant
zip: 92.0 MiB, tsv: 216.8 MiB
U.S. term of grant data 3,802,048 raw July 8, 2021
usapplicationcitation
zip: 1.9 GiB, tsv: 5.6 GiB
Citations made to US patent applications by US patents 47,268,557 raw July 8, 2021
uspatentcitation
zip: 4.4 GiB, tsv: 11.2 GiB
Citations made to US granted patents by US patents 117,189,472 raw July 8, 2021
uspc
zip: 490.5 MiB, tsv: 963.5 MiB
USPC classification data for all patents 18,058,705 raw July 8, 2021
uspc_current
zip: 619.0 MiB, tsv: 1.2 GiB
Current USPC classification data for all patents up to May 2015 22,852,958 raw (from separate classification files) July 8, 2021
usreldoc
zip: 391.2 MiB, tsv: 1.1 GiB
U.S. related documents (post-2005 patents only) 11,717,159 raw July 8, 2021
wipo
zip: 28.9 MiB, tsv: 164.2 MiB
WIPO technology fields for all patents 10,296,077 raw (from separate classification files) July 8, 2021
wipo_field
zip: 1.5 KiB, tsv: 3.7 KiB500 bytes
Lookup table of WIPO technology fields 70 raw (from separate classification files) July 8, 2021

The PatentsView database is created from the U.S. Patent and Trademark Office (USPTO) public bulk data releases available at https://developer.uspto.gov/data. These data releases provide information on published patent applications (since 2001) and granted patents (since 1976). The PatentsView database for patents is available for download upon request, MySQL dump. The PatentsView database for  published applications contains pre-grant publications (PGPub) of patent applications (see MPEP § 1120) and is available in the top row of data in the table above. At this time, the published applications database does not contain the earliest years (2001-2005) of data or any of the disambiguation results.

For more information, visit the Methods and Sources section of the website.

This work was created through a government contract funded by the Office of Chief Economist in the US Patent and Trademark Office. Users are free to use, share, or adapt the material for any purpose, subject to the standards of the Creative Commons Attribution 4.0 International License.

Attribution should be given to PatentsView (www.patentsview.org) for use, distribution, or derivative works.