As best I can tell if we want to know when any given table was last updated I have to visit the data download page and check each cell in the tables; which is not very machine readable. It would be really useful if there were a lastupdated.json (or csv or xml) document that contained both the last updated information and link to download the latest dataset (checksums would be a nice to have as well).
For example,
{
"data_up_to": "20230629T000000Z",
"data_released_on": "20230920T000000Z",
"granted_tables": [
{
"name": "g_applicant_not_disambiguated",
"last_updated": "20230920T000000Z",
"download_link": "https://s3.amazonaws.com/data.patentsview.org/download/g_applicant_not_disambiguated.tsv.zip",
"md5": "xxxxxxxxx",
"sha256": "xxxxxxxxxxxxxxxx"
},
{
"name": "g_assignee_disambiguated",
"last_updated": "20230927T000000Z",
"download_link": "https://s3.amazonaws.com/data.patentsview.org/download/g_assignee_disambiguated.tsv.zip",
"md5": "xxxxxxxxx",
"sha256": "xxxxxxxxxxxxxxxx"
}
...
],
"pregrant_tables": [ .... ]
}
Does anything like this exist? This would help greatly with local development processes and staying up to date.