5 posts
lye
Last seen: 11/05/2019 - 09:35
Joined: 10/28/2019 - 13:32
Application_id in "usapplicationcitation" not matched to application id in "application"

Dear Patentsview,

Thank you very much for your effort cleaning and organizing the patent data. As a researcher focusing on innovation, your dataset has been a great help.

However, when I try to analyse the citations made to patent applications, I find that the application_id in the "usapplicationcitation" file does not match the application id in the "application" file.

For example, the application_id in the "usapplicationcitation" file looks like 2012/20120024751, i.e. year/document number, where the document number is the year followed by a 7-digit number.

However, the id in the "application" file looks like 12/377967, i.e. series_code/6-digit serial number, the same format the USPTO specifies:

Application Number - Made up of a two digit series code followed by a six digit serial number which is assigned by the USPTO (Example: 99999999 or 99/999999) https://www.uspto.gov/patents-application-process/checking-application-status/search-application

How can I link the application_id in the "usapplicationcitation" file to the application id in the "application" file? Thank you very much! Let me know if you need more information.

Russ
Last seen: 03/15/2024 - 23:34
Joined: 11/14/2017 - 22:15

This has come up before.  A granted patent's document number, like 20120024751 cited in your example, is not provided in the bulk grant XML files that the PatentsView database is based on.  It is present in the bulk application XML files, where the patent's serial number field, like 12/377967 in your example, is also present.  The same serial number field is present in the patent table/granted XML data. So you'd need to process the bulk application files to tie each cited document number to its serial number and then see whether the patent was ever granted. Not all patent applications become issued patents, but an application can still be cited by other patents.  In other words, if you match a document number to its serial number (using the application XML), you may not find that same serial number in the patents table/bulk granted XML files/granted USPTO search.
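
A minimal sketch of that step, assuming the application XML keeps the document number under publication-reference and the serial number (series code plus six digits, no slash) under application-reference. Those element paths, the trimmed-down sample record, and the function name link_row are assumptions for illustration; the document number, the 847107 serial and the filing date come from this thread, while the series code 12 is assumed.

```python
import xml.etree.ElementTree as ET

# Hypothetical, trimmed-down application record. Real bulk files hold many
# such documents and carry far more fields than shown here.
SAMPLE = """<us-patent-application>
  <us-bibliographic-data-application>
    <publication-reference>
      <document-id>
        <country>US</country>
        <doc-number>20120024751</doc-number>
        <kind>A1</kind>
      </document-id>
    </publication-reference>
    <application-reference appl-type="utility">
      <document-id>
        <country>US</country>
        <doc-number>12847107</doc-number>
        <date>20100730</date>
      </document-id>
    </application-reference>
  </us-bibliographic-data-application>
</us-patent-application>"""


def link_row(xml_text):
    """Return one row of a document-number to serial-number linking table."""
    root = ET.fromstring(xml_text)
    pub = root.findtext(".//publication-reference/document-id/doc-number")
    app = root.findtext(".//application-reference/document-id/doc-number")
    filed = root.findtext(".//application-reference/document-id/date")
    # Assumed layout: two-digit series code followed by a six-digit serial.
    serial = f"{app[:2]}/{app[2:]}"
    return {"document_number": pub, "serial_number": serial, "filing_date": filed}


print(link_row(SAMPLE))
```

Running this over every record in the bulk application files would give the full linking table; the serial_number column is then in the same series_code/serial form as the id in the "application" file.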

So you can't currently use the API or the data downloads to figure this out, but you could do USPTO searches. You'd first need to look up the document number on http://appft.uspto.gov/ to get the serial number and filing date: dn/20120024751.  You can then do an advanced search at http://patft.uspto.gov/netahtml/PTO/search-adv.htm to see if it was granted, by searching for the serial number without the series code ANDed with the application date: apn/847107 and apd/20100730. In this case that redirects you to an issued patent, 8,540,871.
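
If you have many of these to check, the query string can be built mechanically. This is only a sketch of the string format used above; the function name patft_query is made up, and it assumes the serial number arrives either as six digits or as a two-digit series code plus six digits.

```python
import re


def patft_query(serial_number: str, filing_date: str) -> str:
    """Build an 'apn/<serial> and apd/<yyyymmdd>' advanced-search query."""
    # Keep digits only, so "12/847,107" becomes "12847107".
    digits = re.sub(r"\D", "", serial_number)
    if len(digits) == 8:
        # Drop the two-digit series code; the search uses the serial alone.
        digits = digits[2:]
    if len(digits) != 6:
        raise ValueError(f"unexpected serial number: {serial_number!r}")
    if not re.fullmatch(r"\d{8}", filing_date):
        raise ValueError(f"filing date must be yyyymmdd: {filing_date!r}")
    return f"apn/{digits} and apd/{filing_date}"


print(patft_query("12/847107", "20100730"))
```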

See
http://www.patentsview.org/community/forum/7/topic/107

http://www.patentsview.org/community/forum/7/topic/130

Russ

lye
Last seen: 11/05/2019 - 09:35
Joined: 10/28/2019 - 13:32

Thank you very much!!! It took me quite a while to understand the previous posts and your suggestions. I now see that from the application XML files (http://patents.reedtech.com/parbft.php) I can extract both the application's document number, like 20120024751, and its corresponding serial number (series code plus 6-digit number). This is the linking table I need to link usapplicationcitation.tsv to application.tsv. Am I right? Thank you very much!
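
To make the intended join concrete, here is a rough sketch with tiny in-memory tables. The column names (patent_id, application_id, document_number, serial_number, id) and the citing patent numbers are assumptions for illustration and may not match the real TSV headers; the series code 12 is also assumed.

```python
import csv
import io

# Made-up miniature versions of the three tables (tab-separated).
CITATIONS_TSV = (
    "patent_id\tapplication_id\n"
    "9000001\t2012/20120024751\n"
    "9000002\t2011/20110000001\n"
)
LINK_TSV = "document_number\tserial_number\n20120024751\t12/847107\n"
APPLICATION_TSV = "id\tpatent_id\n12/847107\t8540871\n"


def read_tsv(text):
    """Parse tab-separated text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text), delimiter="\t"))


def link_citations(citations, link, applications):
    """Left-join citations to granted patents through the linking table."""
    doc_to_serial = {r["document_number"]: r["serial_number"] for r in link}
    serial_to_patent = {r["id"]: r["patent_id"] for r in applications}
    rows = []
    for row in citations:
        # "2012/20120024751" -> "20120024751" (drop the leading year part).
        doc = row["application_id"].split("/")[-1]
        serial = doc_to_serial.get(doc)
        rows.append({
            "citing_patent": row["patent_id"],
            "document_number": doc,
            "serial_number": serial,
            # None when the cited application has no matching granted patent.
            "granted_patent": serial_to_patent.get(serial),
        })
    return rows


for r in link_citations(read_tsv(CITATIONS_TSV), read_tsv(LINK_TSV),
                        read_tsv(APPLICATION_TSV)):
    print(r)
```

Keeping the unmatched rows (a left join) matters here, because a cited application may simply not have been granted.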

Russ
Last seen: 03/15/2024 - 23:34
Joined: 11/14/2017 - 22:15

Yes, you've got it!  The match-up would be much easier if the document number were in the grant XML files.  It's a shame you'd have to process all the application XML files to get it.  Another alternative would be to use the PEDS API's bulk data.  Their grant XML has both an ApplicationNumberText (full serial number without the slash) and a PublicationNumber (document number), at least in the sample I looked at.  It's a massive 56 GB file, but it would be just one file vs. downloading all the reedtech application XML files.
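
With a file that size you would want to stream it rather than load it whole. A rough sketch, assuming only the two element names mentioned above: the Record/Records wrapper tags, the sample values' pairing under series code 12, and the function names are hypothetical, and the real file's nesting and namespaces may differ.

```python
import io
import xml.etree.ElementTree as ET

# Hypothetical sample; only the two inner element names come from the post.
SAMPLE = b"""<Records>
  <Record>
    <ApplicationNumberText>12847107</ApplicationNumberText>
    <PublicationNumber>20120024751</PublicationNumber>
  </Record>
  <Record>
    <ApplicationNumberText>12000001</ApplicationNumberText>
  </Record>
</Records>"""


def local(tag):
    """Strip an XML namespace prefix such as '{uri}Name' down to 'Name'."""
    return tag.rsplit("}", 1)[-1]


def with_slash(app_number):
    """'12847107' -> '12/847107' (two-digit series code, six-digit serial)."""
    return f"{app_number[:2]}/{app_number[2:]}"


def iter_links(stream, record_tag="Record"):
    """Yield (serial_number, publication_number) pairs one record at a time."""
    current = {}
    for _, elem in ET.iterparse(stream, events=("end",)):
        name = local(elem.tag)
        if name in ("ApplicationNumberText", "PublicationNumber"):
            current[name] = (elem.text or "").strip()
        elif name == record_tag:
            # Only records carrying both fields can be used for linking.
            if len(current) == 2:
                yield (with_slash(current["ApplicationNumberText"]),
                       current["PublicationNumber"])
            current = {}
            elem.clear()  # release the finished record's children


print(list(iter_links(io.BytesIO(SAMPLE))))
```

The publication number is passed through unchanged here; if the real values carry a country prefix or kind code, they would need trimming before matching against usapplicationcitation.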

Things I mentioned in the other posts still apply:  1) the cited application may never have been granted 2) in the case of my friend's patent, it took 10 or so years to be granted.  You may not be able to match a cited document number to a granted patent today but if you check again some time in the future, that application may have become a granted patent.  

Russ

PVTeam
Role: moderator
Last seen: 03/15/2024 - 15:25
Joined: 10/17/2017 - 10:47
Re: APPLICATION_ID IN "USAPPLICATIONCITATION" NOT MATCHED TO APP

Hi,

In addition to the above, you may find this useful: http://data.patentsview.org/application_crosswalk.csv.zip. This is an application crosswalk file for patents from 2001 to 2016.

Thanks,

PVTeam