Hi, there seems to be a problem with the text in Detailed Description data for 2005-onwards, for both granted patents as well as pre-grant publications of applications.
The issue is that words and numbers (references to features in the Drawings) in the Description text are bunched together and missing spaces. Two examples from 2018 granted patents (see bolded portions):
"As shown inFIGS. 3A, 4A, and 5A, the main part of the airbag25is formed by folding a single fabric piece (also referred to as a base fabric sheet, or a fabric panel) along a fold line26, which is a folding portion at the center in the width direction, to be superposed on itself in the automobile width direction, and joining the superposed parts. To distinguish the two superposed parts of the airbag25, the part located on the inner side will be referred to as a first fabric portion27, and the part located on the outer side will be referred to as a second fabric portion28."
"Continuing with the prior embodiment and in some other instances, the mobile device app303is also configured to communicate a geographical location of the employee's mobile device to the employee monitor301to use or as one or more automated clock actions. So, assuming the employee consents and is not coerced in any manner, the employee's mobile device has the mobile device app303installed and processing on that mobile device and the geographical position can be used to initiate automated clock actions."
Most, if not all records from 2005 onwards seem to have this problem. I have not checked Brief Summary text to see if this problem is present in those records too.
Records from 1976 to 2004 do not appear to have this problem.
I suspect it has to do with the change in the raw data XML format to 4.0+ from 2005 onwards?