Hi,
I am attempting to create an updated database for my research. I have done this several time without issue. This time I am have tremendous problems with corrupted databases working with the g_patent.tsv file.
I have downloaded the lasted version (twice and happens with both).
Process is to use SQLite3 to create the database.
1. Create new database using DB Browser for SQLite
2. load g_patent.tsv with the options to disable the auto data detection (This is necessary because patent_id initially start as numbers, but have text at end of file - side note if the text cases are first in the file then auto detect would register the type correctly)
3. Set the data type with the following command.
CREATE TABLE "patent" (
"patent_id" TEXT,
"patent_type" TEXT,
"patent_date" TEXT,
"patent_title" TEXT,
"patent_abstract" TEXT,
"wipo_kind" TEXT,
"num_claims" INTEGER,
"withdrawn" INTEGER,
"filename" TEXT
);
4. Create an index on the patent_id with the following command
CREATE INDEX "pat_pat" ON "patent" (
"patent_id" ASC
);
This where the error occurs and the database becomes corrupted with the following information
(11) database corruption at line 66843 of [1b256d97b5]
(11) database corruption at line 66993 of [1b256d97b5]
(11) statement aborts at 19: [CREATE INDEX "pat_pat" ON "patent" (
"patent_id" ASC
);] database disk image is malformed
(1) executeSQL: "database disk image is malformed (CREATE INDEX \"pat_pat\" ON \"patent\" (\n\t\"patent_id\"\tASC\n);)" (, :0)
This appears to be reproducible with same line numbers as the corruption point.
Row 66843
"10067308" "utility" "2018-09-04" "Low profile fiber distribution hub" "Certain embodiments of a fiber distribution hub include a swing frame pivotally mounted within an enclosure having a low profile. For example, the enclosure can have a depth of less than about nine inches. Termination modules can be mounted to the swing frame and oriented to slide at least partially in a front-to-rear direction to facilitate access to connectors plugged into the termination modules. Splitter modules and connector storage regions can be provided within the enclosure." "B2" 31 0 "ipg180904.xml"
Row 66993
"10067459" "utility" "2018-09-04" "Image forming apparatus" "An image forming apparatus includes a main assembly, an operating portion provided slidably between a first position and a second position of the main assembly, a supporting position, a slide rail, a slidable member, and an urging unit. A relationship of engagement between the slide rail and the slidable member is set so that the engagement between the slide rail and the slidable member when the operating portion is in a position between the first position and the second position is looser than the engagement between the slide rail and the slidable member when the operating portion is in the first position or the second position." "B2" 6 0 "ipg180904.xml"
I see no obvious issues.
I do not seem to have issues with
g_cpc_at_issue.tsv
g_assignee_disambiguated.tsv
g_us_patent_citation.tsv
only g_patent.tsv
Any clues?
Andy