Showing posts with label USPTO. Show all posts
Showing posts with label USPTO. Show all posts

Tuesday, December 12, 2017

Google Patents public dataset

Since october 31st 2017 are available in google cloud and BigQuery platform patents data that stand behind google patents, that means worldwide bibliographic information on more than 90 million patent publications from 17 countries and US full text, provided by IFI CLAIMS Patent Services.

https://cloud.google.com/blog/big-data/2017/10/google-patents-public-datasets-connecting-public-paid-and-private-patent-data

in below page you can also find more details and examples

https://console.cloud.google.com/launcher/details/google_patents_public_datasets/google-patents-public-data

Friday, September 15, 2017

Patentsview mysql upload scripts

In previous days the new version of Patentsview database has been release

The PatentsView initiative was established in 2012 and is a collaboration between USPTO, US Deptartment of Agriculture (USDA)(1), the Center for the Science of Science and Innovation Policy, New York University, the University of California at Berkeley, Twin Arch Technologies, and Periscopic.
The PatentsView platform is built on a newly developed database that longitudinally links inventors, their organizations, locations, and overall patenting activity. The platform uses data derived from USPTO bulk data files. These data are provided for research purposes and do not constitute the official USPTO record.

From this link you can download my scripts for mysql to upload the new data.

Tuesday, August 22, 2017

2017 PatentsView Workshop on Engaging User Communities

The US Patent and Trademark Office’s Chief Economist has open  invitations to the 2017 PatentsView Workshop on Engaging User Communities.

The workshop is open to the public and will be held on Friday October 6, from 8:30am – 12:30pm on the USPTO campus in Alexandria, VA.

USPTO will officially launch at the workshop the PatentsView Community Site, updated data visualization with export functionality, and new data fields that can be accessed across PatentsView tools.  This year’s updates are based on feedback gathered by the user community at the 2016 PatentsView workshop.  The new Community Site includes a moderated forum for user inquiries and a Data in Action page for sharing analyses, visualizations, and publications. 

The goals of the workshop are:

(1) to launch the new Community Site and Data Visualization features;

(2) to present newly parsed and available patent data fields; and

(3) to gather feedback from patent data and analytics user communities in order to set priorities for future PatentsView open data products.  


More informations here.


Agenda buildup is in progress and it will be fully delivered in the coming weeks. 

Sunday, June 7, 2015

PatentsView Inventor Disambiguation Technical Workshop

On behalf of the US Patent & Trademark Office, the American Institutes for Research (AIR) is hosting an inventor disambiguation technical workshop.  USPTO is seeking creative new approaches to get better information on innovators and the new technologies they develop by disambiguating inventor names.

AIR invites individual researchers or research teams to develop inventor disambiguation algorithms using US patent data. The top fifteen teams will be invited to present their results at the final workshop, which will be held at USPTO headquarters September 23 and 24.  The researcher or team that contributes the most effective algorithm will receive a $25,000 stipend.

July 1st is the Deadline for prospective participants to submit a 1-page “intent to participate” document. This includes any requests to incorporate additional data, software, or hardware requirements. All teams with proposals deemed to be reasonable (by the judges’ panel) will be invited to participate.

Additional information is posted at www.dev.patentsview.org/workshop, together with the training datasets.

Wednesday, November 21, 2012

Patent attorney for USPTO

As wikipedia describes (http://en.wikipedia.org/wiki/Patent_attorney)
A patent attorney is an attorney who has the specialized qualifications necessary for representing clients in obtaining patents and acting in all matters and procedures relating to patent law and practice, such as filing an opposition. The term is used differently in different countries, and thus may or may not require the same legal qualifications as a general legal practitioner.

Such data may be useful to be introduced f.i. in applicants disambiguations (two very similar names filing via  the same attorney are more likely to be the same company...)

Bytheway suc data is not available in patstat, but is in USPTO data

One example of website displying suh data is: http://www.patentmaps.com/attorney/index.html


Wednesday, September 21, 2011

About Us patent reform

@this link
http://patentmanager.blogspot.com/2011/09/us-patent-reform-made-easy-whats-in-it.html
you may find an exaustive synthesis of waht will change with the new patent reform recently approved in USPTO.



Thursday, January 27, 2011

Link between US patent classes and IPC

This one comes straight from epo patstat forum, you may find the original @ this link.

If you are in need of matching US Patend Classification (USPC in short) against IPC, there are some web pages in USPTO web site that may be useful:


USPC-to-IPC Reverse Concordance (entries are grouped by IPC notation, rather than USPC notation): http://www.uspto.gov/web/patents/classification/international/ipc/ipc8/ipc_concordance/ipcsel.htm

US-to-IPC Concordance Pages from Classification Home (by choosing US-to-IPC8 Concordance in the radio button) : http://www.uspto.gov/web/patents/classification/index.htm

For more details, see USPTO dedicated page:
http://www.uspto.gov/web/patents/classification/international.htm

Wednesday, November 10, 2010

Double publication numbers by application id in USPTO

As previously posted, several application authority may have more than one publication number referring to the same application id.

In the case of uspto these are the main publication kind pairs of publication numbers referred to the same application id. It may be useful for creating a disambiguation table.


kind1 kind2 count
'A'  'B1' 3895
'A'  'B2' 124
'A'  'C1' 356
'A'  'E1' 12
'A1'  'A2' 521
'A1'  'A9' 1662
'A1'  'B1' 18410
'A1'  'B2' 845793
'A1'  'H1' 43
'A2'  'B2' 169
'A9'  'B2' 768
'B1'  'B2' 142
'E'  'F1' 59
'P1'  'P2' 424
'P1'  'P3' 1686

Bytheway, counting the difference in months among filing date and publication date for the first of the two pubblication kinds, we get the following result:


months # of applications
4 50501
5 53963
6 130103
7 111533
8 67594
9 48504
10 33086
11 26268
12 20754
13 15320
14 13855
15 11927
16 9346
17 7075
18 186961
19 32484
20 6354

Above 95% of applications listed in the first table are published before 18 months (cases in month 19 may be a rounding error), conforting so our hipothesys that the first of the 2 publication is due to the 18 month publication law.


See also:
18-month publication provision:
http://www.uspto.gov/web/offices/pac/mpep/documents/1100_1120.htm

About non-publication request:
http://www.uspto.gov/web/offices/pac/mpep/documents/appxr_1_213.htm#cfr37s1.213

Thanks to Francesco Lissoni for help in solving this issue.

Thursday, October 28, 2010

USPTO persons data quality

In a previous post I analized geographic data quality of inventors in PATSTAT, for all the data contained.
The results for USPTO where not so good...


APPLN_AUTH inventors no state no zip no country no address no city
US 5960856 86% 98% 21% 97% 25%

In order to understand how is the trend across years (if data quality is improving or noise is spread allover the data) I selected applicant/inventors who filed after 31/12/1999 applications type “A” (patents of invention);
the results was that 1% have no country and about 2% have no city (so it could be impossible to assign a region/NUTS2 code).



distinct appl/inv 2774408
ctry code blank 29858 1,08%
city blank 56119 2,02%

If we investigate by inventor/applicant county whether  there may be some nation having a BIAS (only for those countries having more than 100 inventors/applicants) we see that those who have values above average are countries outside Europe, so not included in some regional reclassifications like NUTS2.


CTRY CODE COUNT EMPTY CITY %
'' 29858 29071 97,40%
'AE' 166 2 1,20%
'AR' 1265 21 1,70%
'AT' 10330 117 1,10%
'AU' 22688 499 2,20%
'BB' 181 21 11,60%
'BE' 12980 230 1,80%
'BG' 428 5 1,20%
'BM' 249 39 15,70%
'BR' 4137 54 1,30%
'BS' 137 3 2,20%
'BY' 163 2 1,20%
'CA' 70324 800 1,10%
'CH' 23982 489 2,00%
'CL' 558 14 2,50%
'CN' 29490 180 0,60%
'CO' 251 5 2,00%
'CR' 148 20 13,50%
'CU' 547 7 1,30%
'CY' 156 5 3,20%
'CZ' 1297 34 2,60%
'DE' 178725 2477 1,40%
'DK' 10973 204 1,90%
'EE' 225 0 0,00%
'EG' 228 0 0,00%
'ES' 10369 151 1,50%
'FI' 13552 123 0,90%
'FR' 67931 881 1,30%
'GB' 76473 1480 1,90%
'GR' 801 14 1,70%
'HK' 5289 128 2,40%
'HR' 378 6 1,60%
'HU' 2316 14 0,60%
'ID' 219 8 3,70%
'IE' 5163 60 1,20%
'IL' 25108 273 1,10%
'IN' 18600 189 1,00%
'IR' 236 5 2,10%
'IS' 414 9 2,20%
'IT' 31398 516 1,60%
'JP' 479365 3346 0,70%
'KR' 115483 978 0,80%
'KW' 162 0 0,00%
'KY' 255 39 15,30%
'LI' 376 21 5,60%
'LT' 149 2 1,30%
'LU' 815 55 6,70%
'LV' 115 0 0,00%
'MC' 113 6 5,30%
'MX' 2341 25 1,10%
'MY' 2666 24 0,90%
'NL' 28215 972 3,40%
'NO' 5912 106 1,80%
'NZ' 3875 141 3,60%
'PH' 742 3 0,40%
'PK' 135 0 0,00%
'PL' 1139 5 0,40%
'PT' 735 8 1,10%
'RO' 328 0 0,00%
'RU' 6956 40 0,60%
'SA' 516 3 0,60%
'SE' 23806 343 1,40%
'SG' 8106 98 1,20%
'SI' 567 4 0,70%
'SK' 277 0 0,00%
'TH' 797 18 2,30%
'TR' 618 3 0,50%
'TW' 113480 939 0,80%
'UA' 844 1 0,10%
'US' 1311956 10561 0,80%
'VE' 456 3 0,70%
'VG' 430 48 11,20%
'YU' 103 0 0,00%
'ZA' 2572 68 2,60%

Wednesday, July 7, 2010

Inventors data in patstat: Epo vs uspto

Patstat stores data about applicants and inventors inside a table with the prefix TLS206, indexed using a field named PERSON_ID that can be linked to the applications via table TLS207_PERS_APPLN linking each person_id to an application id (appln_id)Patstat DVD provides two versions of TLS206: the first TLS206_PERSON is a comma separated value file containing mainly the fields name, address and country code.
TLS206_ASCII instead contains the same information already parsed (and also something more, see below for fields list): name is splitted into last, first and middle name; address into street, city, state and zip code.

Data origins for person names and addresses:


1) EPO Register for EP patent applications; details are those that were the most recent in the EP Register at the time of extraction of the data.
2) OECD patents database for US data post 1976-01-01 up to and including November 15th 2005 for Published Grants.
3) PATSTAT weekly file extracts from USPTO website for Published Grants from November 22nd 2005 until today;  Published Applications  from September 29th 2005 to today inclusive.
4) Inventor & Applicant names for US PTO Published Applications from March 1st 2001 to September 22nd 2005 from DOCDB ,  data-format="docdba".
5) all other names  from DOCDB , data-format="docdba" (US data for names and addresses for patents published before 1976-01-01 is taken from the EPO's DOCDB Database)

TLS206_ASCII FIELD LIST                           

prof-last-name
prof-first-name
prof-middle-names
prof-street
prof-city
prof-state
prof-zip-code
 
TLS206_person field list                      

Person_id                         
Person_name                 
Person_address                             
Person_ctry_code         

Wednesday, April 14, 2010

Counting patstat patents by application year

Here below a small table created by counting by application year patents applied @ EPO vs patents applied @ USPTO. Data have been counted from september 2009 data, starting from 1980.
Note that the peak is in 2004 for US and 2006 for EPO due to the examination lag (for both office) and also to the delay in including Us patents in patstat, for USPTO.
Nota also that 2001 sudden rise of 100% of patent count for USPTO is due to a change in the US Patent Law, since US Patent and Trademark Office was obliged to publish patent applications from 29 November 2000. However USPTO internal publishing system meant that publication actually began in March 2001.



appl year EPO pat# USPTO pat#
1980 19724 67106
1981 24959 64458
1982 28522 65614
1983 31609 62165
1984 36952 67690
1985 39375 72161
1986 43083 75925
1987 45817 82387
1988 52186 91028
1989 57727 97146
1990 63907 100332
1991 59280 101194
1992 60626 104798
1993 60066 109216
1994 62005 124374
1995 65308 145762
1996 71413 146351
1997 80488 172249
1998 90974 175349
1999 98718 193052
2000 109395 229731
2001 115110 428829
2002 112839 443539
2003 115652 458809
2004 120813 490649
2005 126739 488568
2006 129292 404225
2007 112413 328202
2008 35728 183275
2009 2909 13271

Here below a graph showing the differences...

Tuesday, February 23, 2010

Origin of citations in European Patent Office

Whenever you'd need to understand who introduces a citation to a patent in PATSTAT, you may look at  table TLS212_citation, where you may find the field CITN_ORIGIN.

The content of such field may be any of these


0 - SEA- citations introduced during search
1 - APP- citations introduced by the applicant
2 - EXA- citations introduced during examination
3 - OPP - citations introduced during opposition
4 - 115- citations introduced according to Art 115 EPC
5,6 not public



Let's see the content of this field in EPO patents (count made on PATSTAT 10/2009 grouping by publication numbers)



ORIGIN        COUNT OF CITATIONS
0                    22985072
1                     6175203
2                      178744
3                       15839
4                       13533

While in in USPTO patents..
(count made with same criteria as above)

ORIGIN        COUNT OF CITATIONS
0              37425072 
1              15534910

The higher number of applicant introduced citations is due to the "duty of candor" rule which imposes the disclosure of all prior art.