Thursday, May 26, 2011

Patstat Plugins

This is a list of data sets that can easily be added to PATSTAT (by "easily" I mean with a join on appln_id or person_id).

EEE_PPAT

The EEE-PPAT table is an extension of the PERSON table, produced by ECOOM (Catholic University of Leuven) and Eurostat. The extension adds a sector allocation for applicants and two levels of name harmonization for applicants.

It can be requested at no cost by contacting technoinfo@ecoom.be; some documentation is available at this link.

It contains the following columns with original PATSTAT data:
        - PERSON_ID
        - PERSON_CTRY_CODE
        - DOC_STD_NAME_ID
        - PERSON_NAME
        - PERSON_ADDRESS

It contains the following additional columns:
        - HRM_LEVEL1: harmonized name level1 (see paper NAME_HARMONIZING(LEVEL1).pdf)
        - HRM_LEVEL2: harmonized name level2 (see paper HARMONIZING_HARMONIZED_PATENTEE_NAMES(LEVEL2).pdf)
        - SECTOR: sector of assignee name (see paper SECTOR ALLOCATION.pdf)

The file is UTF-8 encoded and tab-delimited.
The table is defined as follows:
        - PERSON_ID           number(9)
        - PERSON_CTRY_CODE    char(2)
        - DOC_STD_NAME_ID     number(9)
        - PERSON_NAME         char(400)
        - PERSON_ADDRESS      char(500)
        - HRM_LEVEL1          char(400)
        - HRM_LEVEL2          char(400)
        - SECTOR              char(50)
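
As a rough illustration of how a file with this layout could be loaded, here is a minimal Python sketch that reads tab-delimited rows into an SQLite table with the eight columns listed above. The table name eee_ppat, the function load_eee_ppat, the presence of a header row, and the sample row are all assumptions made for the example, not part of the official distribution.

```python
import csv
import io
import sqlite3

# Column layout taken from the table definition above; SQLite types are approximations.
COLUMNS = [
    ("PERSON_ID", "INTEGER"),
    ("PERSON_CTRY_CODE", "TEXT"),
    ("DOC_STD_NAME_ID", "INTEGER"),
    ("PERSON_NAME", "TEXT"),
    ("PERSON_ADDRESS", "TEXT"),
    ("HRM_LEVEL1", "TEXT"),
    ("HRM_LEVEL2", "TEXT"),
    ("SECTOR", "TEXT"),
]


def load_eee_ppat(conn, lines, has_header=True):
    """Load tab-delimited rows into a (hypothetical) table named eee_ppat."""
    col_defs = ", ".join(f"{name} {sqltype}" for name, sqltype in COLUMNS)
    conn.execute(f"CREATE TABLE IF NOT EXISTS eee_ppat ({col_defs})")
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    if has_header:  # assumption: the first line holds the column names
        next(reader, None)
    # Keep only rows that have exactly one value per expected column.
    rows = [row for row in reader if len(row) == len(COLUMNS)]
    placeholders = ", ".join("?" for _ in COLUMNS)
    conn.executemany(f"INSERT INTO eee_ppat VALUES ({placeholders})", rows)
    conn.commit()
    return len(rows)


# Made-up sample data standing in for the real file.
sample = io.StringIO(
    "PERSON_ID\tPERSON_CTRY_CODE\tDOC_STD_NAME_ID\tPERSON_NAME\t"
    "PERSON_ADDRESS\tHRM_LEVEL1\tHRM_LEVEL2\tSECTOR\n"
    "1\tBE\t10\tACME N.V.\tLeuven\tACME\tACME\tCOMPANY\n"
)
conn = sqlite3.connect(":memory:")
loaded = load_eee_ppat(conn, sample)
```

For the real file, the same function could be fed a file object opened with open(path, encoding="utf-8", newline="") instead of the in-memory sample.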

HAN DATABASE

On the same topic there is the OECD HAN database, produced by the OECD, which provides a dictionary of applicants’ names that have been elaborated with business register data, so that it can easily be matched by all users. The data are based on applicants for patents filed at the EPO and through the PCT. The dataset is complementary to Eurostat's method for harmonising applicants' names.

Linkage is possible either via publication numbers or via person_id.

It is also free and can be requested by email at sti.contact@oecd.org
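
To give an idea of what linkage via person_id looks like in practice, here is a small sketch using SQLite from Python. The tables person and han_names, the column han_name, and the sample rows are invented for illustration; the real HAN files and PATSTAT tables have their own names and layouts. A LEFT JOIN keeps persons without a harmonized name, and COALESCE falls back to the original name for them.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Hypothetical stand-ins for the PATSTAT person table and a HAN name table.
CREATE TABLE person (person_id INTEGER PRIMARY KEY, person_name TEXT);
CREATE TABLE han_names (person_id INTEGER PRIMARY KEY, han_name TEXT);
INSERT INTO person VALUES (1, 'ACME N.V.'), (2, 'ACME NV'), (3, 'DOE, JOHN');
INSERT INTO han_names VALUES (1, 'ACME'), (2, 'ACME');
""")

# Join on person_id; persons with no harmonized entry keep their original name.
rows = conn.execute("""
    SELECT p.person_id, COALESCE(h.han_name, p.person_name) AS clean_name
    FROM person AS p
    LEFT JOIN han_names AS h ON h.person_id = p.person_id
    ORDER BY p.person_id
""").fetchall()

for person_id, clean_name in rows:
    print(person_id, clean_name)
```

In this toy data the two spellings of the same company collapse to one harmonized name, while the unmatched person keeps the original spelling.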

INPADOC LEGAL STATUS

On a different topic, there is also the INPADOC legal status table (produced by the EPO), which is linked to the core database via appln_id. As a matter of fact, the table is named TLS221, following the PATSTAT naming convention.

These data have already been described in this blog here.

They can be acquired (yes, they are not free) from the EPO.
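
The link via appln_id can be sketched in the same way. In the example below only the table name tls221 and the key appln_id come from the description above; the appln table, the legal_event column, and the sample rows are assumptions for illustration. Since one application can have several legal events, the join is one-to-many, so the sketch counts events per application.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- appln is a hypothetical stand-in for the core application table.
CREATE TABLE appln (appln_id INTEGER PRIMARY KEY, appln_auth TEXT);
-- legal_event is an invented column name, not the real TLS221 layout.
CREATE TABLE tls221 (appln_id INTEGER, legal_event TEXT);
INSERT INTO appln VALUES (100, 'EP'), (200, 'EP');
INSERT INTO tls221 VALUES (100, 'EVENT_A'), (100, 'EVENT_B');
""")

# LEFT JOIN so that applications without any legal event are reported with 0.
events_per_appln = conn.execute("""
    SELECT a.appln_id, COUNT(l.appln_id) AS n_events
    FROM appln AS a
    LEFT JOIN tls221 AS l ON l.appln_id = a.appln_id
    GROUP BY a.appln_id
    ORDER BY a.appln_id
""").fetchall()

for appln_id, n_events in events_per_appln:
    print(appln_id, n_events)
```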
