Showing posts with label geocoding. Show all posts
Showing posts with label geocoding. Show all posts

Wednesday, April 18, 2018

Welcome back REGPAT (+ MySQL upload scripts)

After a couple of years OECD released a new version of REGPAT, the PATSTAT plug and play extension that provides for EP and PCT person_ids an efficient NUTSIFICATION.
This edition provides regional data for 45 countries, exceeding thus the classing EU28 by adding AU, CA, ZA, US etc.
NUTS version is 2013, only London NUTS remained to version 2010.

At this link are available my MySQL scripts for uploading the tables.

Along with REGPAT also are available, by filling the form at web page:
https://survey.oecd.org/Survey.aspx?s=94638c7a1e694786a72d54c620e08ec8

the following datasets from OECD

OECD Patent Quality Indicators Database (series of indicators capturing the technological and economic value of EPO and USPTO patents);
OECD Triadic Patent Families Database (set of patents jointly filed at EPO, JPO and USPTO);
OECD Citations Database (references to patent and non-patent literature cited in EPO, USPTO or PCT patents);
OECD HAN database (harmonised patent applicants’ names).


Monday, December 18, 2017

Maxmind dataset for geolocation

 At web page:

https://www.maxmind.com/en/open-source-data-and-api-for-ip-geolocation

Maxmind delivers some dataset useful for geolocation:

GeoLite2 databases are free IP geolocation databases; The GeoLite2 Country and City databases are updated on the first Tuesday of each month. The GeoLite2 ASN database is updated every Tuesday.

IP Geolocation Usage

IP geolocation is inherently imprecise. Locations are often near the center of the population. Any location provided by a GeoIP database should not be used to identify a particular address or household.
Use the Accuracy Radius as an indication of geolocation accuracy for the latitude and longitude coordinates we return for an IP address. The actual location of the IP address is likely within the area defined by this radius and the latitude and longitude coordinates.

Includes city, region, country, latitude and longitude. This product doesn't contain any IP addresses.

Includes the following fields: ( Technical Details )

  • Country Code
  • ASCII City Name
  • City Name
  • Region
  • Population
  • Latitude (The approximate latitude of the postal code, city, subdivision or country associated with the IP address.*)
  • Longitude (The approximate latitude of the postal code, city, subdivision or country associated with the IP address.*)

Another interseting dataset, even if no more mantained is World cities with population:

City/state/country text as it appears in source files is algorithmically matched against a master geocode file from Google and MaxMind open source files.

Friday, November 17, 2017

Alternative coordinates systems

Geohash is a public domain geocoding system invented by Gustavo Niemeyer; it encodes a geographic location into a short string of letters and digits. It is hierarchical thus is possible to have arbitrary precision and the possibility of gradually removing characters from the end of the code to reduce its size (and gradually lose precision). As a consequence of the gradual precision degradation, nearby places will often (but not always) present similar prefixes. The longer a shared prefix is, the closer the two places are.

http://geohash.org/

query example:
http://geohash.org/?q=45.37,-121.7&format=url&redirect=0


What3words is a geocoding system that encodes geographic coordinates into three dictionary words with a resolution of three metres.  For example, the torch of the Statue of Liberty is located at "toned.melt.ship".
the idea behind is that a triple of words is easier to remember than a long sequence of numbers that is the usual lat-long coordinates system.
What3words has a website, apps for iOS and Android, and an API that enables bidirectional conversion of what3words address and latitude/longitude coordinates.
The grid is two-dimensional, so the addressing scheme does not distinguish between floors in a building. The system supports 14 languages, although each language covers the world's entire land areas.

This website
https://www.geocachingtoolbox.com/index.php?lang=en&page=w3w
allows to test conversion between lat-long and w3w.

Thursday, January 5, 2012

Some useful links about NUTS

As Eurostat website says,the NUTS classification is a hierarchical system for dividing up the economic territory of the EU for the purpose of :


The current NUTS classification valid from 1 January 2008 until 31 December 2011 lists 97 regions at NUTS 1, 271 regions at NUTS 2 and 1303 regions at NUTS 3 level.
 
The law regulation extablishing NUTS can be downlaodin in PDF from this link

Here is NUTS3 classification from 2003 taken from "NUTS_2003_03M.mdb", table "NUTS_AT_2003" (part of zip achieve "NUTS_RG_03M_2003_0.zip" available at Eurostat

Eventually here can be downloaded a corrrespondance table among zip codes and nuts for most EU countries.








Tuesday, August 31, 2010

how to add NUTS to patent data in patstat

As cited in a previous post, when trying to implement a geographic reclassification of patents applicants and inventors, NUTS  classification provides a hierarchical system that lists, for EU, 97 regions at NUTS 1, 271 regions at NUTS 2 and 1303 regions at NUTS 3.


One road may be to use OECD REGPAT database, that covers patent applications filed to the EPO and PCT patents at international phase. Data mainly derives from PATSTAT [this means, if you use the same ediction of your patstat, you may link it via person_id]. The regional allocation is based on the latest Nomenclature of Territorial Units for Statistics (NUTS), 2006 for European regions, and on Territorial Level 3 for other countries; datasets are available for download to researchers: by sending an e-mail to sti.contact@oecd.org. Further methodological information are available on the OECD web site at http://www.oecd.org/sti/ipr-statistics.

In order to make a home-made linkage to NUTS3 may also be useful the link between postcodes and NUTS codes, that is freely available for following 20 countries at this eurostat web page:
http://epp.eurostat.ec.europa.eu/portal/page/portal/nuts_nomenclature/correspondence_tables/postcodes_and_nuts

Tuesday, March 16, 2010

EPO patenting activity by NUTS3

Recently I had to create a benchmark for the patenting activity on EPO patents (based on patstat 10/2009) by weighting each patent by inventors' nuts3.

The result was someway amazing 'cause it seems that the TOP25 (I'm publishing below here) has many EU areas in a higher position than the first US area. The first EU area is from Holland, not from Germany as we could expect from te total count of patents...


NUTS3 CY COUNT LOCATION
JPC13 JP 102503,082 TOKYO
JPF27 JP 32278,605 OSAKA
NL414 NL 22388,468 EINDHOVEN
JPE23 JP 22307,48 AICHI
FR101 FR 21689,393 PARIS
DE212 DE 16782,779 MUNCHEN
JPC11 JP 16390,328 SAITAMA
JPF28 JP 14888,869 KOBE
FR105 FR 14340,232 BOULOGNE BILLANCOURT
ITC45 IT 14030,154 MILANO
JPC08 JP 13297,203 YOKOHAMA
JPC12 JP 11934,707 CHIBA
25017 US 11879,31 FRAMINGHAM/ACTON
SE010 SE 11489,587 STOCKHOLM
KR013 KR 11158,808 SUWON-SI
JPC22 JP 11031,511 SHIZUOKA
CH040 CH 10752,436 ZURICH
36055 US 10563,428 ROCHESTER
DE300 DE 10329,677 BERLIN
06037 US 10057,301 REDONDO
JPF26 JP 9562,31 KYOTO
06085 US 9378,459 MOUNTAIN VIEW
JPD17 JP 9260,929 YOKOHAMA-SHI
DE115 DE 8883,498 LUDWIGSBURG
FR714 FR 8711,347 GRENOBLE
FR103 FR 8603,364 VERSAILLES