8000 Make the data compatible with Wikidata rules by strainu · Pull Request #1 · geospatialorg/date-contact-localitati · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

Make the data compatible with Wikidata rules #1

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

strainu
Copy link
@strainu strainu commented Jun 5, 2021
A1BD

This changes introduces several changes which made it possible to import the database in Wikidata:

  • UAT names are the ones reconciled with Wikidata (this is actually an OpenRefine artefact,, might not be appropriate for all versions)
  • addresses are now on a single line (again, OpenRefine artefact)
  • add a Wikidata-id column
  • split multiple phones and emails into separate columns
  • limit the number of phone numbers to 4 (there was only one UAT with 6 numbers)
  • ensure the website and email(s) are in URI format, including the protocol (e.g. http:, mailto:)
  • ensure the phone number(s) are in RFC3966 format
  • solve misc errors (random signs in numbers, [at] in emails, double spaces etc.)

strainu added 2 commits June 5, 2021 21:55
This changes introduces several changes which made it possible to import the database in Wikidata:
- UAT names are the ones reconciled with Wikidata (this is actually an OpenRefine artefact,, might not be appropriate for all versions)
- addresses are now on a single line (again, OpenRefine artefact)
- add a Wikidata-id column
- split multiple phones and emails into separate columns
- limit the number of phone numbers to 4 (there was only one UAT with 6 numbers)
- ensure the website and email(s) are in URI format, including the protocol (e.g. http:, mailto:)
- ensure the phone number(s) are in RFC3966 format
- solve misc errors (random signs in numbers, [at] in emails, double spaces etc.)
Manually fix some websites which were returning 5xx and 4xx codes.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant
0