OpenRefine: Difference between revisions

From The QA Company
Jump to navigation Jump to search
No edit summary
Line 14: Line 14:
**Next
**Next
**Create Project
**Create Project
Nom Nationalité Date de naissance Emploi Qualification
Dennis Diefenbach Allemand/Italienne 2/15/1988 Indetermine Directeur de recherche et développement
Ali Haidar Libanais 5/10/1997 Indetermine Developpeur
Guo Kungpeng Chinois 1/13/1996 Determine Developpeur
Clement Defretiere Francais 7/2/1999 Alternance Developpeur
Lois Veni Francais 18/5/2000 Alternance Marketing
Jonathan Mallet Francais Determine Developpeur


{| class="wikitable"
{| class="wikitable"
|+
|+
|entity
|Nom
|program cci
|Nationalité
|website
|Date de naissance
|-
|<nowiki>https://linkedopendata.eu/entity/Q2463293</nowiki>
|2014GR16M1OP001
|<nowiki>https://ec.europa.eu/regional_policy/EN/atlas/programmes/2014-2020/Greece/2014GR16M1OP001</nowiki>
|-
|<nowiki>https://linkedopendata.eu/entity/Q2463518</nowiki>
|2014TC16I5CB001
|<nowiki>https://ec.europa.eu/regional_policy/EN/atlas/programmes/2014-2020/europe/2014TC16I5CB001</nowiki>
|-
|<nowiki>https://linkedopendata.eu/entity/Q2463519</nowiki>
|2014TC16I5CB002
|<nowiki>https://ec.europa.eu/regional_policy/EN/atlas/programmes/2014-2020/europe/2014TC16I5CB002</nowiki>
|-
|<nowiki>https://linkedopendata.eu/entity/Q2463520</nowiki>
|2014TC16I5CB003
|<nowiki>https://ec.europa.eu/regional_policy/EN/atlas/programmes/2014-2020/europe/2014TC16I5CB003</nowiki>
|-
|<nowiki>https://linkedopendata.eu/entity/Q4294076</nowiki>
|2014TC16I5CB004
|<nowiki>https://ec.europa.eu/regional_policy/EN/atlas/programmes/2014-2020/europe/2014TC16I5CB004</nowiki>
|-
|<nowiki>https://linkedopendata.eu/entity/Q2463524</nowiki>
|2014TC16I5CB005
|<nowiki>https://ec.europa.eu/regional_policy/EN/atlas/programmes/2014-2020/europe/2014TC16I5CB005</nowiki>
|-
|-
|<nowiki>https://linkedopendata.eu/entity/Q2463525</nowiki>
|<nowiki>Dennis Diefenbach</nowiki>
|2014TC16I5CB006
|Allemand/Italienne
|<nowiki>https://ec.europa.eu/regional_policy/EN/atlas/programmes/2014-2020/europe/2014TC16I5CB006</nowiki>
|<nowiki>2/15/1988</nowiki>
|-
|-
|<nowiki>https://linkedopendata.eu/entity/Q2463526</nowiki>
|<nowiki>Ali Haidar</nowiki>
|2014TC16I5CB007
|Libanais
|<nowiki>https://ec.europa.eu/regional_policy/EN/atlas/programmes/2014-2020/europe/2014TC16I5CB007</nowiki>
|<nowiki>5/10/1997</nowiki>
|-
|-
|<nowiki>https://linkedopendata.eu/entity/Q2463529</nowiki>
|<nowiki>Guo Kungpeng</nowiki>
|2014TC16I5CB008
|Chinois
|<nowiki>https://ec.europa.eu/regional_policy/EN/atlas/programmes/2014-2020/europe/2014TC16I5CB008</nowiki>
|<nowiki>1/13/1996</nowiki>
|-
|-
|<nowiki>https://linkedopendata.eu/entity/Q2463530</nowiki>
|<nowiki>Clement Defretiere</nowiki>
|2014TC16I5CB009
|Francais
|<nowiki>https://ec.europa.eu/regional_policy/EN/atlas/programmes/2014-2020/europe/2014TC16I5CB009</nowiki>
|<nowiki>7/2/1999</nowiki>
|-
|-
|<nowiki>https://linkedopendata.eu/entity/Q2463531</nowiki>
|<nowiki>Lois Veni</nowiki>
|2014TC16I5CB010
|Francais
|<nowiki>https://ec.europa.eu/regional_policy/EN/atlas/programmes/2014-2020/europe/2014TC16I5CB010</nowiki>
|
|-
|-
|<nowiki>https://linkedopendata.eu/entity/Q2463534</nowiki>
|<nowiki>Jonathan Mallet</nowiki>
|2014TC16M4TN001
|Francais
|<nowiki>https://ec.europa.eu/regional_policy/EN/atlas/programmes/2014-2020/europe/2014TC16M4TN001</nowiki>
|
|-
|-
|<nowiki>https://linkedopendata.eu/entity/Q2463535</nowiki>
|2014TC16M4TN002
|<nowiki>https://ec.europa.eu/regional_policy/EN/atlas/programmes/2014-2020/europe/2014TC16M4TN002</nowiki>
|}
|}


Line 114: Line 95:
</syntaxhighlight>
</syntaxhighlight>


* Press "Add Wikibase" and the Eu Knowledge Graph will appear in the list as one options to choose.
* Press "Add Wikibase" and the Wikibase will appear in the list as one options to choose.


=== Reconcile your data ===
=== Reconcile your data ===
Line 120: Line 101:
In general follow the [https://docs.openrefine.org/manual/wikibase/reconciling instructions on OpenRefine] for these steps. Here is just an example:
In general follow the [https://docs.openrefine.org/manual/wikibase/reconciling instructions on OpenRefine] for these steps. Here is just an example:


* Choose the column "entity" and click on "Reconcile -> Use Values as Identifiers"
* Choose the column "Nom" and click on "Reconcile -> Use Values as Identifiers"
* Then the column will be completely reconcile
* Then the column will be completely reconcile


Line 129: Line 110:
* Click on "add item".  
* Click on "add item".  
* Drag and drop "entity" to the field "item"
* Drag and drop "entity" to the field "item"
* Click on "add statement" and find the property "CCI ID" (P1367)
* Click on "add statement" and find the property "birth date" (P145)
* Drag and drop the column name "program cci" to the statement item
* Drag and drop the column name "Date de naissance" to the statement item
* You can now do the same for the website column with the property "info Regio url" (P1742)
* You can now check if you have some issues with the import by checking the "Issues" tab or see a preview of what you've done.
* You can now check if you have some issues with the import by checking the "Issues" tab or see a preview of what you've done.


Line 140: Line 120:
* Upload Edits
* Upload Edits


The data you ingested is now available in the Eu Knowledge Graph!
The data you ingested is now available in the Wikibase!

Revision as of 20:29, 15 August 2022

This page contains instructions to connect OpenRefine to this Wikibase instance. We offer a reconciliation service against this Wikibase that makes it easy to use OpenRefine.

Requirements

  • an account on the Wikibase instance allowing to make edits

Setup

  • Upload your data. For example
    • Create Project
    • Clipboard
    • Insert the data in the Table below
    • Next
    • Create Project

Nom Nationalité Date de naissance Emploi Qualification Dennis Diefenbach Allemand/Italienne 2/15/1988 Indetermine Directeur de recherche et développement Ali Haidar Libanais 5/10/1997 Indetermine Developpeur Guo Kungpeng Chinois 1/13/1996 Determine Developpeur Clement Defretiere Francais 7/2/1999 Alternance Developpeur Lois Veni Francais 18/5/2000 Alternance Marketing Jonathan Mallet Francais Determine Developpeur

Nom Nationalité Date de naissance
Dennis Diefenbach Allemand/Italienne 2/15/1988
Ali Haidar Libanais 5/10/1997
Guo Kungpeng Chinois 1/13/1996
Clement Defretiere Francais 7/2/1999
Lois Veni Francais
Jonathan Mallet Francais

Configure Open Refine for this Wikibase

  • On the top right click on the Extensions Wikidata button and select "Select Wikibase instance"
  • Add Wikibase
  • Paste
{
"version":"1.0",
  "mediawiki":{
     "name":"Wikibase - The QA Company",
     "root":"https://wikibase.the-qa-company.com/wiki/",
     "main_page":"https://wikibase.the-qa-company.com/wiki/Wikibase_-_The_QA_Company",
     "api":"https://wikibase.the-qa-company.com/w/api.php"
  },
  "wikibase":{
     "site_iri":"https://wikibase.the-qa-company.com/entity/",
     "tag":"",
     "maxlag":5,
     "properties":{
        "instance_of":"P5",
        "subclass_of":"P47"
     },
     "constraints":{
        "property_constraint_pid":"P58",
        "exception_to_constraint_pid":"P120",

        "constraint_status_pid":"P119",
        "mandatory_constraint_qid":"Q76",
        "suggestion_constraint_qid":"Q354",
        "distinct_values_constraint_qid":"Q425"
     }
  },
  "reconciliation":{
     "endpoint":"https://openrefine-reconciliation.wikibase.the-qa-company.com/${lang}/api"
  },
  "editgroups":{
     "url_schema":"([[:toollabs:editgroups/b/OR/${batch_id}|details]])"
  }
}
  • Press "Add Wikibase" and the Wikibase will appear in the list as one options to choose.

Reconcile your data

In general follow the instructions on OpenRefine for these steps. Here is just an example:

  • Choose the column "Nom" and click on "Reconcile -> Use Values as Identifiers"
  • Then the column will be completely reconcile

Model your data using the Wikibase Schema

In general follow the instructions on OpenRefine for these steps. Here is just an example:

  • Click on the schema tab
  • Click on "add item".
  • Drag and drop "entity" to the field "item"
  • Click on "add statement" and find the property "birth date" (P145)
  • Drag and drop the column name "Date de naissance" to the statement item
  • You can now check if you have some issues with the import by checking the "Issues" tab or see a preview of what you've done.

Upload your data

In general follow the instructions on OpenRefine for these steps. Here is just an example:

  • In the upper right click on Wikidata -> Upload Edits to Wikibase
  • Log in
  • Upload Edits

The data you ingested is now available in the Wikibase!