src/hdalab/management/commands/import_hdabo_db.py
author ymh <ymh.work@gmail.com>
Wed, 11 Apr 2018 12:19:47 +0200
branchdocumentation
changeset 693 09e00f38d177
parent 686 385e3a12ee27
permissions -rw-r--r--
Add hdabo/hdalab documentations
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
135
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
     1
# -*- coding: utf-8 -*-
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
     2
'''
693
09e00f38d177 Add hdabo/hdalab documentations
ymh <ymh.work@gmail.com>
parents: 686
diff changeset
     3
Cette commande transforme une base de donnée hdabo en base de donnée hdalab.
09e00f38d177 Add hdabo/hdalab documentations
ymh <ymh.work@gmail.com>
parents: 686
diff changeset
     4
En particulier cette command est utilisée dans l'import de l'extraction rdf des données HDA.
09e00f38d177 Add hdabo/hdalab documentations
ymh <ymh.work@gmail.com>
parents: 686
diff changeset
     5
09e00f38d177 Add hdabo/hdalab documentations
ymh <ymh.work@gmail.com>
parents: 686
diff changeset
     6
Cette commande enchaine en fait les commandes Django suivantes:
135
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
     7
693
09e00f38d177 Add hdabo/hdalab documentations
ymh <ymh.work@gmail.com>
parents: 686
diff changeset
     8
    - `migrate <https://docs.djangoproject.com/en/1.8/ref/django-admin/#django-admin-migrate>`_
09e00f38d177 Add hdabo/hdalab documentations
ymh <ymh.work@gmail.com>
parents: 686
diff changeset
     9
    - :mod:`query_wikipedia_category \-f \-\-all <hdalab.management.commands.query_wikipedia_category>` (si l'option est passée)
09e00f38d177 Add hdabo/hdalab documentations
ymh <ymh.work@gmail.com>
parents: 686
diff changeset
    10
    - :mod:`query_dbpedia \-f \-\-all <hdalab.management.commands.query_dbpedia>`
09e00f38d177 Add hdabo/hdalab documentations
ymh <ymh.work@gmail.com>
parents: 686
diff changeset
    11
    - :mod:`fill_tag_years <hdalab.management.commands.fill_tag_years>`
09e00f38d177 Add hdabo/hdalab documentations
ymh <ymh.work@gmail.com>
parents: 686
diff changeset
    12
    - :mod:`geojson_transform \<chemin répertoire data\>/countries.geo.json <hdalab.management.commands.geojson_transform>`
09e00f38d177 Add hdabo/hdalab documentations
ymh <ymh.work@gmail.com>
parents: 686
diff changeset
    13
    - :mod:`query_geo_inclusion <hdalab.management.commands.query_geo_inclusion>`
09e00f38d177 Add hdabo/hdalab documentations
ymh <ymh.work@gmail.com>
parents: 686
diff changeset
    14
    - :mod:`import_insee_csv \<chemin répertoire data\>/villes.csv <hdalab.management.commands.import_insee_csv>`
09e00f38d177 Add hdabo/hdalab documentations
ymh <ymh.work@gmail.com>
parents: 686
diff changeset
    15
    - :mod:`import_insee_csv \<chemin répertoire data\>/additional_cities.csv <hdalab.management.commands.import_insee_csv>`
09e00f38d177 Add hdabo/hdalab documentations
ymh <ymh.work@gmail.com>
parents: 686
diff changeset
    16
    - :mod:`import_hda_insee_csv \<chemin répertoire data\>/HDA_Insee.csv <hdalab.management.commands.import_hda_insee_csv>`
09e00f38d177 Add hdabo/hdalab documentations
ymh <ymh.work@gmail.com>
parents: 686
diff changeset
    17
    - :mod:`query_category_inclusion \-f \-\-all <hdalab.management.commands.query_category_inclusion>`
09e00f38d177 Add hdabo/hdalab documentations
ymh <ymh.work@gmail.com>
parents: 686
diff changeset
    18
09e00f38d177 Add hdabo/hdalab documentations
ymh <ymh.work@gmail.com>
parents: 686
diff changeset
    19
**Usage**: ``django-admin import_hdabo_db [options] [<chemin répertoire data>]``
09e00f38d177 Add hdabo/hdalab documentations
ymh <ymh.work@gmail.com>
parents: 686
diff changeset
    20
09e00f38d177 Add hdabo/hdalab documentations
ymh <ymh.work@gmail.com>
parents: 686
diff changeset
    21
**Options spécifiques:**
09e00f38d177 Add hdabo/hdalab documentations
ymh <ymh.work@gmail.com>
parents: 686
diff changeset
    22
09e00f38d177 Add hdabo/hdalab documentations
ymh <ymh.work@gmail.com>
parents: 686
diff changeset
    23
    - *\-c,\-\-categories*: ajoute la commande :mod:`query_wikipedia_category \-f \-\-all <hdalab.management.commands.query_wikipedia_category>` à la chaîne de traitements.
09e00f38d177 Add hdabo/hdalab documentations
ymh <ymh.work@gmail.com>
parents: 686
diff changeset
    24
135
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    25
'''
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    26
from django.core.management.base import BaseCommand
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    27
from django.core.management import call_command
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    28
from optparse import make_option
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    29
import os.path
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    30
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    31
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    32
class Command(BaseCommand):
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    33
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    34
    args = "[<path to the data folder>]"
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    35
    option_list = BaseCommand.option_list + (
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    36
        make_option('-c', '--categories',
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    37
            action='store_true',
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    38
            dest='categories',
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    39
            default=False,
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    40
            help='load categories in db'),
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    41
    )
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    42
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    43
686
385e3a12ee27 Containerization and various corrections to make it work
ymh <ymh.work@gmail.com>
parents: 571
diff changeset
    44
    def handle(self, *args, **options):
385e3a12ee27 Containerization and various corrections to make it work
ymh <ymh.work@gmail.com>
parents: 571
diff changeset
    45
135
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    46
        if len(args) == 0:
284
f01235a1dcc2 correct ontology requests
cavaliet
parents: 272
diff changeset
    47
            data_path = os.path.abspath(os.path.join(os.path.abspath(__file__),'../../../../../data'))
135
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    48
        else:
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    49
            data_path = args[0]
686
385e3a12ee27 Containerization and various corrections to make it work
ymh <ymh.work@gmail.com>
parents: 571
diff changeset
    50
228
92d733f0d179 Add some trace.
ymh <ymh.work@gmail.com>
parents: 227
diff changeset
    51
        print("===========  MIGRATE ===========")
571
d9642be7c937 replace commit_on_success with atomic
ymh <ymh.work@gmail.com>
parents: 443
diff changeset
    52
        call_command('migrate')
135
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    53
        if options.get('categories', False):
228
92d733f0d179 Add some trace.
ymh <ymh.work@gmail.com>
parents: 227
diff changeset
    54
            print("===========  QUERY WIKIPEDIA CATEGORY ===========")
443
27f71b0a772d next version of import_rdf
ymh <ymh.work@gmail.com>
parents: 284
diff changeset
    55
            call_command('query_wikipedia_category', interactive=False, force=True, all=True)
228
92d733f0d179 Add some trace.
ymh <ymh.work@gmail.com>
parents: 227
diff changeset
    56
686
385e3a12ee27 Containerization and various corrections to make it work
ymh <ymh.work@gmail.com>
parents: 571
diff changeset
    57
        print("===========  QUERY DBPEDIA ===========")
443
27f71b0a772d next version of import_rdf
ymh <ymh.work@gmail.com>
parents: 284
diff changeset
    58
        call_command('query_dbpedia', interactive=False, force=True, all=True)
228
92d733f0d179 Add some trace.
ymh <ymh.work@gmail.com>
parents: 227
diff changeset
    59
        print("===========  FILL TAG YEAR ===========")
135
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    60
        call_command('fill_tag_years')
228
92d733f0d179 Add some trace.
ymh <ymh.work@gmail.com>
parents: 227
diff changeset
    61
        print("===========  GEOJSON TRANSFORM ===========")
227
b0cd3e6e31c7 Update readmin on hdabo -> hdalab migration and update import_hdabo_db management commend
ymh <ymh.work@gmail.com>
parents: 152
diff changeset
    62
        call_command('geojson_transform', os.path.join(data_path,'countries.geo.json'))
228
92d733f0d179 Add some trace.
ymh <ymh.work@gmail.com>
parents: 227
diff changeset
    63
        print("===========  QUERY GEO INCLUSION ===========")
152
e2ceeb903de6 improve requests
ymh <ymh.work@gmail.com>
parents: 135
diff changeset
    64
        call_command('query_geo_inclusion')
228
92d733f0d179 Add some trace.
ymh <ymh.work@gmail.com>
parents: 227
diff changeset
    65
        print("===========  IMPORT INSEE CSV PASS 1 ===========")
135
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    66
        call_command('import_insee_csv', os.path.join(data_path,'villes.csv'))
228
92d733f0d179 Add some trace.
ymh <ymh.work@gmail.com>
parents: 227
diff changeset
    67
        print("===========  IMPORT INSEE CSV PASS 2 ===========")
135
dd6578e36a57 translate interface
ymh <ymh.work@gmail.com>
parents:
diff changeset
    68
        call_command('import_insee_csv', os.path.join(data_path,'additional_cities.csv'))
228
92d733f0d179 Add some trace.
ymh <ymh.work@gmail.com>
parents: 227
diff changeset
    69
        print("===========  IMPORT HDA INSEE CSV ===========")
152
e2ceeb903de6 improve requests
ymh <ymh.work@gmail.com>
parents: 135
diff changeset
    70
        call_command('import_hda_insee_csv', os.path.join(data_path,'HDA_Insee.csv'))
228
92d733f0d179 Add some trace.
ymh <ymh.work@gmail.com>
parents: 227
diff changeset
    71
        print("===========  QUERY CATEGORY INCLUSION ===========")
230
2185b943cabd improve database import
ymh <ymh.work@gmail.com>
parents: 228
diff changeset
    72
        call_command('query_category_inclusion', all=True, force=True, interactive=False)
686
385e3a12ee27 Containerization and various corrections to make it work
ymh <ymh.work@gmail.com>
parents: 571
diff changeset
    73