ЕСОЗ - публічна документація
RC_Offline comparison of divorce acts with Persons (DRACS 2.0)
Purpose
This process fetches divorce acts data from mimir service and uses it for comparison with persons data.
Key points
This process uses cron parameter to configure its start time.
This process is performed in mpi scheduler pod.
Each divorce act is compared by husband data and by wife data separately.
Configuration
Value | Description | Example |
---|---|---|
PERSONS_DRACS_DIVORCE_ACTS_COMPARE_SCHEDULE | Cron parameter, represents start time of offline comparison of divorce acts with persons process |
|
PERSONS_DRACS_DIVORCE_ACTS_COMPARE_BATCH_SIZE | Number of divorce acts that will be processed in one cycle of the process |
|
PERSON_DRACS_DIVORCE_ACT_PARTIAL_MATCH_SCORE | Upper limit of comparison score for black zone |
|
Service logic
Offline comparison job starts according to PERSONS_DRACS_DIVORCE_ACTS_COMPARE_SCHEDULE cron parameter.
Step 1. Divorce acts list for comparison
Get list of divorce acts from that are ready to be compared to persons - perform RPC call to mimir service,
dracs_divorce_acts
table, with following parameters:ar_op_name = ‘1' or '4’
persons_compare_status = ‘READY’
Sort obtained list in ascending order by
dracs_divorce_acts.updated_at
fieldLimit obtained list with PERSONS_DRACS_DIVORCE_ACTS_COMPARE_BATCH_SIZE value
In case no divorce acts were fetched - end process.
Step 2. Data preparation and potential candidates select
Get divorce act from obtained list, update its status in dracs_divorce_acts table:
set persons_compare_status = ‘IN_PROCESS’
Get husband data for comparison process if mn_old_surname <> mn_surname and these fields are not empty
Get wife data for comparison process if wmn_old_surname <> wmn_surname and these fields are not empty
Prepare divorce act data for comparison process
perform regexp for each of the fields:
for mn_name, wmn_name, mn_old_surname, wmn_old_surname, mn_patronymic and wmn_patronymic change:
[ --'] to ''
'є' to 'е'
'и' to 'і'
for mn_series_numb and wmn_series_numb change:
[ /%#№ _-] to ''
for mn_date_birth and wmn_date_birth change mask from
dd.mm.yyyy
toyyyy-mm-dd
for mn_numident and wmn_numident normalize value - check that it is 10 symbols, if not true - set null
Get active persons (is_active=true, status=active) as candidates for comparison with husband or wife data from divorce act from MPI db using following predicate blocks:
tax_id
documents.number
birth_date + last_name
settlement_id + last_name
In case no candidates were found for husband and/or wife, update divorce acts persons compare status:
set persons_compare_status = ‘PROCESSED’
Go to next divorce act in obtained list (return to p.1 of Step 2)
Step 3. Divorce act comparison process
Get person from obtained candidates list
Prepare persons data for comparison process according to Deduplication process NEW | Data cleaning and preparation
Define person’s document for Deduplication process:
If passport document type is present in the act (DocType=1), then choose person’s PASSPORT or NATIONAL_ID depending of the passport number in the act by regex.
If other document type is present in the act, then submit a list of documents to deduplication process: BIRTH_CERTIFICATE, COMPLEMENTARY_PROTECTION_CERTIFICATE PERMANENT_RESIDENCE_PERMIT, REFUGEE_CERTIFICATE, TEMPORARY_CERTIFICATE, TEMPORARY_PASSPORT
Compare divorce act data with person data using logistic regression method, as implemented in Deduplication process:
For each variable field use separate calculation process based on the table below.
Calculate final comparison score between divorce act and person.
Step 4. Verification candidates
Before performing this step, check that divorce act record is still in persons_compare_status = 'IN_PROCESS'.
In case status is changed - skip this step, go to next divorce act in obtained list (return to p.1 of Step 2).
Based on divorce act and person comparison score, there are two flows that can be performed:
white zone or grey zone (comparison score is greater than PERSON_DRACS_DIVORCE_ACT_PARTIAL_MATCH_SCORE value)
black zone (comparison score is less than PERSON_DRACS_DIVORCE_ACT_PARTIAL_MATCH_SCORE value)
Step 4.1. White or grey zone
White or grey zone indicates that divorce act possibly relates to person, therefore it can be stated that person changed its surname after divorce and it can be used by doctor as supporting information for update person process.
Create verification candidate between divorce act and person in MPI db,
person_verification_candidates
table, set values:id = autogenerate uuid
person_id = id of a person from
mpi.persons
entity_id = id of divorce act from
mimir.dracs_divorce_acts
entity_type = ‘dracs_divorce_act’
status = ‘NEW’
config = variables that were used in comparison process (p.3 of Step 3)
details = additional details of comparison process
score = logistic regression comparison score
inserted_at = now()
updated_at = now()
if active verification candidate between person_id and dracs_divorce_act_id already exists in person_verification_candidates table in status = ‘NEW’ - skip this pair
Update persons verification status in
person_verifications
table based on current dracs name change verification status of person:in case dracs_name_change_verification_status = ‘VERIFICATION_NOT_NEEDED’ or ‘VERIFIED’ - update persons dracs name change verification status:
set dracs_name_change_verification_status = ‘VERIFICATION_NEEDED’
set dracs_name_change_verification_reason = ‘AUTO_OFFLINE’
set updated_at = now()
set updated_by = system_user()
Step 4.2. Black zone
Black zone indicates that divorce act does not relate to person, therefore it can be stated that person highly likely did not change its last name.
No further actions should be taken with person or verification candidates.
Step 5. Processed divorce act
When divorce act is fully compares with all candidates from list, change its status in
mimir.dracs_divorce_acts
tableset persons_compare_status = ‘PROCESSED’
Go to next divorce act in obtained list (return to Step 2)
ЕСОЗ - публічна документація