Batch 1
Fetch data from the production DB, clean it, and load it into the sandbox so that backend data queries can be tested against it.
Data Source: Production DB
Data Destination: Sandbox
Target: Organizations
Export Query:
MATCH (n:Organization) WHERE NOT(n.ein="" OR n.description="")
WITH n
MATCH (n)-[:LOCATED_IN]->(l:Location)
MATCH (n)-[:CALL_WITH]->(p:Phone)
MATCH (n)-[:CONTACT_AT]->(c:Contact)
RETURN n.deductibility,n.subsection,n.assetAmount,n.description,n.ein,
n.latest990,n.subOpCategory,n.deductibilityCode,n.affiliation,n.foundationStatus,
n.opCategory,n.id,n.accountingPeriod,n.email,n.nteeLetter,n.nteeType,
n.incomeAmount,n.nteeSuffix,n.filingRequirement,n.alternateName,
n.classification,n.url,n.rulingDate,n.nteeCode,n.groupName,n.name,n.tagline,
n.nteeClassification,n.exemptOrgStatus,n.exemptOrgStatusCode
Export Format: JSON
Total: 8547
Data Processing: The exported descriptions could not be parsed as JSON. We manually corrected the descriptions so that the export is valid JSON.
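The source does not say which characters broke the parsing. Assuming the cause was unescaped quotes, backslashes, or line breaks inside the description text, a minimal sketch of how such rows could be serialized and checked programmatically is shown here. The helper names and the sample row are hypothetical and are not taken from the repopulation script.

```javascript
// Hypothetical helper: serialize one exported row as a JSON line.
// JSON.stringify escapes quotes, backslashes, newlines and control
// characters, so the description never needs hand correction.
function serializeOrganization(row) {
  return JSON.stringify(row);
}

// Hypothetical check: report whether a line of the export parses as JSON.
function isValidJson(text) {
  try {
    JSON.parse(text);
    return true;
  } catch (err) {
    return false;
  }
}

// Made-up sample row; the EIN and description are illustrative only.
const row = {
  ein: "000000000",
  description: 'Serves "at-risk" youth\nand families',
};

// A hand-built line with unescaped inner quotes does not parse.
const brokenLine = '{"ein": "000000000", "description": "Serves "at-risk" youth"}';

// The serialized line parses and round-trips the original description.
const safeLine = serializeOrganization(row);
```

Running `isValidJson` over every line of an export before loading it would flag the rows that still need correction.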
Data repopulating script: https://github.com/PhilanthroLab/irs-uploader/blob/main/db_repopulation_batch1.js
Post data population activities:
Create fulltext index:
CALL db.index.fulltext.createNodeIndex("<indexName>",["NodeName"],["NodeProperty1"])
Test and validate the data to confirm that it does not break the codebase
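For the fulltext index step, the placeholders could be filled in from a script rather than typed by hand. The sketch below is one possible approach, not the project's actual code: the helper name, the index name organizationSearch, and the choice of the name and description properties are all assumptions. It passes the values as query parameters instead of concatenating them into the Cypher string. Note that db.index.fulltext.createNodeIndex is the procedure form used in Neo4j 3.5 and 4.x; later versions replace it with the CREATE FULLTEXT INDEX command.

```javascript
// Hypothetical helper: build the Cypher text and parameters for creating
// a fulltext index over the given node labels and properties.
function fulltextIndexStatement(indexName, labels, properties) {
  if (!indexName || labels.length === 0 || properties.length === 0) {
    throw new Error("indexName, labels and properties are all required");
  }
  return {
    text: "CALL db.index.fulltext.createNodeIndex($indexName, $labels, $properties)",
    parameters: { indexName, labels, properties },
  };
}

// Assumed values: the index name and the indexed properties are illustrative.
const statement = fulltextIndexStatement(
  "organizationSearch",
  ["Organization"],
  ["name", "description"]
);

// With the official neo4j-driver package, the statement could then be run as:
//   await session.run(statement.text, statement.parameters);
```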