Desarquivo - dataset 04 grafo de ligações entre entidades Neo4j dataset04

Description

Grafo de ligações entre entidades e notícias (neste caso não foi preparado o comando com o neo4j-import mas aconselha-se esse face à opção LOAD CSV para datasets grandes) os dados são os mesmos do dataset 03 b mas, ao importar, são reorganizados de outra forma gerando um nó no grafo para cada notícia.

instruções de importação para neo4j:

USING PERIODIC COMMIT
LOAD CSV WITH HEADERS FROM 'file:///people.csv' AS row
MERGE (e:PER {_id: row._id, text: row.text});
USING PERIODIC COMMIT
LOAD CSV WITH HEADERS FROM 'file:///orgs.csv' AS row
MERGE (e:ORG {_id: row._id, text: row.text});
USING PERIODIC COMMIT
LOAD CSV WITH HEADERS FROM 'file:///locations.csv' AS row
MERGE (e:LOC {_id: row._id, text: row.text});
USING PERIODIC COMMIT
LOAD CSV WITH HEADERS FROM 'file:///misc.csv' AS row
MERGE (e:MISC {_id: row._id, text: row.text});



USING PERIODIC COMMIT
LOAD CSV WITH HEADERS FROM 'file:///news.csv' AS row
MERGE (n:NEWS {_id: row._id, title: row.title});



USING PERIODIC COMMIT
LOAD CSV WITH HEADERS FROM 'file:///connections_1.csv' AS row
MERGE (e1 {_id: row._id1})
MERGE (e2 {_id: row._id2})
WITH row, e1, e2
MERGE (e1)-[:rel{weight: toInteger(row.weight)}]-(e2);

Para mais informações ver: https://github.com/msramalho/desarquivo/blob/master/DATASETS.md

Author

This dataset has been published on the initiative and under the responsibility of Miguel Sozinho Ramalho.

Latest update

August 31, 2021

License

Creative Commons Attribution 4.0 - CC BY 4.0

Metadata quality
100.0/100

There are no discussions for this dataset yet.

There are no community resources for this dataset yet.

Share your resources Learn more about the community

Information

Temporality

Creation

August 31, 2021

Frequency

Punctual

Temporal coverage

2000 to 2019

Latest update

August 31, 2021

Spatial coverage

Territorial coverage granularity

Country

Actions

Embed

Statistics for the year

Reuses of this dataset

1

Followers

0