Clement Levallois, Assistant Professor at EMLYON Business School.
Convert Excel and csv files into networks
This plugin helps transform your Excel files (or csv files) into a network, directly imported into Gephi. You can choose which entities, and which relations, form the network.
New since plugin version 1.4: you can create dynamic networks if your Excel file includes a column with a year or date indication.
1. Quick how-to
- Install the plugin in Gephi (File -> Tools -> Plugins -> Available).
- Restart Gephi
- In the Gephi menu, select File -> Import Spigot.
- Select “Data importer”
- Get your Excel file ready!
- Follow the steps of the wizard.
2. To go further…
The plugin lets you create a network made of two types of entities. Let’s say that your Excel file looks like this:
| Author | Book | Year | Sales of the book |
| David | My life | 1972 | 350 copies |
| Mary | My story | 1968 | 400 copies |
In the wizard you’ll get to a window where you’ll select “Author” and “Book” as the two types of entities you want to connect:
And the result in Gephi will be:
This looks fantastic, but in practice this type of network quickly gets messy: the many connections between the two types of entities make it hard to disentangle and to visualize.
-> For this reason, I’d suggest also trying the plugin on this type of file:
| Author | Book | Year | Sales of the book | Literary agent |
| David | My life | 1972 | 350 copies | Philip |
| Mary | My story | 1968 | 400 copies | Philip |
Here, selecting a network made of ‘Author’ and ‘Literary agent’ in the plugin gives a much more interesting network (because it is made of a single type of entity: persons).
So in the plugin you select:
And this leads to something more meaningful, in my opinion at least:
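To make the two selections above concrete, here is a minimal Python sketch of the general idea: each row of the file contributes one connection between the values found in the two chosen columns. This is only an illustration, not the plugin's actual code. The function names (`cell_text`, `build_edges`, `degrees`), the counting of repeated pairs as a weight, and the skipping of empty cells are all assumptions of mine.

```python
from collections import Counter


def cell_text(row, column):
    """Return the cell as stripped text, or "" when it is missing or empty."""
    value = row.get(column)
    return "" if value is None else str(value).strip()


def build_edges(rows, source_column, target_column):
    """Count how often each (source, target) pair appears on the same row."""
    edges = Counter()
    for row in rows:
        source = cell_text(row, source_column)
        target = cell_text(row, target_column)
        if source and target:  # assumption: rows with an empty entity are skipped
            edges[(source, target)] += 1
    return edges


def degrees(edges):
    """Number of distinct (source, target) pairs touching each node."""
    counts = Counter()
    for source, target in edges:
        counts[source] += 1
        counts[target] += 1
    return counts


# The second example table from the text, one dict per row.
rows = [
    {"Author": "David", "Book": "My life", "Year": "1972",
     "Sales of the book": "350 copies", "Literary agent": "Philip"},
    {"Author": "Mary", "Book": "My story", "Year": "1968",
     "Sales of the book": "400 copies", "Literary agent": "Philip"},
]

author_book = build_edges(rows, "Author", "Book")
author_agent = build_edges(rows, "Author", "Literary agent")
```

With these two rows, every node of the Author / Book network has a single connection, while in the Author / Literary agent network Philip is connected to both David and Mary.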
3. Dynamics: what are the correct formats for time?
You need a column that includes indications of time. Several possibilities:
1) Years or dates
Dates must be formatted as yyyy-mm-dd: 2012-06-27
Years are just yyyy: 1968
2) Durations or single values:
- Durations are just a start and an end time separated by a comma: 2012-06-27,2014-01-01
  Or for a duration in years: 1958, 1999
- Single values are just a year (1876) or just a date (1879-03-19).
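If you want to check a time column before importing it, the formats listed above can be sketched as a small Python validator. This is a hedged illustration and not the plugin's own parser: the function names `parse_point` and `parse_time_cell` are hypothetical, and representing a single value as a duration whose start equals its end is my assumption.

```python
import re
from datetime import date

YEAR = re.compile(r"\d{4}")            # yyyy, e.g. 1968
DAY = re.compile(r"\d{4}-\d{2}-\d{2}")  # yyyy-mm-dd, e.g. 2012-06-27


def parse_point(text):
    """Parse one year (returned as int) or one date (returned as datetime.date)."""
    text = text.strip()
    if YEAR.fullmatch(text):
        return int(text)
    if DAY.fullmatch(text):
        return date.fromisoformat(text)  # raises ValueError for impossible dates
    raise ValueError(f"not a year or a yyyy-mm-dd date: {text!r}")


def parse_time_cell(cell):
    """Return (start, end) for a single value or a 'start,end' duration."""
    parts = cell.split(",")
    if len(parts) == 1:
        point = parse_point(parts[0])
        return (point, point)  # assumption: a single value starts and ends at itself
    if len(parts) == 2:
        # Spaces around the comma are tolerated because parse_point strips them.
        return (parse_point(parts[0]), parse_point(parts[1]))
    raise ValueError(f"expected one value or 'start,end': {cell!r}")
```

For example, `parse_time_cell("1958, 1999")` gives a pair of years, while `parse_time_cell("27/06/2012")` raises a `ValueError` because the date is not written as yyyy-mm-dd.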
4. Issues and feature requests?
Please post your comments and questions on Github: https://github.com/seinecle/My-Plugins-for-Gephi/issues
5. Release history
Initial release (Nov. 2013).
Version 1.1 (Nov. 2013). Added error messages helping the user navigate the plugin.
Version 1.3 (March 2014). Bug fix: same names in different columns are now treated correctly. Bug report / analysis by Wouters A.H. Spekkink.
Version 1.4 (September 2014). Time dimension added: if your file includes a column with a time stamp, the nodes and edges will be created at this time. Durations are also possible. More info in the wizard included in the plugin.
Version 1.4.1 (September 2014). Bug fix: time intervals can now include spaces after the comma: “2013-05-13, 2013-05-14” (previously, only “2013-05-13,2013-05-14” would have worked). Bug report by Anne van Summeren.
Version 1.4.2 (September 2014). Bug fix: the names of entities to be connected had to be textual in Excel: “45” would not be a correct value. Version 1.4.2 corrects that and adds minor bug fixes. Bug report by Prem Sankar C.
Version 1.4.3 (November 2014). Bug fix: Null pointer exception when nodes have empty values. Bug report by Xin Xin of University of North Texas.