# Visualization of a directed network with focus on its hierarchy and circularity

- 310 Downloads

## Abstract

The spring-electric model is a useful tool to visualize a large-scale complex network. However, information on the flow of directed network may not be properly reflected because links are basically treated as undirected. Here, we propose a new visualization method with an explicit account of network flow structure information by combining Helmholtz–Hodge decomposition and the spring-electric model. We then demonstrate its effectiveness by adopting actual Japanese production flow network as a test ground. The Helmholtz–Hodge decomposition enables us to break down flow on a directed network into two flow components: potential flow and circular flow. The potential flow between a pair of nodes is given by difference of their potentials, and hence, the potential of a node shows its hierarchical position in a network. On the other hand, the circular flow component illuminates feedback loops built in a network. We also identify dominant clusters of firms forming feedback loops by applying a flow-based community detection method to the extracted circular flow network. We find that both hierarchical and loop structures coexist within the major industries such as construction, manufacturing, and wholesales.

## Keywords

Visualization Directed graph Helmholtz–Hodge decomposition Community detection Puroduction network## Introduction

It is difficult to analyze all connections in complex networks, because of their multiplicity and complexity. Visualization, which is gaining popularity owing to the recent development of graphics technology, is a useful tool to illuminate structural properties of networks. Appropriate depiction of a complex network greatly helps in grasping its intricate structures by providing an intuitive understanding. Various algorithms have been developed to visualize networks. A spring-electric model is adopted in which pairs of nodes with direct relations are physically connected by springs, and nodes in any pair repel each other through a repulsive Coulomb force [6]. The attractive force of the spring keeps intimate nodes close in space. On the other hand, the repulsive Coulomb force tends to distribute firms uniformly over the available space and prevents entanglement of the network. Although the spring-electric model is a useful method of visualization, information on the flow of the network can not be reflected, since the links are basically treated as undirected. A visualization method considering the direction of the link has also been proposed [5]. In this study, we propose a visualization method considering the information of the hierarchy and circular structure of the network, and report the results applied to actual Japanese production network. The proposed method first decomposes the network into a potential flow network and a loop flow network by Helmholtz–Hodge decomposition. This decomposition determines the Helmholtz–Hodge potential corresponding to the hierarchical position in the network for each node of the network. Layout of visualization is determined by the spring-electric model with adding constraints of coordinate corresponding to this Helmholtz–Hodge potentials. In the previous research [6], determining the layout corresponds to extracting the hierarchical structure, but in the proposed method, first of all, a hierarchical structure is uniquely obtained by Helmholtz–Hodge decomposition and then a layout according to it is determined.

Very recently, we have studied [3] structure of a Japanese production network with one million firms and five million supplier–customer links. We first constructed a directed network from the actual data of interfirm transaction relations and found that they form a tightly knit structure with a giant strongly connected component surrounded by two half-shells constituting incoming-flow and outgoing-flow components for the core. The objective of this study is to advance the previous empirical analysis [3] on the industrial flow structure embedded in microscopic supplier–buyer relations with a special emphasis on its hierarchy and circularity. Hierarchy of the production network is expected to emerge from self-organization of supply chain in the industrial system. We also note that inner loops of production, giving rise to a nonlinear feedback mechanism to complicate dynamics of the industrial system, can be an engine for economic growth.

## Methods

### Helmholtz–Hodge decomposition

*i*to node

*j*as

*i*to node

*j*which is given by

*i*and \(w_{ij}\) is a positive weight for linkage between nodes

*i*and

*j*. In the potential flow network, nodes are perfectly ranked; the potential flow thereby runs from a node with higher potential to a node with lower potential. On the other hand, the second term \(F^\mathrm{{(c)}}_{ij}\) denotes the circular flow component in which incoming flow and outgoing flow are exactly balanced at each node:

### Flow-based community detection

*L*(

*C*) measures the per step average description length of dynamics of a random walker migrating through links between nodes of a network with a given node partition \(C=\{C_{1},\ldots ,C_{\ell }\}\) and consists of two parts. The first term arises from movements of the random walker across communities, where \(q_\curvearrowright\) is the probability that the random walker switches communities and

*H*(

*C*) is the average description length of the community index codewords given by the Shannon entropy. The second term arises from movements of the random walker within communities, where \(p^i_\circlearrowright\) is the fraction of the movements within community \(C_{i}\) and \(H(\mathcal {P}^i)\) is the entropy of codewords in module codebook

*i*.

If the network has densely connected parts in which a random walker stays long time, one can compress the description length of the random walk dynamics on a network by a two-level codebook for nodes adapted to such a community structure, an analogy to geographical maps in which different cities recycle the same street names such as main street. Therefore, obtaining the best community decomposition in the map equation framework amounts to searching for the node partition that minimizes the average description length *L*(*C*). The code of the map equation algorithm is available at http://www.mapequation.org.

### Visualization based on a spring-electric model

A spring-electric model is adopted in which pairs of nodes with direct relations are physically connected by springs, and nodes in any pair repel each other through a repulsive Coulomb force. The attractive force of the spring keeps intimate nodes close in space. On the other hand, the repulsive Coulomb force tends to distribute firms uniformly over the available space and prevents entanglement of a network. We then take full advantage of a molecular dynamics (MD) method [1, 4] for an optimized configuration of nodes in the model. The ground state in the model is a leading candidate for this configuration. The MD simulation works well to reproduce an ordered structure, with the lowest-energy forms such as crystals of materials generated through slow cooling, starting from any initial configuration. We expect that the simulation is also successful in visualizing the network.

*l*and

*m*for the model is explicitly written as

*l*. Here we neglect the direction of links (flow of goods or money) and assume that \(q_l\) and \(k_{lm}\) take on identical values for every node and pair, respectively.

## Results and discussion

The present analysis is based on a big data of 4,974,802 transaction relations between 1,066,037 firms in Japan which was collected by the Tokyo Shoko Research, Ltd. in 2016.^{1} These data virtually cover whole industrial activities in Japan. We regard firms as nodes and transaction relations between them as directed links spanning from suppliers to customers to construct the latest production network in Japan. Since information on the volume of each transaction is not available, we assume that all the links have the same weight.

To elucidate flow structure in the TSR transaction network, we begin with a bow-tie decomposition of the network as has been widely used to understand the structure of various complex networks including the world wide web and metabolic networks. The decomposition classifies nodes in a directed network according to the way in which they are mutually connected: IN component, GSCC (Giant Strongly Connected Component), OUT component, and others. The GSCC is the largest group of nodes in which any pairs of nodes are connected bidirectionally. The IN component is a collection of nodes which have a path to the GSCC, but no reverse path to come back. The OUT component is defined in the other way around, that is, a collection of nodes which are reachable only from the GSCC. The TSR transaction network is decomposed into 219,927 IN components, 530,174 GSCC components, 278,880 OUT components and 37,056 Others.

*z*direction according to their values of the Helmholtz–Hodge potential; basically, transaction flows from top to bottom. On the other hand, the

*x*and

*y*coordinates of nodes are determined by minimizing the potential energy in a spring-electric model.

One can characterize these 10 communities by industrial and regional affiliations of their constituent firms. They are divided into two contrastive groups. The first, second, fourth and fifth largest communities are featured by manufacturing and wholesales industries; medical, health care & welfare industry is additionally important for the fourth community. On the other hand, the remaining 6 communities are featured by construction industry. Also, all the major communities have prominent regional characteristics. The manufacturing and wholesales dominant communities are basically metropolitan communities except for the second largest community in which Hokkaido and some provincial prefectures play a key role. In contrast, distribution of the regional affiliations in the construction dominant communities are well localized at prefecture level.

The communities 1, 2, 4, and 5 are all communities in which manufacture and wholesales dominate, but one can extract difference among them by using more detailed classification. The community 1 includes many manufacture and wholesales of textile and apparel. The community 2 includes fisheries cooperative, wholesales and retail trade of seafood, manufacture of food of seafood. The community 4 includes Medical and health services, manufacture and wholesales of pharmaceutical products. Most of medical and health services are general hospitals and clinics. The community 5 includes many manufacture and wholesales of metal products and construction. In this way one can characterizes manufacture and wholesales communities by product which well related community. Although the community is an extracted dense part of loop flow network, it includes manufacture, wholesales and retail trade and form hierarchical structure of product generally called supply chain. The communities 3, 6, 7, 8, and 9 are communities in which construction dominates. Although industry distribution of the construction communities similar to each other, it can be characterized by its locality. In addition, firms contributing to the flow from downstream to upstream are not the main industries of the supply chain, but are complementary industries (road freight transport, equipment installation work, etc.).

From these results, we found that within the community, there is a hierarchical supply chain consisting of main industry of the community and a circular structure mainly consisting of industries other than the main industry.

## Conclusion

The comprehensive dataset of interfirm transaction relations in Japan enabled us to study the industrial flow structure of the nations production network with a sound microscopic foundation. Especially, we emphasized its hierarchy and circularity. By adopting the Helmholtz–Hodge decomposition, we separated the flow structure of the GSCC of the transaction network into two components: potential flow and circular flow. The potential flow between a pair of firms is given by the difference of their potentials, and hence, the potential of a firm identifies its hierarchical position in the transaction network. On the other hand, the circular flow component illuminates feedback loops built in the network. The layout was calculated and visualized by the spring-electric model with the constraint condition corresponding to the Helmholtz–Hodge potential. We also identified dominant clusters of firms forming feedback loops by applying the map equation method to the extracted circular flow network. We found that both hierarchical and loop structure coexist within the major industries such as construction, manufacturing, and wholesales.

## Footnotes

- 1.
This is the largest connected component in the network obtained from the original data, containing 99.3% of all active firms listed in the data.

## Notes

### Acknowledgements

This study has been conducted as a part of the project “Large-scale Simulation and Analysis of Economic Network for Macro Prudential Policy” undertaken at Research Institute of Economy, Trade and Industry (RIETI). This research was also supported by MEXT as Exploratory Challenges on Post-K computer (Studies of Multi-level Spatiotemporal Simulation of Socioeconomic Phenomena).

## References

- 1.Allen, M. P. P. & Tildesley, D. J. (1987).
*Computer simulation of liquids*(vol. 82, pp. 5057–5061).Google Scholar - 2.Bhatia, H., Norgard, G., Pascucci, V., & Bremer, P. T. (2013). The Helmholtz–Hodge decompositiona survey.
*IEEE Transactions on visualization and computer graphics*,*19*(8), 1386–1404.CrossRefGoogle Scholar - 3.Chakraborty, A., Kichikawa, Y., Iino, T., Iyetomi, H., Inoue, H., Fujiwara, Y., & Aoyama, H. (2018). Hierarchical communities in walnut structure of Japanese production network, available at SSRN: https://ssrn.com/abstract=3129974 or https://doi.org/10.2139/ssrn.3129974.
- 4.Frenkel, D., & Smit, B. (2002).
*Understanding molecular simulations: from algorithms to applications*. Academic press.Google Scholar - 5.Fujita, Y., Fujiwara, Y., & Souma, W. (2016). Large directed-graph layout and its application to a million-firms economic network.
*Evolutionary and Institutional Economics Review*,*13*(2), 397–408.CrossRefGoogle Scholar - 6.Hu, Y. (2005). Efficient, high-quality force-directed graph drawing.
*Mathematica Journal*,*10*(1), 37–71.Google Scholar - 7.Jiang, X., Lim, L. H., Yao, Y., & Ye, Y. (2011). Statistical ranking and combinatorial Hodge theory.
*Mathematical Programming*,*127*(1), 203–244.CrossRefGoogle Scholar - 8.Lancichinetti, A., & Fortunato, S. (2009). Community detection algorithms: A comparative analysis.
*Physical Review E*,*80*(5), 056117.CrossRefGoogle Scholar - 9.Rosvall, M., & Bergstrom, C. T. (2008). Maps of random walks on complex networks reveal community structure.
*Proceedings of the National Academy of Sciences*,*105*(4), 1118–1123.CrossRefGoogle Scholar