Since various systems with complex interactions can be abstractly represented as the network, in the past two decades, network science has developed rapidly and has been successfully applied in many fields such as biology, economy, and social sciences. Mining the general laws of different complex networks in various fields has always been the focus of network science.
The studies of universal laws in the network mainly include two aspects: network topology structure and dynamics. Because the functions of the network highly depend on its structure, the study about network structure becomes the basic and focused topic. In fact, the research on small-world networks and scale-free networks has promoted the development of network science by revealing the new structural characteristics of complex networks. For the characterization of network structure, compared with the microscopic approaches such as node degree and clustering coefficient and macroscopic approaches such as degree distribution and distance,
mesoscopic measurement can better unify the detailed and global rich information of the network and has been the focus of attention. As mesoscopic approaches of community structure, motif and K-shell, the existing P matrix of network is essentially a mesoscopic measurement of network structure, in addition to advantages of strong interpretability and relatively low time complexity, but it has not been systematically and thoroughly studied. Therefore, the main work of this paper focuses on mining the characteristics of network P matrix, and the development of P-matrix-based methods including quantization of the network structure dissimilarity as well as the characterization and visualization of directivity of network dissimilarity, and then carries out its application in the extraction of network skeleton. The main work and innovations of the paper are as follows:
1、This paper develops and enriches the analysis methods about mesoscopic network structure based on network P-matrix, and gives corresponding results on four typical network structure problems: node similarity, node clustering, network evolution and network classification. Specifically, we conduct from two aspects of nodes' P-vector and the P-vector of the entire network. Specifically, from the node level, we first design an approach of quantifying the similarity between network nodes by using the distribution of their nearest neighbors, and verify its effectiveness through eigenvector centrality and propagation ability. And then, a node-clustering algorithm based on the new similarity metric is proposed and the hierarchical structure is discovered in complex network. In addition, at the network level, a network-classification method integrating the network's P-vector and the difference between the nodes' P-vector is proposed and compared with the traditional feature-vector method. The two indicators of classification accuracy and time consumption show the new P-vector-based classification method has better effect. Various experimental results show the effectiveness of the new network analysis methods based on network P-matrix, and also illustrate the feasibility and effectiveness of the P-matrix on network structure characterization.
2、We construct and extend the P-matrix to weighted network, and further put forward the WD-measure to quantify the structural dissimilarity between weighted networks, which deals with the limitation that D-measure was only suitable for unweighted networks. Firstly, the weighted distance between each pair of nodes of the network is rescaled by the ratio between the average shortest path of the weighted network and that of its binary counterpart, to count the number of nodes in different order neighbor sets. As a result, the P-matrix only applicable to the unweighted is extended to the widely existing weighted network. In addition, we give a definition and a calculation to the complement graph and alpha centrality of weighted network respectively, so that the WD measurement is established to quantify the difference of two weighted networks. Finally, we conduct a variety of simulation experiments, mainly including comparison between complete graphs with four common weight distributions and not, and graphs with various density, comparison between neural networks, and comparison between several real world networks. Experiments on simulated networks and real networks show that the new metric can effectively capture the influence of weights on network structure, and quantitatively measure the dissimilarity of weighted networks. Moreover, it is suitable for the comparison between networks which have different sizes and are even disconnected.
3、Based on the research of network structure differentiation, the directivity problem in network distance is proposed, and the method of describing and visualizing the directivity of network structure differentiation is developed. Considering that the existing network distance only focuses on the size but ignores the direction, we set up the index and method to describe the directivity of structural differences, and integrate the discussion of direction into the design of an approximate layout algorithm for multiple networks on a two-dimensional plane. Constructing a random network as the reference network, the algorithm realizes the geometric positioning combining the distance size and direction between each network and random network in two-dimensional space, and gives a description that integrates more detailed structure information of the network. In addition, after obtaining the layout result, we bring in the K-means clustering algorithm and propose a network-clustering method. Combining the locations and clustering results of multiple real networks can help analyze the similar structural patterns between them, so as to explore the hidden structural information in the real network.
4、Based on the WD metric, we transform the skeleton extraction into an optimization problem of finding subnetworks with the minimum WD, and then propose a network skeleton extraction algorithm (DFSA) based on network structure differences. In order to verify the effectiveness of the new method and observe its characteristics, we conduct a variety of experiments on synthetic and real-world networks. We compare the DFSA with the classical global thresholding and the Disparity methods in some indicators including diameter, average link salience, node ratio and percolation threshold. In addition, we also observe the importance of retained nodes in subnetwork with fixed density. The results show that our method can retain the properties better, especially on distance-dependent attributes and network with stronger heterogeneity. Moreover, the DFSA method does not belittle weak nodes and edges that might play a vital role in connectivity,which ensures the skeleton can preserve the properties of the original network to a greater extent.