Web analytics

This course introduces two main parts of network analysis: web content analysis and link analysis. Drawing on practical business cases, the software tool Gephi, and the Python programming language, it explains and discusses the basic concepts, core measures, classical algorithms, and application scenarios of network analysis, including:

  • web content analysis methods, such as the vector space model, latent semantic indexing, and latent Dirichlet allocation;
  • link analysis methods, such as node-level measures, network-level measures, information retrieval algorithms (PageRank, HITS), community detection algorithms, and bipartite graph mining;
  • use of the Gephi software and the Python programming language to conduct network analysis.
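
As an illustration of the kind of link analysis listed above, the following is a minimal sketch of PageRank computed by power iteration in plain Python. It is not course material: the function name `pagerank`, the toy graph `toy_web`, the damping factor of 0.85, and the fixed iteration count are all assumptions chosen for the example.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Power-iteration PageRank over a dict mapping each page to its out-links.

    Illustrative sketch only; damping and iteration count are assumed defaults.
    """
    # Collect every page that appears as a source or as a link target.
    nodes = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}  # start from a uniform distribution

    for _ in range(iterations):
        # Rank held by pages without out-links is spread evenly over all pages.
        dangling = sum(rank[node] for node in nodes if not links.get(node))
        new_rank = {
            node: (1.0 - damping) / n + damping * dangling / n for node in nodes
        }
        # Each page passes an equal share of its rank to every page it links to.
        for source, targets in links.items():
            for target in targets:
                new_rank[target] += damping * rank[source] / len(targets)
        rank = new_rank
    return rank


# Hypothetical four-page web: A links to B and C, B to C, C to A, D to C.
toy_web = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": ["C"]}
scores = pagerank(toy_web)
for page in sorted(scores, key=scores.get, reverse=True):
    print(page, round(scores[page], 3))
```

In this toy graph, page C collects links from three other pages and ends up with the highest score, while page D, which no page links to, ends up with the lowest. The same graph could also be loaded into Gephi to compare the ranking visually.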