Questions tagged [graphs]

For the nodes and links sense of graph; use the visualization tag for the charting sense.

226 questions
28
votes
8 answers

Visualizing a graph with a million vertices

What is the best tool to use to visualize (draw the vertices and edges) a graph with 1000000 vertices? There are about 50000 edges in the graph. And I can compute the location of individual vertices and edges. I am thinking about writing a program…
Cici
  • 453
  • 1
  • 4
  • 10
18
votes
5 answers

Anconda R version - How to upgrade to 4.0 and later

I use R through the anaconda navigator, which manages all my package installations. I need to use qgraph for a project, which is dependent on mnormt library, which in turn needs RStudio verion >4.0 I think the solution to my problem would be to…
Saranya Prakash
  • 193
  • 1
  • 1
  • 4
17
votes
1 answer

How to use Scikit-Learn Label Propagation on graph structured data?

As part of my research, I am interested in performing label propagation on a graph. I am especially interested in those two methods: Xiaojin Zhu and Zoubin Ghahramani. Learning from labeled and unlabeled data with label propagation. Technical…
15
votes
4 answers

What are graph embedding?

I recently came across graph embedding such as DeepWalk and LINE. However, I still do not have a clear idea as what is meant by graph embeddings and when to use it (applications)? Any suggestions are welcome!
Volka
  • 731
  • 3
  • 6
  • 21
14
votes
4 answers

Large Graphs: NetworkX distributed alternative

I have built some implementations using NetworkX(graph Python module) native algorithms in which I output some attributes which I use them for classification purposes. I want to scale it to a distributed environment. I have seen many approaches like…
20-roso
  • 708
  • 1
  • 5
  • 15
13
votes
1 answer

Neo4j vs OrientDB vs Titan

I am working on a data-science project related on social relationship mining and need to store data in some graph databases. Initially I chose Neo4j as the database. But it seams Neo4j doesn't scale well. The alternative I found out are Titan and…
Sreejithc321
  • 1,940
  • 3
  • 20
  • 34
10
votes
3 answers

Network analysis classic datasets

There are several classic datasets for machine learning classification/regression tasks. The most popular are: Iris Flower Data Set; Titanic Data Set; Motor Trend Cars; etc. But does anyone know similar datasets for networks analysis / graph…
sobach
  • 1,139
  • 5
  • 20
10
votes
1 answer

Libraries for (label propagation algorithms/frequent subgraph mining) for graphs in R

General description of the problem I have a graph where some vertices are labeled with a type with 3 or 4 possible values. For the other vertices, the type is unknown. My goal is to use the graph to predict the type for vertices that are…
Loiisso
  • 155
  • 8
9
votes
1 answer

What is the difference between affinity matrix eigenvectors and graph Laplacian eigenvectors in the context of spectral clustering?

In spectral clustering, it's standard practice to solve the eigenvector problem $$L v = \lambda v$$ where $L$ is the graph Laplacian, $v$ is the eigenvector related to eigenvalue $\lambda$. My question: why bother taking the graph Laplacian?…
felipeduque
  • 201
  • 1
  • 2
  • 5
9
votes
2 answers

Are there any graph embedding algorithms like this already?

I wrote an algorithm for generating node embeddings based on the graph's topology. Most of the explanation is done in the readme file and the examples. The question is: Am I reinventing the wheel? Does this approach have any practical advantages…
monomonedula
  • 201
  • 1
  • 2
9
votes
3 answers

In elbow curve how to find the point from where the curve starts to rise?

I am computing a distance metric on my data. The result is then being sorted in ascending order. The samples having distance more than a specific threshold are to be marked as outliers and will be discarded. Below is a plot of all distance…
Faiz Kidwai
  • 235
  • 1
  • 2
  • 12
8
votes
2 answers

Union grouping in bipartite graphs?

I'm trying to figure out a good (and fast) solution to the following problem: I have two models I'm working with, let's call them players and teams. A player can be on multiple teams and a team can have multiple players). I'm working on creating a…
Ian
  • 81
  • 1
8
votes
3 answers

Visualize graph network with more than 30k edges

I have a dataset with users connection and I want to create a directed network graph. Nodes: ~20.000 Edges: ~33.000 I used to work before on NodeXL and Gephi, but now I am on Mac and I don't have access on NodeXL. I tried Gephi and another tool…
Tasos
  • 3,960
  • 5
  • 25
  • 54
6
votes
3 answers

Partitioning Weighted Undirected Graph

I have an array of edges and weights: [['a', 'b', 4], ['a', 'c', 3], ['c', 'a', 2], ...] I have about 100,000 edges and weights are between 1 and 700, most around 100. I am thinking of using Markov Cluster Algorithm however wanted to reach out…
brianray
  • 211
  • 2
  • 4
6
votes
1 answer

Struggling to understand GCNNs (Graph Convolutional Neural Networks)

Although I've worked with CNN's for over a year, I am struggling to understand how GCNNs work paper on their simplification. I've read several papers, and I find myself out of my depth when they talk about Chebyshev polynomials or Fourier…
Jellicle
  • 163
  • 4
1
2 3
15 16