Statistics:Posted by Cristina Grigoruta — 21 Apr 2010 12:27
QUOTE:
Of course this idea is connected with UI development. Is it something that should be a part of my proposal?

Yes, add all your reflections to your proposal. Reviewing what other software offers could also be useful.
Statistics:Posted by mbastian — 02 Apr 2010 16:48
QUOTE:
Step 1. We should look for 78 in our PR-Tree because we are interested in attribute values from the upper bound of the time interval (tell me if I am mistaken).

I just read the article http://www.cmu.edu/joss/content/article ... McFarland/ and noticed some interesting sentences:

QUOTE:
[...] Thick slices have start and end times that define an interval, and include all events that either occur within the interval or whose duration intersect the interval (see figure 4c). Thick slices query the data similar to the way a questionnaire might ask about ongoing or past relationships. For example, "show me all the loans among firms that took place during 1994." It is not a picture of a network at a point in time but rather a period in time, a collection of data within a certain range, as in the bin of a histogram. Thick slices are a way of discretizing (pseudo-) continuous data, and thin slices are a way to sample continuous or real-valued versions of discrete-interval data.
A slice is a "bin" that contains a set of events in time, but for most network operations, the time information is not used and the network data must be collapsed to form a matrix or arc-list. Depending on the duration or "thickness" of the slice, it is possible that there is more than one arc between a given pair of nodes (or there may be multiple kinds of ties) so it is important to consider how these ties will be aggregated. For most variables, common operations like sum, maximum, count, and average can be used. (Non-numeric categorical attributes will need more sophisticated treatment in the future.)

This is something that should be done. All things considered, an Interval Tree is in fact necessary. I can implement it instead of the PR-Tree and, in the step where we get the intervals that overlap the interval we are looking for, I should do something with "ties". I think that for numeric attributes the user should be able to choose how to aggregate them (sum, maximum, count or average, as described in the article). For non-numeric attributes, the user should choose between something like the first occurrence, the last occurrence, etc.
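To make the thick-slice idea concrete, here is a minimal Python sketch of the two steps: select the events that overlap a query interval, then collapse parallel ties with a user-chosen operation. The event tuple layout and the function names are assumptions made for illustration, and the linear scan only stands in for the Interval Tree lookup; it is not how Gephi does it.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical event layout: (source, target, start, end, weight).

def thick_slice(events, lo, hi):
    """Return the events whose [start, end] interval intersects [lo, hi].

    A linear scan is used for brevity; an interval tree would answer the
    same overlap query without visiting every event.
    """
    return [e for e in events if e[2] <= hi and e[3] >= lo]

# The four aggregation choices named in the article excerpt.
AGGREGATORS = {"sum": sum, "maximum": max, "count": len, "average": mean}

def collapse(events, how="sum"):
    """Collapse parallel ties between the same node pair into one arc weight."""
    weights = defaultdict(list)
    for source, target, _start, _end, weight in events:
        weights[(source, target)].append(weight)
    aggregate = AGGREGATORS[how]
    return {pair: aggregate(values) for pair, values in weights.items()}

events = [
    ("a", "b", 1, 3, 2.0),
    ("a", "b", 2, 5, 4.0),
    ("b", "c", 6, 9, 1.0),
]
in_slice = thick_slice(events, 2, 4)   # both a->b events overlap [2, 4]
arcs = collapse(in_slice, "sum")       # one arc per node pair
```

Non-numeric attributes would need a different set of choices (first occurrence, last occurrence and so on), which this sketch does not cover.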
QUOTE:
The idea is to combine dynamic filtering and metrics. The aim is to compute metrics, like "Betweenness Centrality", for the time intervals. When topology is dynamic (nodes, edges with start/end dates), one can filter the graph and iterate over all intervals. Metrics framework would be extended to allow this.

So this is "only" querying the Dynamic API to get graphs for time intervals and computing metrics on those graphs. What about efficiency? Sometimes it could take a long time to compute the metrics of a single "snapshot"...
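The filter-then-iterate idea can be sketched roughly as follows in Python. Everything here is hypothetical: the edge tuple layout, the function names and the use of in-degree as a cheap placeholder for a real metric such as Betweenness Centrality. It is not the Dynamic API or the Metrics framework.

```python
def windows(start, end, width, step):
    """Yield (lo, hi) query intervals covering [start, end]."""
    lo = start
    while lo < end:
        yield lo, min(lo + width, end)
        lo += step

def snapshot(edges, lo, hi):
    """Keep edges whose lifetime overlaps [lo, hi]; None means unbounded."""
    kept = []
    for source, target, e_start, e_end in edges:
        starts_in_time = e_start is None or e_start <= hi
        ends_in_time = e_end is None or e_end >= lo
        if starts_in_time and ends_in_time:
            kept.append((source, target))
    return kept

def in_degree(edge_list):
    """Placeholder metric: number of incoming edges per node."""
    degree = {}
    for _source, target in edge_list:
        degree[target] = degree.get(target, 0) + 1
    return degree

def metric_over_time(edges, metric, start, end, width, step):
    """Compute the metric once per window, on the filtered snapshot."""
    return {
        (lo, hi): metric(snapshot(edges, lo, hi))
        for lo, hi in windows(start, end, width, step)
    }

# Hypothetical edges: (source, target, start, end) with integer timestamps.
edges = [("0", "1", 6, None), ("0", "2", 6, 9), ("2", "1", None, 9)]
result = metric_over_time(edges, in_degree, 0, 15, 5, 5)
```

Each window is recomputed from scratch here, which is exactly where the efficiency concern comes in: with an expensive metric, the cost grows with the number of windows.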
QUOTE:
Propose idea for dynamic visualization of attributes (color, size, label color, label size, edge weight).

It's a bit challenging, because it depends very much on the abstract meaning of the attributes. Taking this into consideration, I think the user should decide what kind of visualization is used for each attribute. For example:
Statistics:Posted by cezar_1 — 01 Apr 2010 14:44
QUOTE:
Does it mean that we should be able to query Metrics framework in order to get metrics for some time interval? Is it sufficient to add additional methods to its API?

The idea is to combine dynamic filtering and metrics. The aim is to compute metrics, like "Betweenness Centrality", for the time intervals. When the topology is dynamic (nodes and edges with start/end dates), one can filter the graph and iterate over all intervals. The Metrics framework would be extended to allow this.
Statistics:Posted by mbastian — 01 Apr 2010 13:06
CODE:
<?xml version="1.0" encoding="UTF-8"?>
<gexf xmlns="http://www.gephi.org/gexf/1.1draft"
      xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
      xsi:schemaLocation="http://www.gephi.org/gexf/1.1draft http://gephi.org/gexf/1.1draft.xsd"
      version="1.1">
  <meta lastmodifieddate="2009-03-20">
    <creator>Gexf.net</creator>
    <description>A Web network changing over time</description>
  </meta>
  <graph mode="dynamic" defaultedgetype="directed" start="2009-01-01" end="2009-03-20">
    <!-- Static node attributes: one value for the whole lifetime of the node -->
    <attributes class="node" mode="static">
      <attribute id="0" title="url" type="string"/>
      <attribute id="1" title="frog" type="boolean">
        <default>true</default>
      </attribute>
    </attributes>
    <!-- Dynamic node attributes: values carry their own start/end dates -->
    <attributes class="node" mode="dynamic">
      <attribute id="2" title="indegree" type="float"/>
    </attributes>
    <nodes>
      <node id="0" label="Gephi" start="2009-03-01">
        <attvalues>
          <attvalue for="0" value="http://gephi.org"/>
          <attvalue for="2" value="1"/>
        </attvalues>
      </node>
      <node id="1" label="Webatlas">
        <attvalues>
          <attvalue for="0" value="http://webatlas.fr"/>
          <attvalue for="2" value="1" end="2009-03-01"/>
          <attvalue for="2" value="2" start="2009-03-01" end="2009-03-10"/>
          <attvalue for="2" value="1" start="2009-03-10"/>
        </attvalues>
      </node>
      <node id="2" label="RTGI">
        <attvalues>
          <attvalue for="0" value="http://rtgi.fr"/>
          <attvalue for="2" value="0" end="2009-03-01"/>
          <attvalue for="2" value="1" start="2009-03-01"/>
        </attvalues>
        <!-- The node exists only during these slices -->
        <slices>
          <slice end="2009-03-01"/>
          <slice start="2009-03-05" end="2009-03-10"/>
        </slices>
      </node>
      <node id="3" label="BarabasiLab">
        <attvalues>
          <attvalue for="0" value="http://barabasilab.com"/>
          <attvalue for="1" value="false"/>
          <attvalue for="2" value="0" end="2009-03-01"/>
          <attvalue for="2" value="1" start="2009-03-01"/>
        </attvalues>
      </node>
    </nodes>
    <edges>
      <edge id="0" source="0" target="1" start="2009-03-01"/>
      <edge id="1" source="0" target="2" start="2009-03-01" end="2009-03-10"/>
      <edge id="2" source="1" target="0" start="2009-03-01"/>
      <edge id="3" source="2" target="1" end="2009-03-10"/>
      <edge id="4" source="0" target="3" start="2009-03-01"/>
    </edges>
  </graph>
</gexf>
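As an illustration of how the dynamic attvalue entries in such a file could be resolved for a given date, here is a small Python sketch using the standard xml.etree.ElementTree module. The function name values_at and the inclusive treatment of start/end are my assumptions, and the namespace is left out of the fragment to keep the example short.

```python
import xml.etree.ElementTree as ET

# Fragment of the "Webatlas" node from the GEXF example (namespace omitted).
NODE_XML = """
<node id="1" label="Webatlas">
  <attvalues>
    <attvalue for="0" value="http://webatlas.fr"/>
    <attvalue for="2" value="1" end="2009-03-01"/>
    <attvalue for="2" value="2" start="2009-03-01" end="2009-03-10"/>
    <attvalue for="2" value="1" start="2009-03-10"/>
  </attvalues>
</node>
"""

def values_at(node, attribute_id, date):
    """Return every value of the attribute whose time span contains the date.

    A missing start or end means the span is unbounded on that side.
    ISO dates (YYYY-MM-DD) compare correctly as plain strings.
    Bounds are treated as inclusive (an assumption), so a date that falls
    exactly on a boundary can match two spans.
    """
    matches = []
    for attvalue in node.iter("attvalue"):
        if attvalue.get("for") != attribute_id:
            continue
        start = attvalue.get("start")
        end = attvalue.get("end")
        if (start is None or start <= date) and (end is None or date <= end):
            matches.append(attvalue.get("value"))
    return matches

node = ET.fromstring(NODE_XML)
mid_march = values_at(node, "2", "2009-03-05")   # inside the second span
boundary = values_at(node, "2", "2009-03-01")    # shared by two spans
```

A date on a shared boundary returns more than one value, so the caller still has to decide how to combine them (first, last, sum, average and so on).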
QUOTE:
Adapt Metrics framework to use Dynamic API to propose dynamic versions of existing metrics.

Does it mean that we should be able to query the Metrics framework in order to get metrics for some time interval? Is it sufficient to add additional methods to its API?
Statistics:Posted by cezar_1 — 31 Mar 2010 20:59