Changes between Version 39 and Version 40 of RTSsummaryEvents

Show
Ignore:
Timestamp:
04/06/12 23:05:45 (14 months ago)
Author:
MikolajKonarski (IP: 95.160.111.162)
Comment:

improve wording and add extra examples

Legend:

Unmodified
Added
Removed
Modified
  • RTSsummaryEvents

    v39 v40  
    77==== Calculate the total memory allocated on a per-capability basis ==== 
    88 
    9 In addition to the existing global method. For now we just do 
     9In addition to the existing global method, we calculate the total  
     10memory allocated on a per-capability basis. For now we do 
    1011it both ways and assert they give the same grand total. At some 
    1112stage we can simplify the global method to just take the sum of 
     
    1415==== Change the presentation of parallel GC work balance in +RTS -s ==== 
    1516 
    16 Also rename internal variables to make the names match what they hold. 
     17We change the presentation of parallel GC work balance in +RTS -s 
     18and also we rename internal variables to make the names match what they hold. 
    1719The parallel GC work balance is calculated using the total amount of 
    1820memory copied by all GC threads, and the maximum copied by any 
     
    3436{{{ 
    3537  Parallel GC work balance: 4.56% (serial 0%, perfect 100%) 
     38}}} 
     39 
     40Some more examples of the new form from nofib/parallel on a 2-core machine with -N2: 
     41 
     42{{{ 
     43ray:  Parallel GC work balance: 13.81% (serial 0%, perfect 100%) 
     44gray:  Parallel GC work balance: 28.25% (serial 0%, perfect 100%) 
     45prsa:  Parallel GC work balance: 29.16% (serial 0%, perfect 100%) 
     46coins:  Parallel GC work balance: 62.43% (serial 0%, perfect 100%) 
     47blackscholes:  Parallel GC work balance: 15.25% (serial 0%, perfect 100%) 
     48minimax:  Parallel GC work balance: 79.86% (serial 0%, perfect 100%) 
     49nbody:  Parallel GC work balance: 52.39% (serial 0%, perfect 100%) 
     50}}} 
     51 
     52and a couple examples from other parts of nofib: 
     53 
     54{{{ 
     55nofib/gc/circsim:  Parallel GC work balance: 22.35% (serial 0%, perfect 100%) 
     56nofib/gc/fibheaps:  Parallel GC work balance: 7.07% (serial 0%, perfect 100%) 
     57nofib/spectral/clausify:  Parallel GC work balance: 3.48% (serial 0%, perfect 100%) 
     58nofib/real/cacheprof:  Parallel GC work balance: 0.90% (serial 0%, perfect 100%) 
    3659}}} 
    3760 
     
    123146== The analysis of the semantics of +RTS -s == 
    124147 
    125 Here is a sample output of +RTS -s, annotated with a discussion of new events required to simulate it in ThreadScope (for a user-selected time interval). 
     148Here is a sample output of +RTS -s that was used in the early design phases for the corresponding events. 
     149The output is annotated with a discussion of new events required to simulate it in ThreadScope (for a user-selected time interval). 
    126150A list of the new required events is in the second part of this page. 
    127151Here is a [https://github.com/Mikolaj/ThreadScope/raw/working/SummaryPanelMockup.png screenshot] 
     
    269293I'd guess the elapsed time should always be higher than the CPU time, shouldn't it? 
    270294 
    271 == The list of considered new events (the implementation closely follows this draft, even though not all details match) == 
     295== The list of considered new events (the actual implementation closely follows this draft, even though not all details match) == 
    272296 
    273297There is a wealth of statistics around heaps and GC. Some of the stats are reasonably common, shared by different implementations while many more are highly specific to a particular implementation. Even if we ignore RTSs other than GHC, we still have an issue of flexibility for future changes in GHC's RTS.