Mesa now has double-precision (fp64) support, and clpeak on an HD 7790 shows it:
clpeak

Platform: Clover
  Device: AMD BONAIRE
    Driver version  : 10.6.0-devel (Linux x64)
    Compute units   : 14
    Clock frequency : 1050 MHz

    Global memory bandwidth (GBPS)
      float   : 55.14
      float2  : 56.52
      float4  : 54.39
      float8  : 38.98
      float16 : 24.86
 
    Single-precision compute (GFLOPS)
      float   : 1109.28
      float2  : 960.17
      float4  : 1109.53
      float8  : 1023.15
      float16 : 1075.14
 
    Double-precision compute (GFLOPS)
      double   : 113.89
      double2  : 113.82
      double4  : 113.68
      double8  : 113.42
      double16 : 112.92
 
    Integer compute (GIOPS)
      int   : 344.50
      int2  : 329.74
      int4  : 347.39
      int8  : 353.00
      int16 : 351.91
 
    Transfer bandwidth (GBPS)
      enqueueWriteBuffer         : 4.59
      enqueueReadBuffer          : 1.31
      enqueueMapBuffer(for read) : 8.45
        memcpy from mapped ptr   : 4.68
      enqueueUnmap(after write)  : 1429.37
        memcpy to mapped ptr     : 4.38
 
    Kernel launch latency : 473.19 us
https://github.com/krrishnarraj/clpeak
Plenty of ready-made results are already published there:
https://github.com/krrishnarraj/clpeak/tree/master/results
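
As a rough sanity check on the clpeak figures above, the measured numbers can be compared with a theoretical peak. This is only a sketch: the compute-unit count, clock and GFLOPS values are copied from the output, but the 64 lanes per compute unit, the 2 FLOPs per fused multiply-add and the 1/16 double-precision rate are assumptions about this GCN part that the post does not state.

```python
# Values copied from the clpeak output in this post.
compute_units = 14
clock_ghz = 1.050
measured_sp = 1109.53  # best single-precision result (float4), GFLOPS
measured_dp = 113.89   # best double-precision result (double), GFLOPS

# Assumptions, NOT taken from the post:
LANES_PER_CU = 64      # assumed: 4 SIMDs x 16 lanes per GCN compute unit
FLOPS_PER_FMA = 2      # assumed: one fused multiply-add counts as 2 FLOPs
DP_RATE = 1 / 16       # assumed: fp64 throughput relative to fp32 on this chip

# Theoretical peaks in GFLOPS under the assumptions above.
peak_sp = compute_units * LANES_PER_CU * FLOPS_PER_FMA * clock_ghz
peak_dp = peak_sp * DP_RATE

print(f"theoretical SP peak: {peak_sp:.1f} GFLOPS")
print(f"theoretical DP peak: {peak_dp:.1f} GFLOPS")
print(f"SP efficiency: {measured_sp / peak_sp:.1%}")
print(f"DP efficiency: {measured_dp / peak_dp:.1%}")
print(f"measured SP/DP ratio: {measured_sp / measured_dp:.2f}")
```

If those assumptions hold, the double-precision result is close to what the hardware can deliver (about 97% of the assumed peak), while single precision reaches only about 59%, which would explain why the measured SP/DP ratio is nearer 10 than 16.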




