Options: Input size: N = 32, C = 64, H = 256, W = 256 Output size: N = 32, K = 64, OH = 241, OW = 241 Filter size: K = 64, C = 64, R = 16, S = 16 Number of iterations: 10 Validation: off Initializing... done! Initializing Convolution... Calculating...(iter=0) 0.647481 sec Calculating...(iter=1) 0.556199 sec Calculating...(iter=2) 0.558513 sec Calculating...(iter=3) 0.557529 sec Calculating...(iter=4) 0.557189 sec Calculating...(iter=5) 0.558538 sec Calculating...(iter=6) 0.557174 sec Calculating...(iter=7) 0.561172 sec Calculating...(iter=8) 0.557139 sec Calculating...(iter=9) 0.591800 sec Avg. throughput: 6849.505823 GFLOPS