path: root/third_party/cub.BUILD
author: Akshay Modi <nareshmodi@google.com>, 2018-08-17 16:18:24 -0700
committer: TensorFlower Gardener <gardener@tensorflow.org>, 2018-08-17 16:25:56 -0700
commit: f1ad54b58b7ce2e08b5f4e38a1631dc667e3e7af
tree: 7bd201b89817a8b516e6adb19ac7e5eccefc74f4 /third_party/cub.BUILD
parent: fbdef63fe5849cde5423f8c3cc9c348ed4fe75c3
Add a benchmark for forward+backward for defuns.
Also fix some simple issues that I saw when I benchmarked it (goes from ~3500 examples/sec -> ~4000 examples/sec):
- (nest) Expose is_mapping check that caches to Python.
- (nest) Stop calling flatten when unnecessary in pack_sequence_as.
- (nest) Set some functions to their swig wrappers directly (instead of wrapping them in another function).
- Directly call the gen_math_ops call in _aggregate_grads to skip any unnecessary Python overhead.
- Stop falling back to slow path in _fast_fill.

PiperOrigin-RevId: 209223633
Diffstat (limited to 'third_party/cub.BUILD')
0 files changed, 0 insertions, 0 deletions