| Commit message (Collapse) | Author | Age |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
* Change `reduce_logsumexp` to internally use `reshape` rather than `squeeze`
since the latter requires the `axis` arg to be a Python `list`.
PiperOrigin-RevId: 183396533
* Kernel utils to support broadcast add and mul.
PiperOrigin-RevId: 183397494
* Updating sparsify_gather.
PiperOrigin-RevId: 183402917
* [tf.data] Move slow-path-related code into the slow path in IteratorHandleOp::Compute().
This slightly reduces the amount of work performed when an iterator is accessed (after the first access), and potentially reduces contention if concurrent steps are accessing the same iterator.
PiperOrigin-RevId: 183406221
* Cleanup: Ran clang-format on all *.{cc,h} in under grappler.
PiperOrigin-RevId: 183406440
* Increase shard count of //third_party/tensorflow/python:nn_batchnorm_test to avoid timeouts
When run under asan, the test runs for about 5 minutes, and sometimes
longer, causing frequent timeouts.
This change increases the shard count of the test to 4, which brings the run time
of the longest running shard under asan to about 2 minutes.
PiperOrigin-RevId: 183414888
* Add available choices to toco flags and fix minor formatting issues.
PiperOrigin-RevId: 183415713
* Performance improvements to some GPU code to use shared locks instead of unique locks for some hotspot cases.
PiperOrigin-RevId: 183418559
* [XLA] Improve error message for bad slices.
PiperOrigin-RevId: 183420038
* Fix py3 build rules for all py tests under py2tf.
PiperOrigin-RevId: 183422144
* Fix bug with Operation._control_inputs setter.
PiperOrigin-RevId: 183422192
* Make softmax_op_test.py work with C API enabled.
PiperOrigin-RevId: 183422829
* Cleanup: Ran clang-format on all *.{cc,h} files in tensorflow/core/kernels.
PiperOrigin-RevId: 183423961
* Fix the documentation for the dense layer for how rank > 2 inputs are handled.
PiperOrigin-RevId: 183425868
* Cleanup: Ran clang-format on all *.{cc,h} in tensorflow/core/ops.
PiperOrigin-RevId: 183429339
|
|
|
|
| |
PiperOrigin-RevId: 158120864
|
|
|
|
| |
Change: 123900938
|
|
|
|
| |
Change: 115511794
|
|
|
|
|
|
| |
that are built with TensorFlow (protobuf), so prefix our macros with
TF_ to make them project specific.
Change: 113197186
|
|
|
|
|
| |
tensorflow/core/ files and build targets.
Change: 113075177
|
|
|
|
|
|
| |
allocated once in the OpKernelContext::Params struct, then re-used every time a new OpKernelContext uses the Params. Thus in the executor, as long as there is more work to do the PerOpGpuDevice is not freed.
Change: 112909215
|
|
|
|
| |
Change: 112523833
|
|
|
|
| |
Change: 112481326
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
well as a hash-table lookup per allocated output.
Instead, we now pre-compute the AllocatorAttributes for every output
tensor in the graph into an array (indexed by a base number per node +
output index), and changed OpKernelContext::Params to provide
a pointer to the base of the array for the node, rather than providing
a std::function<>.
Updated test code to avoid so much code duplication when setting up
the OpKernelContext::Params object in various places.
Used gtl::InlinedVector<...> instead of std::vector<...> in a few
places in tensorflow/core/kernels/reduction_ops_common.h
Didn't make a measurable change in overall performance but allocations and
time spent in the std::function destructor code was significantly reduced.
Change: 112103260
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Change 109945903
Make unsorted_segment_sum detect negative indices
Previously it crashed. This fixes #466.
Also improve the error message to say which index is problematic.
Change 109942557
Fix the conv_grad_input with stride 2.
+ We always call the Cudnn implementation even if we have an incompatible
padding.
Base CL: 109948577
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Changes:
* error message that refers to removed `DefaultSession` method.
* -Wnull-conversion warnings
* the "_start_time" attr for recvs when the flag "--brain_enable_scheduling_for_recvs" is set.
* typo in tutorial data download progress message.
* a typo ("however their installing"=>"however installing").
* typo, rename "TensorFlow Mechanics" to "How To" to be consistent with the website.
* a typo ("subtact"=>"subtract").
* protobuf examples in comments in tensorflow::Example.proto.
* formula formatting in MNIST beginner tutorial
* negative fraction-of-queue-full stats
* protobuf inclusion path so that Android demo will build under Blaze.
* small typo (moderatly > moderately)
* Session.run() to check that tensor arguments come from the session's graph.
* another six import
* seq2seq typo in bazel command
Base CL: 108349164
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
error handling, updates to website.
Changes:
- Removes redundant reshape from image models by @mrry
- Default TensorBoard to localhost by @danmane
- Reformatting of tensorflow/core by @josh11b
- Make tutorials backwards compatible to 0.5.0 by @girving
- Improve print documentation (md files not updated).
- Add proper scrolling to sitemap by @martinwicke
Base CL: 107956254
|
|
TensorFlow is an open source software library for numerical computation
using data flow graphs.
Base CL: 107276108
|