aboutsummaryrefslogtreecommitdiffhomepage
Commit message (Collapse)AuthorAge
* Re-rollback of "TensorFlow: move eigen some NN code from our ↵Gravatar Vijay Vasudevan2016-03-18
| | | | | | | third_party/eigen3 copy to being part of TF, add tests." Change: 117608627
* Build and visibility changesGravatar Martin Wicke2016-03-18
| | | | Change: 117608343
* histogram_ops refactor:Gravatar A. Unique TensorFlower2016-03-18
| | | | | | Make histogram_ops visible Make histogram_ops.histogram_fixed_width return histogram derived from current inputs only, rather than accumulate Change: 117602117
* Clarification on (and minor fix) for RunOptions/RunOutputs usage.Gravatar Zongheng Yang2016-03-18
| | | | Change: 117601377
* Convert RNN benchmarks to new Benchmark test & runner.Gravatar Eugene Brevdo2016-03-18
| | | | Change: 117599224
* Change Cuda pinned memory allocation to BFC allocator.Gravatar Xiaoqiang Zheng2016-03-18
| | | | | Move the GPU-neutral code to common_runtime. Change: 117591254
* Support ScratchAllocator in BLAS Batched GEMMGravatar A. Unique TensorFlower2016-03-18
| | | | Change: 117590857
* Minor simplification in Hinge prediction.Gravatar A. Unique TensorFlower2016-03-18
| | | | Change: 117590840
* Rollforward of "TensorFlow: move eigen some NN code from our ↵Gravatar Vijay Vasudevan2016-03-18
| | | | | | | third_party/eigen3 copy to being part of TF, add tests." Change: 117587217
* Add `tf.train.input_producer()` to the public API.Gravatar Derek Murray2016-03-18
| | | | | | This is a generalization of `tf.{range,slice,string}_input_producer()` that supports arbitrary types and shapes of input. Fixes #486. Change: 117583214
* Remove LongDelayedPollingTest from gpu_event_mgr_test.Gravatar A. Unique TensorFlower2016-03-18
| | | | | | | | | This test is flakey with any reasonable finite timeout, due to reliance on a separate thread to take action. In the unlikely event that the underlying mechanism breaks without breaking any of the remaining tests in this file, TF regression tests involving GPUs should break. Change: 117571507
* TensorFlow: update eigen to latest change to fix TensorChippingGravatar Vijay Vasudevan2016-03-18
| | | | Change: 117570343
* Fix control edge list in graph node info card.Gravatar James Wexler2016-03-18
| | | | Change: 117570066
* Fix whitespace error in benchmark.Gravatar Eugene Brevdo2016-03-18
| | | | Change: 117564348
* Exclude failing tests from ASAN testingGravatar Yaroslav Bulatov2016-03-18
| | | | Change: 117559940
* exclude failing test from ASAN testingGravatar Yaroslav Bulatov2016-03-18
| | | | Change: 117559932
* Fix graph display failure when a summary node is a control dependency.Gravatar James Wexler2016-03-18
| | | | Change: 117557479
* Add TensorFlow python Benchmark base class, registration mechanism, and test ↵Gravatar Eugene Brevdo2016-03-18
| | | | | | | runner. Outputs proto strings in a way similar to the reporter.cc in tensorflow/core/util/ Change: 117556944
* Extend allocator_test to ensure that AllocatorAttributes values are all storedGravatar A. Unique TensorFlower2016-03-18
| | | | | independently. Change: 117555238
* Add benchmark tests for depthwise conv forward gpu kernelsGravatar Jianmin Chen2016-03-18
| | | | | | | | | | | | Benchmark Time(ns) CPU(ns) Iterations BM_ConvFloatDepthwiseFwdGPU_conv0 4800416 4937895 141 32.7G items/s 32_112_112_3_8_24_3_3_1_2_gpu BM_ConvFloatDepthwiseFwdGPU_conv1 13550072 13922813 100 30.9G items/s 32_112_112_64_1_64_3_3_1_2_gpu BM_ConvFloatDepthwiseFwdGPU_conv2 7032385 7324553 100 29.4G items/s 32_56_56_128_1_128_3_3_1_2_gpu BM_ConvFloatDepthwiseFwdGPU_conv3 2285033 2425335 228 22.2G items/s 32_56_56_128_1_128_3_3_2_2_gpu BM_ConvFloatDepthwiseFwdGPU_conv4 1743948 1858093 359 29.0G items/s 32_28_28_128_1_128_3_3_1_2_gpu BM_ConvFloatDepthwiseFwdGPU_conv5 1784560 1897147 320 28.4G items/s 32_14_14_512_1_512_3_3_1_2_gpu BM_ConvFloatDepthwiseFwdGPU_conv6 971179 1044185 562 25.8G items/s 32_7_7_1024_1_1024_3_3_1_2_gpu Change: 117553964
* Make SummaryWriter take a `Graph` instead of `GraphDef` using the renamed ↵Gravatar Dan Smilkov2016-03-18
| | | | | | | | | `graph` argument. Keep the now deprecated `graph_def` argument for backward compatibility. This allows us to add information to the graph (such as tensor shapes and types) before serializing it to the events file, which results in the user automatically getting that information in the graph visualizer. Change: 117546499
* Pass through the rank, if known, of SparseTensors through tf.batch and friends.Gravatar Eugene Brevdo2016-03-18
| | | | Change: 117545997
* Test for distributed (grpc) runtime in OSS TensorFlowGravatar A. Unique TensorFlower2016-03-18
| | | | | | | | | | | | | | | | | | | | See README.md for detailed descriptions of the usage of the tools and tests in this changeset. Three modes of testing are supported: 1) Launch a local Kubernetes (k8s) cluster and run the test suites on it (See local_test.sh) 2) Launch a remote k8s cluster on Google Container Engine (GKE) and run the test suite on it (See remote_test.sh) 3) Run the test suite on an existing k8s TensorFlow cluster (Also see remote_test.sh) Take the remote test for example, the following steps are performed: 1) Builds a Docker image with gcloud and Kubernetes tools, and the latest TensorFlow pip installed (see Dockerfile) 2) Launches a Docker container based on the said image (see test_distributed.sh) 3) From within the image, authenticate the gcloud user (with credentials files mapped from outside the container), configer the k8s cluster and launch a new k8s container cluster for TensorFlow workers 4) Generate a k8s (yaml) config file and user this yaml file to create a TensorFlow worker cluster consisting of a certian number of parameter servers (ps) and workers. The workers are exposed as external services with public IPs (see dist_test.sh) 5) Run a simple softmax MNIST model on multiple workers, with the model weights and biases located on the ps nodes. Train the models in parallel and observe the final validation cross entropy (see dist_mnist_test.sh) Change: 117543657
* Creating a build target for kernels in contrib.Gravatar A. Unique TensorFlower2016-03-18
| | | | Change: 117523151
* Fix a static_cast overflow in WorkSharder Shard impl.Gravatar A. Unique TensorFlower2016-03-18
| | | | Change: 117520810
* Rollback of "TensorFlow: move eigen some NN code from our third_party/eigen3 ↵Gravatar Vijay Vasudevan2016-03-18
| | | | | | | copy to being part of TF, add tests." Change: 117519243
* Fix two potential asynchrony bounds-check bugs in transpose op.Gravatar David G. Andersen2016-03-18
| | | | Change: 117518926
* Refactoring common checking and size computation code into aGravatar David G. Andersen2016-03-18
| | | | | | | | | | | separate struct that is shared by all of the image resizers. Normalizes the error checking across all of the resizers. Also added a max size check to nearest_neighbor - because of the floats, it starts to produce bad results after 2^24px in either direction. Not that anyone does that, but it's good to be precise about it. Change: 117516271
* Add optional functionality to save StepStats proto to the filesystem in the ↵Gravatar Andrew Harp2016-03-18
| | | | | | | Android demo, for performance analysis. Enable by hardcoding kSaveStepStats to true or passing "--copt -DSAVE_STEP_STATS" to bazel build. Change: 117512949
* TensorFlow: move eigen some NN code from our third_party/eigen3 copyGravatar Vijay Vasudevan2016-03-18
| | | | | to being part of TF, add tests. Change: 117509710
* TensorFlow: update eigen to latest release that has a fix to too large frame.Gravatar Vijay Vasudevan2016-03-18
| | | | Change: 117506296
* Add fast path for identity transpose.Gravatar A. Unique TensorFlower2016-03-18
| | | | Change: 117505457
* Removed height:0px on tf-multi-checkbox because it was behaving poorly for ↵Gravatar A. Unique TensorFlower2016-03-18
| | | | | | embedded components. Change: 117504934
* Update losses in contrib.layers.Gravatar A. Unique TensorFlower2016-03-18
| | | | Change: 117504830
* Patching placer to print out a list of devices when a placement cannot be ↵Gravatar A. Unique TensorFlower2016-03-18
| | | | | | made to aid debugging. This instantly answers the question: did I specify it wrong or is the device not found? Change: 117493711
* Added basic support for float16 on CPUs and older GPUs.Gravatar Benoit Steiner2016-03-18
| | | | | Also fixed compilation issues with cuda devices that support the compute model 5.3 Change: 117493644
* GPU implementations of space_to_depth and depth_to_space.Gravatar A. Unique TensorFlower2016-03-18
| | | | Change: 117493386
* C++ gradients: Changes API to require specification of node outputs (also ↵Gravatar A. Unique TensorFlower2016-03-18
| | | | | | added some code cleanup). Change: 117488572
* Make EmbeddingWrapper accept an explicit embedding_size argument so it does ↵Gravatar Lukasz Kaiser2016-03-18
| | | | | | not rely on cell.input_size. Change: 117484994
* Clarifying comment and adding check in should_stop after hitting a confusing ↵Gravatar A. Unique TensorFlower2016-03-18
| | | | | | snag. Change: 117484454
* Adding a library for creating a stable categorical color scale for ↵Gravatar A. Unique TensorFlower2016-03-18
| | | | | | real-time, changing data. Change: 117483893
* Fix qualified name of SummaryWriter in htmlGravatar Geoffrey Irving2016-03-18
| | | | Change: 117483092
* Added check for 0 length input before it can get to Eigen.Gravatar Martin Wicke2016-03-18
| | | | Change: 117482953
* fix implicit int64 to int32 conversionGravatar Jianmin Chen2016-03-18
| | | | Change: 117475266
* fix doc of enqueue_manyGravatar A. Unique TensorFlower2016-03-18
| | | | Change: 117471008
* fix indices in image describing dynamic_stitchGravatar A. Unique TensorFlower2016-03-18
| | | | Change: 117456435
* Move Relu and SoftMax Op declarations into header files so they can be ↵Gravatar A. Unique TensorFlower2016-03-18
| | | | | | | | | registered by experimental devices. Right now tensorflow/core/kernels explicitly depends on all Eigen devices that might want to implement any of the templated Eigen Ops. This is because the template classes that need to be specialized are defined in .cc files, so the specializations themselves have to appear there too. Moving the classes to .h files allows us to use arbitrary Eigen devices defined outside of tensorflow/core, which fits better with the intent behind core/kernels. Over time more kernels may need to be refactored this way for the same reason. Change: 117452814
* Add padding between main, in-extract and out-extract graphs.Gravatar Dan Smilkov2016-03-18
| | | | Change: 117444098
* Typo controling -> controlling.Gravatar A. Unique TensorFlower2016-03-18
| | | | Change: 117420208
* Allowing users to optionally provide a global step tensor and name to SDCA's ↵Gravatar A. Unique TensorFlower2016-03-18
| | | | | | minimize method. Change: 117401811