path: root/configure
Commit message (Author, Date)
* Merge changes from github. (Shanqing Cai, 2018-03-12)
  PiperOrigin-RevId: 188817194
* Merge changes from github. (Vijay Vasudevan, 2017-07-28)
  END_PUBLIC I dropped the following commit because it doesn't compile. I will follow up with Andrew to fix it or revert it. Commit 003deb88b authored by osdamv<osdamv@gmail.com> Committed by Vijay Vasudevan<vrv@google.com>: Refactor and implementation of the camera API 1, it fixes #8736 (#10771) List of commits in this CL: --- Commit 446450369 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Use identity of param variable in cudnn_rnn.RNNParamsSaveable instead of parameter variable directly. The RNNParamsSaveable is usually used in a graph which also has a saver for the cudnn param variable itself, if the same op is used for both, fails with a two savers for same op error. PiperOrigin-RevId: 163431826 --- Commit d629a8316 authored by RJ Ryan<rjryan@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Increase bound on tf.contrib.signal.inverse_stft gradient error to avoid flakiness on macOS. 
PiperOrigin-RevId: 163426631 --- Commit 253bcbb71 authored by Kay Zhu<kayzhu@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Use HloEvaluator for convolution in reference_util. Also Speed up HloEvaluator's HandleConvolution in non-opt build, by moving calls to HloInstruction::shape() out of the inner loop. PiperOrigin-RevId: 163416183 --- Commit 569a00e68 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Update API to traffic in unique_ptrs rather than owning raw pointers PiperOrigin-RevId: 163414320 --- Commit 31a77bc77 authored by Asim Shankar<ashankar@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Java: Update release to 1.3.0-rc1 PiperOrigin-RevId: 163413736 --- Commit 1ebbf4325 authored by Jonathan Hseu<vomjom@vomjom.net> Committed by GitHub<noreply@github.com>: Add missing grpc dependency (#11828) --- Commit 905abb1f9 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Test asserts should have `expected` first. PiperOrigin-RevId: 163409348 --- Commit d5cc143e2 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Increase timeout to deflake the test. PiperOrigin-RevId: 163407824 --- Commit ce1c7f02a authored by Eli Bendersky<eliben@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Properly include logging header in xla_internal_test_main PiperOrigin-RevId: 163405986 --- Commit 22241cd42 authored by joetoth<joetoth@gmail.com> Committed by Vijay Vasudevan<vrv@google.com>: External leveldb link changed (#11833) table_format.txt was renamed to table_format.md --- Commit 6b7314de4 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Consolidating the code to fill the partition's function library into one place. 
Previously, Partition() and MasterSession::RegisterPartition() both fills in the partitioned graph's function library. PiperOrigin-RevId: 163400992 --- Commit 28373cfe7 authored by Frank Chen<frankchn@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Adds preliminary support for Cloud TPUs with Cluster Resolvers. This aims to allow users to have a better experienec when specifying one or multiple Cloud TPUs for their training jobs by allowing users to use names rather than IP addresses. PiperOrigin-RevId: 163393443 --- Commit e5353c941 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Don't prune nodes that have reference inputs. PiperOrigin-RevId: 163390862 --- Commit 226510834 authored by Asim Shankar<ashankar@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: C API: Groundwork for experimenting with TF_Tensor in device memory. TF_Tensor objects are always backed by host memory. This commit lays the groundwork for allowing TF_Tensor objects to refer to tensor data on device (e.g., GPU) memory. PiperOrigin-RevId: 163388079 --- Commit 613bf1c7c authored by Yuefeng Zhou<yuefengz@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: fix asan test failure in SingleMachineTest::ReleaseMemoryAfterDestruction. PiperOrigin-RevId: 163386941 --- Commit 4653d37a3 authored by Eli Bendersky<eliben@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Change type to appease GPU builds. PiperOrigin-RevId: 163384927 --- Commit 9f131bd15 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Internal change PiperOrigin-RevId: 163378484 --- Commit 8bc0236c8 authored by A. 
Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: PiperOrigin-RevId: 163366493 --- Commit 3b97f1f9b authored by Yangzihao Wang<yangzihao@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Change to only run one round of matmul benchmark. PiperOrigin-RevId: 163364341 --- Commit a4a3a3335 authored by Yun Peng<pcloudy@google.com> Committed by Vijay Vasudevan<vrv@google.com>: Fix ./configure on Windows (#11775) * Fix ./configure on Windows * Disable bitwise_ops_test on Windows --- Commit ae3119d16 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Small changes to op framework. PiperOrigin-RevId: 163361071 --- Commit f40189d26 authored by qjivy<ji.qiu@spreadtrum.com> Committed by Vijay Vasudevan<vrv@google.com>: PR again: Enable building label_image with jpeg/gif/png decoder for Android. (#11475) * Enable building label_image with jpeg/gif/png decoder for Android. Add dependency "android_tesnorflow_image_op" to label_image, which is not overlapped with android_tensorflow_kernels. * Running buildifier to reformat the BUILD files for sanity check. --- Commit 599165861 authored by KB Sriram<kbsriram@gmail.com> Committed by Vijay Vasudevan<vrv@google.com>: Add the Constant operator class (#11559) Create a custom operator class to create constants in the Graph, and introduce the Operator marker annotation to identify operator classes. Please see #7149 for the master tracking issue. --- Commit 86ca3506f authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Further BUILD cleanup PiperOrigin-RevId: 163360750 --- Commit 376bb063b authored by Pete Warden<petewarden@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Look inside functions to see which node types are used. PiperOrigin-RevId: 163360375 --- Commit 2139e7d8b authored by A. 
Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [tf.contrib.data] map expects a nested structure. Fixes #11786 PiperOrigin-RevId: 163359134 --- Commit d09304fca authored by Jonathan Hseu<vomjom@vomjom.net> Committed by Vijay Vasudevan<vrv@google.com>: Upgrade gRPC (#11768) * BUILD rule modifications * More build fixes * Code changes * More code fixes * Working tests * CMake build * Fix pprof * Fix header includes * CMake fix test * Bazel clean * Fix verbs * More verbs fixes * bazel clean for XLA * Windows build fix test * Add openssl/rand.h * New cmake build command * --config Release --- Commit 3cd828474 authored by David Norman<DavidNorman@users.noreply.github.com> Committed by Vijay Vasudevan<vrv@google.com>: Fix error with default python path selection (#11814) * Fix error with default python path selection * Move setting of environment var outside if / else --- Commit ddd8e21b7 authored by Eli Bendersky<eliben@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Consolidate all similar main()s in tests into a single target. PiperOrigin-RevId: 163354724 --- Commit a36bca25b authored by Tayo Oguntebi<tayo@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Remove ShapeWithoutPadding() utility function, as it is no longer needed. PiperOrigin-RevId: 163353430 --- Commit b26f9cd44 authored by David Norman<DavidNorman@users.noreply.github.com> Committed by Vijay Vasudevan<vrv@google.com>: Ensure that the multi-instruction fuse can take shared inputs (#11748) * Ensure that the multi-instruction fuse can take shared inputs Note that the fuse action only works when the shared input / constant appears after all of its consumers in the list of instructions. * Add a comment describing the test --- Commit 34cbf161d authored by Jiri Simsa<jsimsa@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Update Dataset API documentation. 
PiperOrigin-RevId: 163349457 --- Commit 2381ce5c3 authored by Abdullah Alrasheed<a.rasheed@tc-sa.com> Committed by Vijay Vasudevan<vrv@google.com>: DOC: Fix typo. (#11813) you could could be I/O bottlenecked. TO: you could be I/O bottlenecked. --- Commit e4a5c5356 authored by Toby Boyd<tobyboyd@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: ["Variable", "VariableV2", "VarHandleOp"] is the default for ps_ops=None PiperOrigin-RevId: 163344629 --- Commit 722f6f361 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix TensorForest's saveable object names so loading a savedmodel works. PiperOrigin-RevId: 163332598 --- Commit cda80a785 authored by Eric Liu<ioeric@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [tpu profiler] Dump HLO graphs in profile responses to the log directory. PiperOrigin-RevId: 163318992 --- Commit cea9ef6f5 authored by horance<horance-liu@users.noreply.github.com> Committed by Vijay Vasudevan<vrv@google.com>: Refactoring device name utils (#11797) * remove duplicated code for full_name and legacy_name for DeviceNameUtils * replace tabs * Real->Device --- Commit 1f7c0f917 authored by Kongsea<kongsea@gmail.com> Committed by Vijay Vasudevan<vrv@google.com>: Refine docstrings (#11800) --- Commit dd1f0cddd authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Supports lookup devices by fullname either in the canonical form or the legacy form. This makes DeviceSet behaves the same as DeviceMgr's FindDevice method. PiperOrigin-RevId: 163300346 --- Commit 631a364cd authored by Kay Zhu<kayzhu@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Add Reduce, DynamicSlice and DynamicSliceUpdate to HloEvaluator. 
- Reduce is disabled explicitly for constant folding, as not all types of embedded computation can be currently supported by the evaluator. - Added support to evaluate HloModule to HloEvaluator. - Minor signature change to Evaluate(). PiperOrigin-RevId: 163299238 --- Commit a52470172 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Sets the incarnation number even when the attribute is set. PiperOrigin-RevId: 163299121 --- Commit a49fe0366 authored by Suharsh Sivakumar<suharshs@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Remove platform bridge for grpc_response_reader. PiperOrigin-RevId: 163295986 --- Commit 4404aa7cb authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Add TODO comment explaining why the IsScalar check exists. PiperOrigin-RevId: 163292777 --- Commit 43036ac16 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Remove unnecessary break statements. PiperOrigin-RevId: 163291947 --- Commit fd5de4690 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Add regression test for a corner case using Reduce that currently fails with the GPU backend. PiperOrigin-RevId: 163287986 --- Commit 32e198f2d authored by Chris Leary<leary@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [TF:XLA] Add tf.cross support. See #11788 PiperOrigin-RevId: 163287731 --- Commit 88abddbc3 authored by Alan Yee<alyee@ucsd.edu> Committed by Vijay Vasudevan<vrv@google.com>: Update README.md (#11793) Remove bad practices of sudo pip and install use safer pip install commands --- Commit 9b30dc3a8 authored by A. 
Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Remove final mentions of `get_shape` in docstring. PiperOrigin-RevId: 163282839 --- Commit 423c1eea0 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: BREAKING CHANGE: Fix semantic error in how maybe_batch* handles sparse tensors. PiperOrigin-RevId: 163276613 --- Commit 6028c071b authored by Justin Lebar<jlebar@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Highlight incoming/outgoing edges on hover in HLO graphviz dumps, and other improvements. Other improvements: - Don't show tooltips for nodes and clusters. Previously we'd show a tooltip containing a pointer value expressed as decimal. Not so useful. - Show tooltips on edges with the to/from node names. - Fix bug wherein if we had - a node at the "edge" of the graph (so its operands aren't included unless they're referenced by another node), - with all of its operands included in the graph save one or more constants, and - those constants weren't referenced by any nodes not at the edge of the graph, we would incorrectly draw the node as "grayed out", indicating that one of its operands (namely, its constant operand) wasn't present in the graph. This is wrong because constants are inlined into their users, so they should always count as "displayed" for the purposes of determining whether a node is grayed out. PiperOrigin-RevId: 163276108 --- Commit ce7a355bd authored by Joshua V. Dillon<jvdillon@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Update contrib/distributions/estimator_test build dependency. PiperOrigin-RevId: 163272464 --- Commit 1b8458a1c authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Shorten docstring line. 
PiperOrigin-RevId: 163269709 --- Commit 69e323cc6 authored by Asim Shankar<ashankar@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix comment ypo PiperOrigin-RevId: 163266376 --- Commit 08790e73d authored by Chris Leary<leary@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Fix a bug in cloning outfeeds, carried the wrong shape. PiperOrigin-RevId: 163265592 --- Commit 1bad826d6 authored by Yangzihao Wang<yangzihao@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Rollback of GPU kernel implementation of transpose for tensors with one small dimension. END_PUBLIC BEGIN_PUBLIC BEGIN_PUBLIC Automated g4 rollback of changelist 162525519 PiperOrigin-RevId: 163490703
* Convert configure to python. (A. Unique TensorFlower, 2017-07-25)
  PiperOrigin-RevId: 163114551
* Re-enable using local MKL directory. (Gunhan Gulsoy, 2017-07-21)
  Setting TF_MKL_ROOT environment variable (together with "--config=mkl") before build will enforce using a local copy of MKL during build. PiperOrigin-RevId: 162787130
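The MKL entry above describes a build-time switch: an environment variable plus a Bazel config flag. A minimal sketch of how a wrapper script might assemble that invocation is shown here. The helper name `mkl_build_invocation`, the example MKL path, and the default build target are illustrative assumptions, not taken from the commit; only `TF_MKL_ROOT` and `--config=mkl` come from the log.

```python
import os


def mkl_build_invocation(mkl_root,
                         target="//tensorflow/tools/pip_package:build_pip_package",
                         base_env=None):
    """Return (env, argv) for a Bazel build that should use a local MKL copy.

    `mkl_root` is the directory of the local MKL installation.
    `target` is an assumed example target, not something the commit specifies.
    """
    # Copy the environment so the caller's process environment is not modified.
    env = dict(os.environ if base_env is None else base_env)
    # The commit message says TF_MKL_ROOT must be set before the build starts.
    env["TF_MKL_ROOT"] = mkl_root
    # The commit message says the variable is used together with --config=mkl.
    argv = ["bazel", "build", "--config=mkl", target]
    return env, argv


if __name__ == "__main__":
    env, argv = mkl_build_invocation("/opt/intel/mkl")
    print("TF_MKL_ROOT=" + env["TF_MKL_ROOT"], " ".join(argv))
```

The pair returned here could be handed to `subprocess.run(argv, env=env)`; the sketch stops short of running Bazel so that it stays side-effect free.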
* Tidy up opensouce mkl build. (Gunhan Gulsoy, 2017-07-21)
  Closes #11212 PiperOrigin-RevId: 162713588
* Suppress errors during the check of Bazel version. (A. Unique TensorFlower, 2017-07-20)
  PiperOrigin-RevId: 162677739
* Update the version check function for bazel. (A. Unique TensorFlower, 2017-07-20)
  PiperOrigin-RevId: 162658391
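The log does not show what the Bazel version check looks like, so the following is only a generic sketch of one way such a check can be written. The function names `parse_version` and `is_at_least` are hypothetical, and this is not the code from the commit. The point it illustrates is that versions should be compared numerically, since a plain string comparison would rank "0.10.0" below "0.5.4".

```python
import re


def parse_version(text):
    """Turn a version string such as '0.5.2' into a tuple of three ints.

    Anything after a '-' (for example a packaging suffix) is ignored, and
    missing components are padded with zeros.
    """
    numbers = re.findall(r"\d+", text.split("-")[0])[:3]
    padded = [int(n) for n in numbers] + [0] * (3 - len(numbers))
    return tuple(padded)


def is_at_least(current, minimum):
    """Return True if `current` is the same as or newer than `minimum`."""
    # Tuples compare element by element, which gives numeric ordering.
    return parse_version(current) >= parse_version(minimum)


if __name__ == "__main__":
    print(is_at_least("0.10.0", "0.5.4"))
```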
* Merge changes from github. (Jonathan Hseu, 2017-07-19)
  END_PUBLIC --- Commit daa67ad17 authored by Jonathan Hseu<vomjom@vomjom.net> Committed by Frank Chen<frankchn@gmail.com>: Remove unittest import (#11596) --- Commit 491beb74c authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: BEGIN_PUBLIC Automated g4 rollback of changelist 162423171 PiperOrigin-RevId: 162541442
* Merge changes from github. (Shanqing Cai, 2017-07-10)
  END_PUBLIC --- Commit d0f53f77f authored by Penghao Cen<scorpiocph@gmail.com> Committed by Shanqing Cai<cais@google.com>: Minor fix typo (#11323) --- Commit 02fcf564e authored by Chris Song<sjhshy@gmail.com> Committed by Chris Song<sjhshy@gmail.com>: Fix misspells. --- Commit 764c9b6b4 authored by Louis Tiao<ltiao@users.noreply.github.com> Committed by GitHub<noreply@github.com>: Fixed typo in docstring --- Commit f8cd1283e authored by Shanqing Cai<cais@google.com> Committed by Shanqing Cai<cais@google.com>: Chaser --- Commit 01383b946 authored by Shanqing Cai<cais@google.com> Committed by Shanqing Cai<cais@google.com>: Adapt TensorFlowTestCase.setUp() to new reset_default_graph() semantics Avoid calling reset_default_graph() directly to prevent exceptions in cases where test methods error out from within nested graph contexts, which can leave _default_graph_stack non-empty in certain Python versions. --- Commit 0ffc37890 authored by Amit Patankar<amitpatankar@google.com> Committed by Amit Patankar<amitpatankar@google.com>: Removing second declaration of functions. --- Commit f9c9cacb0 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Refactor ElementalIrEmitter's slice index finding code into IrArray::Index::SourceIndexOfSlice(). PiperOrigin-RevId: 161140653 --- Commit ba297aec9 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Update ops-related pbtxt files. 
PiperOrigin-RevId: 161138258 --- Commit 68d666737 authored by Alexandre Passos<apassos@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fixes a reentrant lock issue with tensors using ndarray memory which uses tensor memory. PiperOrigin-RevId: 161137788 --- Commit a2ee8bca3 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add support for int8 x int8 -> int32 matrix multiplication via cublasGemmEx to stream_executor. PiperOrigin-RevId: 161137741 --- Commit 755fa7b50 authored by Mark Daoust<markdaoust@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Block generate_test, and docs generating from running in python3. - Doc generation is currently unsupported in python3 - These both end in errors in python 3.5.1+ PiperOrigin-RevId: 161137467 --- Commit 97cbcac45 authored by Peter Hawkins<phawkins@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [TF:XLA] Fix failure in functionalize_control_flow rewrite for Enter nodes that are unused. Make sure we ignore such nodes without producing an error. PiperOrigin-RevId: 161136545 --- Commit dabcb60bc authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Add reasonable error messages to Builder::Build for bad parameter numbers. PiperOrigin-RevId: 161136262 --- Commit 0cbd249e8 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add complex tensors support to `matrix_determinant`. PiperOrigin-RevId: 161132422 --- Commit 335f1f14d authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Extend static shape inference for SparseTensors with dense_shapes constructed using slicing. 
PiperOrigin-RevId: 161132391 --- Commit 53604916e authored by Jianwei Xie<xiejw@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fixed the missing labels test in TPUEstimator. PiperOrigin-RevId: 161131282 --- Commit 9f57dc8dd authored by Bruno Rosa<bruno.rosa@eldorado.org.br> Committed by Bruno Rosa<bruno.rosa@eldorado.org.br>: Use mcpu instead of march for ppc64le march is not support by gcc on ppc64le --- Commit 7d5c74a9c authored by Skye Wanderman-Milne<skyewm@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Move duplicate detection logic from Graph to FunctionLibraryDefinition Turns out this is more useful, since there are many function libraries that don't belong to a graph. This will be used in a future change. Note that this maintains the current behavior of Graph. In addition, updates FunctionDefsEqual() to handle unset attr entries (I ran into this when using this in said future change). PiperOrigin-RevId: 161126628 --- Commit 2caec3af1 authored by Shanqing Cai<cais@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Disable more timeseries py tests failing in OSS PIP GPU builds PiperOrigin-RevId: 161124799 --- Commit 0b5cce367 authored by Eugene Brevdo<ebrevdo@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Get TopK op working on GPU again. Extend using cub's radix sort. 1. Undo rollback of Andreas Kirsch's initial implementation. 2. Use cub segmented radix sort if Andreas' heap-based impl for large k and small num_cols (thresholds of k=100, n=1000 determined empirically). 3. Use cub segmented radix sort if k == num_cols (this case is always faster). 4. Added benchmarks. Benchmarks show that the GPU implementation is up to 3x slower for small k but can be 10x faster for large num_cols and k. 
Benchmarks: Benchmark: m_128_n_10_k_5_use_gpu_False wall_time: 0.000166 s Throughput: 0.0077 GB/s Benchmark: m_128_n_10_k_5_use_gpu_True wall_time: 0.000796 s Throughput: 0.00161 GB/s Benchmark: m_128_n_10_k_9_use_gpu_False wall_time: 0.00017 s Throughput: 0.00751 GB/s Benchmark: m_128_n_10_k_9_use_gpu_True wall_time: 0.000796 s Throughput: 0.00161 GB/s Benchmark: m_128_n_10_k_10_use_gpu_False wall_time: 0.00017 s Throughput: 0.00753 GB/s Benchmark: m_128_n_10_k_10_use_gpu_True wall_time: 0.000775 s Throughput: 0.00165 GB/s Benchmark: m_128_n_100_k_1_use_gpu_False wall_time: 0.000155 s Throughput: 0.0826 GB/s Benchmark: m_128_n_100_k_1_use_gpu_True wall_time: 0.000796 s Throughput: 0.0161 GB/s Benchmark: m_128_n_100_k_50_use_gpu_False wall_time: 0.000247 s Throughput: 0.0519 GB/s Benchmark: m_128_n_100_k_50_use_gpu_True wall_time: 0.0008 s Throughput: 0.016 GB/s Benchmark: m_128_n_100_k_99_use_gpu_False wall_time: 0.000261 s Throughput: 0.049 GB/s Benchmark: m_128_n_100_k_99_use_gpu_True wall_time: 0.000794 s Throughput: 0.0161 GB/s Benchmark: m_128_n_100_k_100_use_gpu_False wall_time: 0.000239 s Throughput: 0.0536 GB/s Benchmark: m_128_n_100_k_100_use_gpu_True wall_time: 0.000777 s Throughput: 0.0165 GB/s Benchmark: m_128_n_1000_k_1_use_gpu_False wall_time: 0.000324 s Throughput: 0.395 GB/s Benchmark: m_128_n_1000_k_1_use_gpu_True wall_time: 0.000916 s Throughput: 0.14 GB/s Benchmark: m_128_n_1000_k_10_use_gpu_False wall_time: 0.00042 s Throughput: 0.305 GB/s Benchmark: m_128_n_1000_k_10_use_gpu_True wall_time: 0.000902 s Throughput: 0.142 GB/s Benchmark: m_128_n_1000_k_500_use_gpu_False wall_time: 0.0011 s Throughput: 0.116 GB/s Benchmark: m_128_n_1000_k_500_use_gpu_True wall_time: 0.00097 s Throughput: 0.132 GB/s Benchmark: m_128_n_1000_k_990_use_gpu_False wall_time: 0.00133 s Throughput: 0.0962 GB/s Benchmark: m_128_n_1000_k_990_use_gpu_True wall_time: 0.000993 s Throughput: 0.129 GB/s Benchmark: m_128_n_1000_k_1000_use_gpu_False wall_time: 0.00102 s 
Throughput: 0.126 GB/s Benchmark: m_128_n_1000_k_1000_use_gpu_True wall_time: 0.000964 s Throughput: 0.133 GB/s Benchmark: m_128_n_10000_k_10_use_gpu_False wall_time: 0.002 s Throughput: 0.64 GB/s Benchmark: m_128_n_10000_k_10_use_gpu_True wall_time: 0.00288 s Throughput: 0.445 GB/s Benchmark: m_128_n_10000_k_100_use_gpu_False wall_time: 0.00233 s Throughput: 0.549 GB/s Benchmark: m_128_n_10000_k_100_use_gpu_True wall_time: 0.00325 s Throughput: 0.394 GB/s Benchmark: m_128_n_10000_k_5000_use_gpu_False wall_time: 0.0127 s Throughput: 0.101 GB/s Benchmark: m_128_n_10000_k_5000_use_gpu_True wall_time: 0.00381 s Throughput: 0.336 GB/s Benchmark: m_128_n_10000_k_9900_use_gpu_False wall_time: 0.015 s Throughput: 0.0853 GB/s Benchmark: m_128_n_10000_k_9900_use_gpu_True wall_time: 0.00438 s Throughput: 0.292 GB/s Benchmark: m_128_n_10000_k_10000_use_gpu_False wall_time: 0.0104 s Throughput: 0.123 GB/s Benchmark: m_128_n_10000_k_10000_use_gpu_True wall_time: 0.00427 s Throughput: 0.3 GB/s Benchmark: m_128_n_100000_k_100_use_gpu_False wall_time: 0.0148 s Throughput: 0.865 GB/s Benchmark: m_128_n_100000_k_100_use_gpu_True wall_time: 0.0262 s Throughput: 0.488 GB/s Benchmark: m_128_n_100000_k_1000_use_gpu_False wall_time: 0.0201 s Throughput: 0.636 GB/s Benchmark: m_128_n_100000_k_1000_use_gpu_True wall_time: 0.0263 s Throughput: 0.486 GB/s Benchmark: m_128_n_100000_k_50000_use_gpu_False wall_time: 0.214 s Throughput: 0.0599 GB/s Benchmark: m_128_n_100000_k_50000_use_gpu_True wall_time: 0.0322 s Throughput: 0.398 GB/s Benchmark: m_128_n_100000_k_99000_use_gpu_False wall_time: 0.262 s Throughput: 0.0489 GB/s Benchmark: m_128_n_100000_k_99000_use_gpu_True wall_time: 0.0377 s Throughput: 0.34 GB/s Benchmark: m_128_n_100000_k_100000_use_gpu_False wall_time: 0.118 s Throughput: 0.108 GB/s Benchmark: m_128_n_100000_k_100000_use_gpu_True wall_time: 0.0365 s Throughput: 0.351 GB/s END_PUBLIC BEGIN_PUBLIC BEGIN_PUBLIC Automated g4 rollback of changelist 157169178 PiperOrigin-RevId: 
161476569
* When GPU is configured, selection of config to set depends on value of TF_CUDA_CLANG (--config=cuda / --config=cuda_clang) (A. Unique TensorFlower, 2017-06-28)
  PiperOrigin-RevId: 160433337
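The entry above says the Bazel config chosen for GPU builds depends on `TF_CUDA_CLANG`. A minimal sketch of that selection follows. The function name `select_gpu_config` is hypothetical, and treating the value "1" as "enabled" is an assumption made for illustration; the log only names the variable and the two config flags.

```python
def select_gpu_config(environ):
    """Pick the Bazel config flag for a GPU build from an environment mapping.

    Assumption: TF_CUDA_CLANG == "1" means clang is the CUDA compiler;
    any other value, or an unset variable, falls back to the default config.
    """
    use_clang = environ.get("TF_CUDA_CLANG", "0") == "1"
    return "--config=cuda_clang" if use_clang else "--config=cuda"


if __name__ == "__main__":
    import os
    print(select_gpu_config(os.environ))
```

Passing the environment in as a mapping, rather than reading `os.environ` inside the function, keeps the selection easy to test.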
* Merge changes from github. (A. Unique TensorFlower, 2017-06-27)
  PiperOrigin-RevId: 160344052
* Automated g4 rollback of changelist 160182040 (A. Unique TensorFlower, 2017-06-26)
  PiperOrigin-RevId: 160190881
* Merge changes from github. (A. Unique TensorFlower, 2017-06-26)
  END_PUBLIC Note: this CL will break builds. cl/159887762 to follow to fix all the breakages. --- Commit 2336cdf7f authored by Maxwell Paul Brickner<mbrickn@users.noreply.github.com> Committed by gunan<gunan@google.com>: Updated link to use HTTPS (#10998) Howdy! I just updated a link to use https instead of http. Thanks! --- Commit ad0892df1 authored by Luke Iwanski<luke@codeplay.com> Committed by Luke Iwanski<luke@codeplay.com>: [OpenCL] Fixes run_metadata_test for SYCL This test is designed to test CUDA specific behavior --- Commit 6b37a0725 authored by Todd Wang<toddwang@gmail.com> Committed by GitHub<noreply@github.com>: Update comments --- Commit 1699d904a authored by John Lawson<john@codeplay.com> Committed by Luke Iwanski<luke@codeplay.com>: [OpenCL] Fixes CUDA specific test run on SYCL (#56) The testBadParentValuesOnGPU should only be run on CUDA devices, as the test checks for particular CUDA behaviour. We don't actually provide a SYCL kernel for GatherTree and so it's not a problem that the tests don't target SYCL. --- Commit 3c1946230 authored by myPrecious<Moriadry@users.noreply.github.com> Committed by Shanqing Cai<cais@google.com>: Java API to get the size of specified input list of operations. (#10865) * Java API to get the size of specified input list of operations * remove unnecessary explain to avoid bring a new term to users. 
--- Commit e911c7480 authored by Luke Iwanski<luke@codeplay.com> Committed by Luke Iwanski<luke@codeplay.com>: [OpenCL] REGISTER -> REGISTER6 --- Commit fbf6c4cec authored by superryanguo<superryanguo@gmail.com> Committed by superryanguo<superryanguo@gmail.com>: Simplify the Quickstart section with the weblink is better --- Commit 72e2918cc authored by Taehoon Lee<taehoonlee@snu.ac.kr> Committed by Taehoon Lee<taehoonlee@snu.ac.kr>: Fix typos --- Commit 90c4406b7 authored by Rishabh Patel<patelrishabh@users.noreply.github.com> Committed by GitHub<noreply@github.com>: Correct the learning rate as per the code snippet --- Commit 03da61134 authored by Todd Wang<toddwang@gmail.com> Committed by GitHub<noreply@github.com>: Update ir_array.cc --- Commit 2df6cd3ac authored by Todd Wang<toddwang@gmail.com> Committed by GitHub<noreply@github.com>: Another try --- Commit af0cbace1 authored by Luke Iwanski<luke@codeplay.com> Committed by Benoit Steiner<benoitsteiner@users.noreply.github.com>: [OpenCL] Transpose to go through Eigen (#10321) --- Commit fc7361081 authored by Luke Iwanski<luke@codeplay.com> Committed by Benoit Steiner<benoitsteiner@users.noreply.github.com>: [OpenCL] Registers RGBToHSV and HSVToRGB (#91) (#10848) * [OpenCL] Added RGBToHSV and HSVToRGB * Aligning '\' --- Commit 832894ef8 authored by Luke Iwanski<luke@codeplay.com> Committed by Benoit Steiner<benoitsteiner@users.noreply.github.com>: [OpenCL] Registers AdjustContrastv2 (#10949) * [OpenCL] Registers AdjustContrastv2 (#93) * [OpenCL] Extended adjust_contrast_op_benchmark_test for OpenCL (#96) * [OpenCL] Extended adjust_contrast_op_benchmark_test for OpenCL * simplified to #ifndef * Changed to "#if GOOGLE_CUDA" * Update adjust_contrast_op_benchmark_test.cc * Added comments --- Commit cb4c2f8d1 authored by Yifei Feng<yifeif@google.com> Committed by Yifei Feng<yifeif@google.com>: Make TransferBufferToInFeed not virual so it compiles. 
--- Commit e89f04d80 authored by Yifei Feng<yifeif@google.com> Committed by Yifei Feng<yifeif@google.com>: Fix calling Literal member functions. --- Commit 15a8df724 authored by Yifei Feng<yifeif@google.com> Committed by Yifei Feng<yifeif@google.com>: Fix mac build clone from meheff's change: [XLA] Change return type of DeviceAssignment::Deserialize to fix build breakage on mac. The mac build had the following error: error: incomplete type 'xla::DeviceAssignment' used in type trait expression This was due to a static method returning a StatusOr<DeviceAssignment> inside of the definition of DeviceAssignment. --- Commit a54d43fa4 authored by Yifei Feng<yifeif@google.com> Committed by Yifei Feng<yifeif@google.com>: Replace LiteralUtil to Literal in compiler/plugin/executor --- Commit 88a6bb80c authored by Guenther Schmuelling<guschmue@microsoft.com> Committed by Guenther Schmuelling<guschmue@microsoft.com>: expand inline for debug builds to limit number of symbols --- Commit 62fb49d31 authored by Yifei Feng<yifeif@google.com> Committed by Yifei Feng<yifeif@google.com>: Fix visibility error for contrib/remote_fused_graph/pylib/BUILD. --- Commit 4c75252f2 authored by Mark Neumann<markn@allenai.org> Committed by Mark Neumann<markn@allenai.org>: fix initial test values to avoid numerical instability --- Commit b58d98353 authored by sj6077<epik03sj@gmail.com> Committed by Benoit Steiner<benoitsteiner@users.noreply.github.com>: Fixes of AutoParallel bug (#10368) * Fix the bug that auto_parallel could replicate variable snapshot name * Use NodeName in grappler:utils instead of substr, convert variables->variable_def of grappler item * remove variable_def from grappler item, exclude snapshot nodes from dont_replicate_nodes in auto_parallel --- Commit a286b7db8 authored by Yifei Feng<yifeif@google.com> Committed by Yifei Feng<yifeif@google.com>: Make debug_test slice integer. 
--- Commit 97fcfdfa6 authored by Toby Boyd<tobyboyd@google.com> Committed by GitHub<noreply@github.com>: Fixed path to seq2seq.py and minor formatting --- Commit 63c1befb8 authored by Anish Shah<shah.anish07@gmail.com> Committed by Anish Shah<shah.anish07@gmail.com>: Improve docs for tf.nn.depthwise_conv2d_native --- Commit 8d42202b2 authored by Yong Tang<yong.tang.github@outlook.com> Committed by Yong Tang<yong.tang.github@outlook.com>: Fix mismatched delete in mkl_tfconv_op.cc This fix fixes mismatched new[]-delete in mkl_tfconv_op.cc (the file went through clang-format so there are some additional changes) Signed-off-by: Yong Tang <yong.tang.github@outlook.com> --- Commit 26301bd55 authored by Danny Goodman<goodman.danny@gmail.com> Committed by Danny Goodman<goodman.danny@gmail.com>: fix error format --- Commit b3f33ad46 authored by Yao Zhang<yaozhang@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Make changes to prepare for the fused option of batch norm to be set to None (None means using fused batch norm if possible). PiperOrigin-RevId: 159649743 --- Commit a4a469832 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Add tests for select ops and while loops that produce tuples that contain predicates. PiperOrigin-RevId: 159645900 --- Commit 980d3f2be authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Use C API to implement Operation.name property This name property is used in many existing tests including those that already run with C API enabled (math_ops_test, framework_ops_test, session_test, session_partial_run_test, math_ops_test_gpu, etc). PiperOrigin-RevId: 159645767 --- Commit 26239c706 authored by A. 
Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Previously we didn't have an implementation of BatchNormInference and BatchNormTraining, which gives a linker error if anyone ever tries to call that. A dummy implementation is friendlier than a linker error. PiperOrigin-RevId: 159645612 --- Commit f671c5caa authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: BEGIN_PUBLIC Automated g4 rollback of changelist 159570549 PiperOrigin-RevId: 160182040
* Merge changes from github. Jonathan Hseu 2017-06-09
END_PUBLIC --- Commit f0e185d1f authored by Benoit Steiner<bsteiner@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Better handle nodes with a variable number of outputs PiperOrigin-RevId: 158435028 --- Commit bc3e20807 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Remove unused BUILD dependencies PiperOrigin-RevId: 158431059 --- Commit a0c80e4d5 authored by A.
Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Delete unnecessary (mistakenly duplicated) logging message. PiperOrigin-RevId: 158428506 --- Commit b6ad1d747 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Adds DNN-only tests for DNNLinearCombinedClassifier. PiperOrigin-RevId: 158423119 --- Commit ddbb58034 authored by Shanqing Cai<cais@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Remove unnecessary pylint disable PiperOrigin-RevId: 158416140 --- Commit fcaa724e2 authored by Luke Iwanski<luke@codeplay.com> Committed by gunan<gunan@google.com>: [OpenCL] Cleans pack and unpack ops (#10336) * [OpenCL] Cleans pack op * [OpenCL] Cleans unpack op --- Commit 2f53cacb2 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix a test failure of quantization_utils_test on ASAN PiperOrigin-RevId: 158414538 --- Commit 50b2f951c authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Update ops-related pbtxt files. PiperOrigin-RevId: 158413455 --- Commit 1e90b78e9 authored by Brennan Saeta<saeta@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add CacheDataset ops. Some input pipelines may pull down data from remote webservers or perform expensive processing. In order to avoid extraneous work, we now support caching the dataset (e.g. on disk). 
PiperOrigin-RevId: 158411901 --- Commit e16cd2ede authored by Taehoon Lee<taehoonlee@snu.ac.kr> Committed by gunan<gunan@google.com>: Fix typos (#10533) --- Commit 50d80ddf9 authored by Jonathan Hseu<jhseu@google.com> Committed by Jonathan Hseu<jhseu@google.com>: Fix fft_ops_test.py for CPU --- Commit d35cbbb44 authored by Mustafa Ispir<ispir@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add weight-column support to the heads. PiperOrigin-RevId: 158409180 --- Commit 7fb52cd54 authored by Justin Lebar<jlebar@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Don't crash when displaying XLA metrics if they happen to be negative. PiperOrigin-RevId: 158407664 --- Commit 12a7a752a authored by Jianfei Wang<me@thinxer.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: Add a tip for tf.train.LoggingTensorHook (#10237) `INFO` logs are not printed by default unless in IPython. Add a friendly tip for newcomers. --- Commit 216dcbf1e authored by Luke Iwanski<luke@codeplay.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: [OpenCL] Cleans reduction ops (#10340) * [OpenCL] Cleans reduction_ops_max.cc * [OpenCL] Cleans reduction_ops_mean.cc * [OpenCL] Cleans reduction_ops_min.cc * [OpenCL] Cleans reduction_ops_prod.cc * [OpenCL] Cleans reduction_ops_sum.cc --- Commit 2b351062a authored by Androbin<robin.richtsfeld@gmail.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: Improve docs for selective registration headers (#10351) * Improve docs for selective registration headers progressing #10299 * Update print_selective_registration_header.py * Mention both flags -DSELECTIVE_REGISTRATION and -DSUPPORT_SELECTIVE_REGISTRATION --- Commit ee919510f authored by Yun Peng<pcloudy@google.com> Committed by gunan<gunan@google.com>: Re-enable some python tests in Windows Bazel build (#10526) --- Commit b0e881457 authored by Androbin<robin.richtsfeld@gmail.com> Committed by gunan<gunan@google.com>: [Bash] Declare and assign separately (#10509) 
As proposed by static analysis tool: https://github.com/koalaman/shellcheck/wiki/SC2155 --- Commit 284901b08 authored by Androbin<robin.richtsfeld@gmail.com> Committed by gunan<gunan@google.com>: [Bash] Remove unquoting quotes (#10506) As proposed by static analysis tool: https://github.com/koalaman/shellcheck/wiki/SC2027 --- Commit 2a1f11556 authored by ksellesk<zhengdachuan200305@gmail.com> Committed by ksellesk<zhengdachuan200305@gmail.com>: Fix AttributeError in resnet.py There is no function tf.softmax() in TensorFlow 1.x. When running the old code, the Python interpreter complains: File "resnet.py", line 152, in res_net_model prediction, loss = res_net(x, y) File "resnet.py", line 148, in res_net return tf.softmax(logits), loss AttributeError: 'module' object has no attribute 'softmax' --- Commit 1d68f729b authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Remove unneeded BUILD dependency PiperOrigin-RevId: 158391996 --- Commit 08ed32dbb authored by Yun Peng<pcloudy@google.com> Committed by gunan<gunan@google.com>: Windows: Make TensorFlow build without --cpu=x64_windows_msvc (#10466) * Windows: Make TensorFlow build without --cpu=x64_windows_msvc Since Bazel 0.5.0, the MSVC toolchain has been the default toolchain on Windows, so --cpu=x64_windows_msvc is not required as long as we adjust the BUILD files in TensorFlow. --cpu=x64_windows_msvc is still supported for now, but is deprecated. The configuration for cpu value x64_windows_msvc is a duplicate of x64_windows, which should be removed in the future.
* Fix breakage on macOS --- Commit 02dbe153a authored by Androbin<robin.richtsfeld@gmail.com> Committed by gunan<gunan@google.com>: [Bash] Simplify Conditional (#10503) --- Commit c07bc581f authored by Androbin<robin.richtsfeld@gmail.com> Committed by gunan<gunan@google.com>: [Bash] Prefer read -a to split path (#10508) As proposed by static analysis tool: https://github.com/koalaman/shellcheck/wiki/SC2207 --- Commit 0a389674d authored by Androbin<robin.richtsfeld@gmail.com> Committed by gunan<gunan@google.com>: [Bash] Prefer [ p ] && [ q ] over [ p -a q ] (#10507) As proposed by static analysis tool: https://github.com/koalaman/shellcheck/wiki/SC2166 --- Commit 87a008ec3 authored by Jonathan Hseu<vomjom@vomjom.net> Committed by gunan<gunan@google.com>: Delete non-deterministic testEmpty() test (#10512) --- Commit 3a2971bd8 authored by Frank Chen<frankchn@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Adds the base for ClusterResolvers, a new way of communicating with and retrieving cluster information for running distributed TensorFlow. Implementations of this class would eventually allow users to simply point TensorFlow at a cluster management endpoint, and TensorFlow will automatically retrieve the host names/IPs and port numbers of TensorFlow workers from the cluster management service. PiperOrigin-RevId: 158358761 --- Commit 28b4e7f04 authored by Jonathan Hseu<vomjom@vomjom.net> Committed by gunan<gunan@google.com>: Disable stage_op_test and map_stage_op_test (#10516) --- Commit 390e57a75 authored by Yan (Asta) Li<yanastali@users.noreply.github.com> Committed by Benoit Steiner<benoitsteiner@users.noreply.github.com>: Check EIGEN_MAX_ALIGN_BYTES to prevent mod-by-0 (#10380) * Check EIGEN_MAX_ALIGN_BYTES to prevent mod-by-0 If EIGEN_MAX_ALIGN_BYTES is set to 0, alignment checks that mod by EIGEN_MAX_ALIGN_BYTES fail at runtime. 
* Returns true, as in tensorflow/core/framework/tensor.h * Update unit tests * Enable tests only if EIGEN_MAX_ALIGN_BYTES > 0 --- Commit cd5ac40b3 authored by Peter Hawkins<phawkins@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Update LLVM to upstream revision r304927. Add LLVM build rules for the LLVM AMDGPU backend, commented out by default. Fixes issue #10437. PiperOrigin-RevId: 158351480 --- Commit 91cb809bd authored by David Norman<DavidNorman@users.noreply.github.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: [XLA] Add ability to run the XLA unit tests against a different device (#9759) * Add ability to run the XLA unit tests against a different device * Allow for multiple extra backend devices * Correct merge error * Include options for additional tags --- Commit aff4d124b authored by Yuxin Wu<ppwwyyxxc@gmail.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: Compare base_dtype instead of dtype in piecewise_constant (#10280) * Compare base_dtype instead of dtype in piecewise_constant Compare base_dtype instead of dtype in piecewise_constant. Fix #10086 * add unit test * Small lint fix and comment --- Commit 845539f98 authored by Jianwei Xie<xiejw@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add evaluation test for linear classifier (n==2 or n >2). PiperOrigin-RevId: 158340296 --- Commit 7c46214ab authored by Jonathan Hseu<vomjom@vomjom.net> Committed by GitHub<noreply@github.com>: Fix numpy 1.13 incompatibilities (#10501) * Fix numpy 1.13 incompatibilities * Skip tests with numpy 1.13.0 --- Commit 4572c41df authored by gunan<gunan@google.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: A few changes to kernel_tests. (#10502) * Disable reader_ops_test on windows. * Run buildifier on kernel_tests/BUILD * Mark map_stage_op_test as large. 
* Set the size of stage_op_test to large --- Commit 892293d98 authored by Brennan Saeta<saeta@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Set a default for datasets end_of_sequence. While all datasets carefully set the end_of_sequence to true at the appropriate time, some datasets might forget to set it to false in the normal case. In order to avoid potential undefined behavior, we set the end_of_sequence variable to be false by default. PiperOrigin-RevId: 158337799 --- Commit 187404eac authored by Benoit Steiner<bsteiner@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Setup the env to since ops such as MatchFileOp rely on it. PiperOrigin-RevId: 158336344 --- Commit 2741561c8 authored by Justine Tunney<jart@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix up vz_projector script structure We now make sure scripts and HTML imports are declared in the correct places. In the future, pedantically listing script tags should not be necessary. PiperOrigin-RevId: 158334306 --- Commit beeaade46 authored by Kay Zhu<kayzhu@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Resubmit a reverted change. Original description: [XLA] Enable HloEvaluator for constant folding, also merged a few operations from hlo_constant_folding to hlo_evaluator. Additionally: - In ShapeUtil::ForEachIndex: * fix a bug where visitor is called when the shape has zero elements (e.g., F32{1,0}) * added test case for ForEachIndex. - In HloEvaluator: * Instead of copying and caching a Constant instruction, return the literal directly if the instruction is constant. * Fix an issue where TUPLE and OPAQUE primitives are not keyed in the templated typed_visitor. * Use (fixed) LiteralUtil::Populate to populate resulting literal, fixes the preexisting bug in the evaluator where R0 and shape with zero size dimensions are not handled. 
* Refactor ElementWiseUnaryOp and HandleCompare to be templatized on the operand's type. * Refactor IsFinite to be top level since it is only applicable to floats and the return type is always boolean. * Change from std::remainder to std::fmod for kRemainder to be compliant with existing XLA behavior. * Change from std::max and std::min to std::fmax and std::fmin to handle NaNs. * Minor comments fix. PiperOrigin-RevId: 158330052 --- Commit b94540e6f authored by Toby Boyd<tobyboyd@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: tf.layers.conv2d use_bias=True to use nn.bias_add PiperOrigin-RevId: 158326493 --- Commit 379aa9911 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Go: Update generated wrapper functions for TensorFlow ops. PiperOrigin-RevId: 158325855 --- Commit 4e529f0f1 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Update ops-related pbtxt files. PiperOrigin-RevId: 158325293 --- Commit 0a9d2dac0 authored by Yuefeng Zhou<yuefengz@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add a util function in virtual placer to return canonicalized device string, which can be used to fix the node's device field before passing them to the maxcut algorithm. PiperOrigin-RevId: 158322753 --- Commit 2d8da1d9b authored by Daniel Ylitalo<daniel@blodan.se> Committed by gunan<gunan@google.com>: Recognize CPU core count in FreeBSD (#10490) --- Commit c19e6cac0 authored by Peter Hawkins<phawkins@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [TF:XLA] Initial implementation of TensorArray ops. The XLA implementation of TensorArrays is more restrictive than regular TensorArrays: * XLA TensorArrays must have dynamic_size=False. * all elements in an XLA TensorArray must have the same shape. 
* writes always add their values to any existing values; neither reads nor writes ever issue errors. Out-of-bounds writes currently wrap. Refactor Variable handling in the TF/XLA bridge. Use a XlaVariable* to refer to variables inside compilation rather than a numerical ID. Allow for variables that don't correspond to variables known to the user. Also use XlaVariable to handle TensorArrays. PiperOrigin-RevId: 158322041 --- Commit b5e8d3086 authored by Peter Hawkins<phawkins@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [TF:XLA] Refactor randomized tests to allow testing of larger inputs without running out of memory. PiperOrigin-RevId: 158321431 --- Commit 5d90bbaac authored by Kay Zhu<kayzhu@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Disable constant_folding in test base, so that intended test code paths would not be elided by constant_folding pass. PiperOrigin-RevId: 158317641 --- Commit 036ce8ba6 authored by Luke Iwanski<luke@codeplay.com> Committed by gunan<gunan@google.com>: [OpenCL] Cleans dense_update_ops (#10335) * [OpenCL] Cleans dense_update_ops * Acts on feedback from: #10335#discussion_r120536460 --- Commit 85f968125 authored by Luke Iwanski<luke@codeplay.com> Committed by gunan<gunan@google.com>: [OpenCL] Cleans cast operation (#10330) * [OpenCL] Removes not needed typedef for SYCLDevice * [OpenCL] Fixes formatting * [OpenCL] use SYCLDevice for int32 cast case --- Commit bff5e72da authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix typo. 
PiperOrigin-RevId: 158310742 --- Commit 38249d6be authored by Shanqing Cai<cais@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Swap the order of NanTensorHook and custom hooks to ensure that when the training encounters NaNs in the loss function, user-supplied hooks such as tf_debug.LocalCLIDebugHook can still be used to debug the root cause of the numeric issues. PiperOrigin-RevId: 158310249 --- Commit 599727c65 authored by Eli Bendersky<eliben@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Propagate debug option flags to hlo_test_base. Specific HLO tests have to replace the generic test_main target with a manual main() that invokes RUN_ALL_TESTS. To get access to a module with debug options set up, a new convenience method is created on HloTestBase. Initially algebraic_simplifier_test is modified as a canary; in a followup we'll convert all HLO tests to this approach. PiperOrigin-RevId: 158309488 --- Commit 0770393e9 authored by Eric Liu<ioeric@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [Tensorboard] Add a trace viewer component to TensorBoard. We make the trace viewer a separate app; otherwise, there would be dependency conflicts (e.g. Polymer) between the trace viewer app and the tensorboard app. The trace viewer app would be served by a plugin, and Tensorboard dashboard will integrate trace viewer app using iframe in the future. This CL also added "mominify" support for link import HTML tags in the tensorboard home-grown java vulcanizer; otherwise, the vulcanized trace viewer code would crash the java vulcanizer. For open-source build, we add a dependency on the Catapult github repository (https://github.com/catapult-project/catapult/tree/master/tracing). We use a bazel genrule to vulcanize a trace viewer binary which is then used in the tf-trace-viewer component.
PiperOrigin-RevId: 158309408 --- Commit 85e832201 authored by RJ Ryan<rjryan@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Support unknown emit shapes in tf.nn.raw_rnn. PiperOrigin-RevId: 158308002 --- Commit edb5fed7f authored by Mustafa Ispir<ispir@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add label-vocab support to binary logistic head. Add assertion that binary classifier label is in range [0., 1.]. Fixed Classifier Integration tests. PiperOrigin-RevId: 158307521 --- Commit f8e1cf8fa authored by Justine Tunney<jart@google.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: Open up visibility of tf_imports (#10500) This also fixes the definition of Clutz. --- Commit 9fd7cf054 authored by Luke Iwanski<luke@codeplay.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: [OpenCL] Cleans relu ops (#10343) * [OpenCL] register relu ops to gpu types (no half) * [OpenCL] Removes #undef EIGEN_USE_SYCL --- Commit 09c1455e3 authored by Luke Iwanski<luke@codeplay.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: [OpenCL] Cleans reverse_op.cc (#10346) --- Commit b7892a30f authored by orome<royl@aldaron.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: Clarify tf.matmul documentation (#10381) * Update math_ops.py * Fix non-ascii character --- Commit 9786b7062 authored by Luke Iwanski<luke@codeplay.com> Committed by Benoit Steiner<benoitsteiner@users.noreply.github.com>: [OpenCL] Cleans StridedSlice Op (#10314) * [OpenCL] Cleans StridedSlice Op * [OpenCL] Removes half from registered types --- Commit f105df047 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: In the CUDA path of depthwise_conv2d, optimize backward filter convolution for images 2 or 4 times smaller than 16x16. Also initialize in_cols from blockDim, to fix the regression caused in CL 157906773. PiperOrigin-RevId: 158296136 --- Commit 492afc2e3 authored by A.
Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Go: Update generated wrapper functions for TensorFlow ops. PiperOrigin-RevId: 158295169 --- Commit abe0877ef authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add bazel version check to .configure PiperOrigin-RevId: 158294569 --- Commit b702e7e79 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Update ops-related pbtxt files. PiperOrigin-RevId: 158294289 --- Commit 94085bee7 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Replace std::function object with regular function. The function is called recursively, and the std::function object had only existed to allow recursion from within a lambda expression. A regular function should be cheaper than a polymorphic function wrapper. PiperOrigin-RevId: 158292415 --- Commit ba656b261 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Use template specialization instead of overloaded methods. This is a more appropriate tool here. NFC PiperOrigin-RevId: 158292035 --- Commit 55f987692 authored by Yutaka Leon<yleon@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Make tf.contrib.lookup python functions use the kernels v2 that uses the resource tensor as handler. PiperOrigin-RevId: 158291836 --- Commit ebae3deba authored by Wei Ho<weiho@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Switch back to max_num_rows_to_load instead of reading slice by slice due to performance regression from network overhead. 
Add check when using initializing values to avoid seg fault PiperOrigin-RevId: 158291218 --- Commit 7b4c01794 authored by RJ Ryan<rjryan@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Support numpy-style padding and slicing of tf.spectral.rfft/irfft to match the desired FFT length. Fixes incorrect RFFT/IRFFT results when fft_length does not match the input dimension. PiperOrigin-RevId: 158289991 --- Commit fdb8e2935 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Update iOS examples to use CocoaPods, and moved to tensorflow/examples/ios PiperOrigin-RevId: 158289285 --- Commit d86167b5f authored by Amit Patankar<amitpatankar@google.com> Committed by Amit Patankar<amitpatankar@google.com>: Merging rc2 back into master. --- Commit dffea202a authored by Eli Bendersky<eliben@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Clean up some code after previous CL PiperOrigin-RevId: 158282834 --- Commit 7b5302af0 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Adds ability to set a "family" attribute in Tensorflow summaries, which controls the "tab name" of the summary that is displayed. This solution keeps using name_scope to keep names unique, but then prefixes the tag with the family name if provided. PiperOrigin-RevId: 158278922 --- Commit 611c82b5b authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Adds integration test for DNNLinearCombined((Classifier)|(Regressor)). PiperOrigin-RevId: 158278512 --- Commit cc6c91a9a authored by A. 
Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Remove a further unused proto header inclusion PiperOrigin-RevId: 158278026 --- Commit 9f17c26ca authored by Mark Heffernan<meheff@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Add HloLocation to dataflow analysis. Add an HloLocation abstraction to dataflow analysis which indicates where (in the output of what instruction and at which index) an HloValue may appear. Previously only uses were stored with an HLO value where a use is an edge in the HLO graph (instruction, operand number and ShapeIndex). Also, change the handling of tuple-shaped kSelect instructions when ssa_form is true. Previously a phi value would be created. With this change the value set instead contains the union of its inputs, identical to the ssa_form=false case. PiperOrigin-RevId: 158276598 --- Commit b9d5e1441 authored by Eli Bendersky<eliben@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Start collecting flags for debug options in a single place. ClientLibraryTestBase will now parse command-line flags for debug options automatically, permitting subclasses to override certain options by using mutable_debug_options. main() still has to call AppendDebugOptionsFlags() explicitly before running the TF flag parser. In the meantime, this CL leaves flag handling to the current "legacy" approach. However, this is part of a larger plan to move *all* debugging flags for XLA into the DebugOptions message and expose them as flags from a single place. The other flags (which are not controlling debugging options) will have to be propagated more explicitly.
PiperOrigin-RevId: 158276294 --- Commit 3b6fe94bb authored by Benoit Steiner<bsteiner@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Properly handle shape nodes that have a preexisting control dependency PiperOrigin-RevId: 158274845 --- Commit 1d67379d5 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Minor cleanup PiperOrigin-RevId: 158268933 --- Commit 41997756c authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Sort header inclusions; define EIGEN_USE_THREADS where headers depend on it. PiperOrigin-RevId: 158267803 --- Commit 85355f015 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add missing header inclusion PiperOrigin-RevId: 158265934 --- Commit 3cf88d390 authored by Gunhan Gulsoy<gunan@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: When GPU is configured, do not require --config=cuda. Also fix indentation in configure. PiperOrigin-RevId: 158232959 --- Commit f48673b50 authored by Luke Iwanski<luke@codeplay.com> Committed by gunan<gunan@google.com>: [OpenCL] Removes ReductionFunctor for SYCLDevice (#10326) We are using Eigen implementation --- Commit 1b6453bec authored by Joan Puigcerver<joapuipe@gmail.com> Committed by gunan<gunan@google.com>: Fixes issue #10258 (#10366) On CUDA versions previous to 8.0, only __shared__ variables could be declared as static in the device code. --- Commit cd56a638d authored by Beomsu Kim<123bskim@naver.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: Fixed wrong range in docstring (#10272) --- Commit d13ae380c authored by Michał
Jastrzębski<michal.jastrzebski@intel.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: Fix CMD in Dockerfile (#10444) Currently Notebook fails execution because default user for this container is root, and unless explicitly allowed, jupyter notebook will not start. --- Commit 8118ab4ec authored by Simon Perkins<simon.perkins@gmail.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: Support partial gets in MapStagingArea (#10276) * Modify map staging area tests - size from `small` to `medium` - introduce 2 shards * Add partial get support in MapStagingArea A partial list of tensors in a (key, value) map entry can now be requested. Once all tensors associated with the entry are removed, it is removed from the map. * Correct output/indices mismatch errors * Rename IncompleteTuple to OptionalTuple * Add partial get test with indices * Add some more index checks * Improve stage test case graph creation Test sessions (and default graphs) are reused by default. Create explicit, finalized graphs in each test to prevent possible interactions between stateful Staging Areas and other ops created in separate tests. * Make staging area tests small and remove shards They were originally made 'medium' to ameliorate timeouts in the test case, but they usually run in ~1s so they should be small. * Improve imports Avoid importing base tensorflow package * Support both python 2 and python 3 range. * Set map_stage_op_test to size=large * Convert the tests to size=medium --- Commit 0df102b0a authored by Androbin<robin.richtsfeld@gmail.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: Update `configure` script sample (#10455) The `configure` script was changed regularly since the generation of the sample. This PR updates the sample to reflect those changes.
--- Commit f6dc1ac61 authored by Earthson Lu<Earthson.Lu@gmail.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: MKL_INSTALL_PATH should not be ignored when given (#10180) * MKL_INSTALL_PATH should not be cleared when given * fix overwrite by default --- Commit 8ad6a036e authored by Asim Shankar<ashankar@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Java: Update Maven release to 1.2.0-rc2 PiperOrigin-RevId: 158212897 --- Commit 15eddf035 authored by Fritz Obermeyer<fritz.obermeyer@gmail.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: Export C API symbols in _pywrap_tensorflow_internal.so (#10469) * Export C API symbols * Export C API symbols under config:default --- Commit 754e12668 authored by Luke Iwanski<luke@codeplay.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: [OpenCL] Removes half concat op registration (#10331) --- Commit cfdc22dee authored by Peng Yu<yupbank@users.noreply.github.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: fix the error (#10293) --- Commit 58747e357 authored by Joel Hestness<jthestness@gmail.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: PhiloxRandom: Fix race in GPU fill function (#10298) * PhiloxRandom: Fix race in GPU fill function The PhiloxRandom fill kernel for the GPU had race conditions that caused the outputs to be non-deterministic. In particular, the code previously executed with N GPU threads (# thread contexts per GPU), but it would only advance the fill addresses by N-1 stride in each step. This incorrect stride caused the 0th and N-1st threads to write to the same memory locations, racing for which was last to write their common locations. Make the stride equal to the number of threads to eliminate the race. BONUS: By fixing this race, PhiloxRandom constant-sized GPU initializers now match CPU initializers. * Update random_ops_test.py to find race conditions Increasing the size of arrays in the random_ops_test.py test to manifest the race conditions to be resolved.
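The stride problem described in commit 58747e357 can be illustrated with a small host-side model. This is only a sketch, not the actual CUDA kernel: it assumes each thread `t` writes a single element at `t`, `t + stride`, `t + 2*stride`, ... (the real kernel writes groups of samples), and the helper names `write_counts` and `contested` are hypothetical.

```python
from collections import Counter


def write_counts(num_threads, stride, size):
    """Count how often each output index is written when thread t
    writes indices t, t + stride, t + 2*stride, ... below `size`."""
    counts = Counter()
    for t in range(num_threads):
        for idx in range(t, size, stride):
            counts[idx] += 1
    return counts


def contested(num_threads, stride, size):
    """Indices written by more than one thread, i.e. potential races."""
    counts = write_counts(num_threads, stride, size)
    return sorted(i for i, c in counts.items() if c > 1)


N = 4
buggy = contested(N, N - 1, 12)  # stride one short of the thread count
fixed = contested(N, N, 12)      # stride equal to the thread count

print(buggy)  # [3, 6, 9]: thread 0 and thread N-1 hit the same slots
print(fixed)  # []: every index is written by exactly one thread
```

With a stride of N-1, thread 0 at step k+1 and thread N-1 at step k both land on index (k+1)(N-1), which is the overlap the commit message describes.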
--- Commit 2cbcda08f authored by Androbin<robin.richtsfeld@gmail.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: Fixed formatting in Linux install guide (#10353) Formatting issues were introduced in PR #8825, commit f30918b3694afe844990cbddc82e27e023d88856 --- Commit ab5f38560 authored by Lakshay Garg<lakshayg@outlook.in> Committed by Jonathan Hseu<vomjom@vomjom.net>: Fixed typos in documentation & READMEs (#10365) --- Commit 94dc1dbfa authored by Christos Nikolaou<cNikolaou@users.noreply.github.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: Enable figures in the tfprof README.md (#10372) --- Commit 3018d4678 authored by Taehoon Lee<taehoonlee@snu.ac.kr> Committed by Jonathan Hseu<vomjom@vomjom.net>: Fix typos (#10386) --- Commit c5f3c6171 authored by Daniel Rasmussen<drasmuss@users.noreply.github.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: Fix unbatch for Datasets with multiple elements (#10401) * Fix unbatch for datasets with multiple elements * fixup! pylint (indent two spaces instead of four) --- Commit 8b065bc10 authored by Yong Tang<yong.tang.github@outlook.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: Fix unaligned args in api_docs/python/tf/contrib/learn/Evaluable (#10423) This commit fixes unaligned args in api_docs/python/tf/contrib/learn/Evaluable Signed-off-by: Yong Tang <yong.tang.github@outlook.com> --- Commit 8f89b654f authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Profile memory usage in VirtualScheduler and report peak memory usage. To do so, NodeState now handles different output ports of a node (in case a node has multiple outputs). Also, VirtualScheduler code is cleaned up with more comments. PiperOrigin-RevId: 158209068 --- Commit 0ea0bf5aa authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add a frontend for viewing the first ops that exhibit bad values (NaN, +/- Inf). 
This helps the user identify problematic ops. Also moved the debugger data logic within tf-graph-info into a new tf-graph-debugger-data-card component. PiperOrigin-RevId: 158208679 --- Commit ed47ecf2d authored by Luke Iwanski<luke@codeplay.com> Committed by Benoit Steiner<benoitsteiner@users.noreply.github.com>: [OpenCL] Cleans variable op (#10333) * [OpenCL] Cleans variable op * Fixes formatting and float / double -> GPU_NUMBER_TYPES_NO_HALF --- Commit 9b2c1af63 authored by Luke Iwanski<luke@codeplay.com> Committed by Benoit Steiner<benoitsteiner@users.noreply.github.com>: [OpenCL] Improves device reporting (#10462) Prints: id, type, name, vendor and profile of the device --- Commit 7f5384dcc authored by Alexandre Passos<apassos@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Making load() work for resource variables. PiperOrigin-RevId: 158205361 --- Commit 05412bd36 authored by Mark Heffernan<meheff@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Simplify Shape traversal visitors. Simplify shape traversal visitors in ShapeUtil and ShapeTree. Add a non-Status form because most uses of the traversal methods do not use it, and remove is_leaf parameter from ShapeTree.ForEach* as it is not frequently used. PiperOrigin-RevId: 158201574 --- Commit 69c9365b4 authored by Mustafa Ispir<ispir@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Extracted linear estimator testing utils to be reused by dnn-linear-combined. Added tests for linear part of dnn-linear-combined estimator. PiperOrigin-RevId: 158200827 --- Commit 65ce8c723 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add arrowheads to dataflow edges. Make reference edges orange. Remove animations from tooltips in the graph documentation. 
Previously, arrowheads were only added to reference edges (because we assumed users knew about the convention that arrowless edges flow upwards). That decision nicely reduces clutter. However, recently, some internal and external folks have expressed confusion, and so I want to try adding arrowheads to all data flow edges. And make the reference edges starkly different. See #10428 PiperOrigin-RevId: 158195388 --- Commit bf4c3dd6b authored by gunan<gunan@google.com> Committed by GitHub<noreply@github.com>: Revert "Fix patching issue on Windows" (#10472) This reverts commit 47e6785646a1266f01a1a570bd799f8518ee2997. --- Commit b49515539 authored by David Soergel<soergel@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add only string constants to ASSET_FILEPATHS collection. PiperOrigin-RevId: 158192152 --- Commit 51acad09c authored by Sergio Guadarrama<sguada@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add tests with different delta to huber_loss. PiperOrigin-RevId: 158191361 --- Commit a4e7b7add authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fixes a bug in setting default optimizers for DNNLinearCombinedClassifier. PiperOrigin-RevId: 158190192 --- Commit ddd67e333 authored by Luke Iwanski<luke@codeplay.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: [OpenCL] Cleans reshape.cc (#10347) * [OpenCL] Cleans reshape.cc * Removes half and complex numbers. Half is extension and complex numbers needs implementation in Eigen first --- Commit 3ca653304 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Update ops-related pbtxt files. 
PiperOrigin-RevId: 158186454 --- Commit 8cda8660e authored by Luke Iwanski<luke@codeplay.com> Committed by gunan<gunan@google.com>: [OpenCL] Cleans sendrecv_ops.cc (#10345) --- Commit 6915bb919 authored by Luke Iwanski<luke@codeplay.com> Committed by gunan<gunan@google.com>: [OpenCL] Cleans Slice op (#10341) --- Commit 54998b45d authored by Michele Colombo<m-colombo@users.noreply.github.com> Committed by Jonathan Hseu<vomjom@vomjom.net>: BasicRNNCell comment fix (#10467) --- Commit df5906fb7 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Mark saver/restore ops that depend on filesystem as stateful to disable them from being folded into a constant by graph optimizer. PiperOrigin-RevId: 158182282 --- Commit 96cb4d182 authored by Sergio Guadarrama<sguada@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add support of scale_l1 == 0. or scale_l2 == 0 to l1_l2_regularizer. Added tests. PiperOrigin-RevId: 158179790 --- Commit b65eb3f9b authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Speed up atrous_convolution_test by combining evaluations. To make this test run faster (and prevent it from timing out under certain circumstances), this change combines all evaluations for each test method into a single call to Session.run, to eliminate overhead. This reduces the test time from about 40 seconds to 10 seconds. RELNOTES: n/a PiperOrigin-RevId: 158175227 --- Commit b440abce7 authored by Gao, Xiang<qasdfgtyuiop@gmail.com> Committed by Rasmus Munk Larsen<rmlarsen@google.com>: add Cuda{2D,3D}LaunchConfig that maximizes occupancy (#10032) * add Cuda{2D,3D}LaunchConfig that max occupancy * remove default val, check input<=0 * add max size check * fix typo * tests, docs, and related changes * build the test * buildify * cudaOccupancy... call check success, and style fix --- Commit 81cf61fdb authored by A. 
Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Initialize tensor in graph_properties_test, to avoid msan complaint. PiperOrigin-RevId: 158169374 --- Commit cabc5c35c authored by Eli Bendersky<eliben@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Add xla_disable_hlo_passes to DebugOptions Also add a SetDebugOptions method to ClientLibraryTestBase; this lets us set debug options in tests by calling it. As an example, this CL removes the current way of passing xla_disable_hlo_passes programmatically in tests - it used to employ a special constructor parameter which is no longer required. PiperOrigin-RevId: 158169006 --- Commit 187d23337 authored by Luke Iwanski<luke@codeplay.com> Committed by gunan<gunan@google.com>: [OpenCL] Cleans Pad op (#10339) --- Commit e8bc38ef6 authored by gunan<gunan@google.com> Committed by GitHub<noreply@github.com>: Fix test failures on windows. (#10470) --- Commit 2b3535c64 authored by David Soergel<soergel@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Minor docstring fix for build_parsing_serving_input_receiver_fn PiperOrigin-RevId: 158163615 --- Commit e55f2e036 authored by Benoit Steiner<bsteiner@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Propagates constants through switch nodes. PiperOrigin-RevId: 158163537 --- Commit b01d4b905 authored by Jacques Pienaar<jpienaar@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Remove outdated todo. PiperOrigin-RevId: 158161411 --- Commit 7125733d7 authored by William Chargin<wchargin@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Create a set of sample data for the audio plugin This implements a simple tone generator, with sine waves, square waves, and triangle waves, plus two simple combinations of sine waves. The step value is used to control the frequency.
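A tone generator of the kind commit 7125733d7 describes might look roughly like the sketch below. The commit does not say how the step maps to a frequency, so the mapping in `tone` (as well as the `base_hz`, `sample_rate` and `n` parameters and all function names) is a hypothetical choice made for illustration only.

```python
import math


def sine(phase):
    """Sine wave; `phase` is measured in cycles."""
    return math.sin(2 * math.pi * phase)


def square(phase):
    """Square wave: +1 for the first half of each cycle, -1 for the second."""
    return 1.0 if (phase % 1.0) < 0.5 else -1.0


def triangle(phase):
    """Triangle wave rising from -1 to +1, then falling back to -1."""
    p = phase % 1.0
    return 4 * p - 1 if p < 0.5 else 3 - 4 * p


def tone(wave, step, sample_rate=8000, n=8000, base_hz=220.0):
    """Generate n samples; the step picks the frequency (assumed mapping)."""
    freq = base_hz * (step + 1)  # hypothetical: higher step, higher pitch
    return [wave(freq * i / sample_rate) for i in range(n)]


samples = tone(triangle, step=2, n=100)
```

All three waveforms stay within [-1, 1], so the resulting sample lists can be scaled to whatever amplitude range an audio encoder expects.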
PiperOrigin-RevId: 158160889 --- Commit dc81a2420 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Updates to the WALSMatrixFactorization estimator: - Add a completed_sweeps variable to keep track of sweeps that have been completed during training. - Add a StopAtSweepHook, which can request a stop after completing a specified number of sweeps. PiperOrigin-RevId: 158156347 --- Commit 74220616c authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Set device cores and frequency in op_level_cost_estimator_test, to avoid asan error about assigning inf to int64 (this comes in from a divide-by-0). PiperOrigin-RevId: 158155488 --- Commit 47e678564 authored by Yun Peng<pcloudy@google.com> Committed by gunan<gunan@google.com>: Fix patching issue on Windows (#10452) --- Commit 6d54f09d9 authored by Yun Peng<pcloudy@google.com> Committed by gunan<gunan@google.com>: Fix linking errors of lmdb on Windows (#10457) --- Commit 61c8a745b authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Minor cleanup: Add braces around if statement arms; remove redundant "return" and "static". PiperOrigin-RevId: 158143418 --- Commit e9a889c5e authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Pass int parameter by value, not by const reference PiperOrigin-RevId: 158142102 --- Commit 9184726ed authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Avoid unnecessary copying of map data during visitation PiperOrigin-RevId: 158141962 --- Commit 2e7e1d57b authored by A. 
Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Small fix for how std::move is used in constructors PiperOrigin-RevId: 158141564 --- Commit 2a61c1652 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: In cpu compiler's CompileAheadOfTime, pass ordering when compiling entry computation. PiperOrigin-RevId: 158140349 --- Commit f3f53e8b3 authored by Derek Murray<mrry@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [tf.contrib.data] Add support for dicts and remove lists from nested structures. This changes the behavior of constructors like `tf.contrib.data.Dataset.from_tensors()` when passed a list. Previously, the `nest` utility would recurse into each element of such a list and create a separate Dataset component. Now the list will be converted to a tensor, allowing code like: ```python dataset = tf.contrib.data.Dataset.from_tensor_slices(([1, 2, 3], [4, 5, 6])) ``` ...to define a dataset with two components (each of shape `()`). This change also adds support for dictionaries as nested structures, which simplifies integration with dictionary-returning ops like `tf.parse_example()`. Fixes #10151. RELNOTES: Breaking change to `tf.contrib.data.Dataset` APIs that expect a nested structure. Lists are now converted to tf.Tensor implicitly. You may need to change uses of lists to tuples in existing code. In addition, dicts are now supported as a nested structure. PiperOrigin-RevId: 158139467 --- Commit b6a8848c1 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Enabling python configuration to use a remotely generated configuration that is located inside of the org_tensorflow repo (previously it *had* to be a remote repo declared in workspace file). PiperOrigin-RevId: 158138601 --- Commit 0fe0bfcc3 authored by A. 
Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Remove unused protobuf header inclusions PiperOrigin-RevId: 158120864 --- Commit f0c4c6c3f authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: In the CUDA path of depthwise_conv2d, add a fast NCHW backward filter convolution for images smaller than 16x16. PiperOrigin-RevId: 158111294 --- Commit 8dcf37b47 authored by Jon Malmaud<malmaud@gmail.com> Committed by gunan<gunan@google.com>: Fix typo (#10379) --- Commit 3039d7da2 authored by Androbin<robin.richtsfeld@gmail.com> Committed by gunan<gunan@google.com>: Remove "bazel clean" (#10318) Reverting #8880 (see #10236) unnecessary since bazelbuild/bazel#2759 was merged --- Commit ae1c16ae8 authored by Yifei Feng<fengyifei2026@gmail.com> Committed by gunan<gunan@google.com>: Update docker to cudnn6. (#10307) * Update docker to cudnn6. * Update Dockerfile.gpu * Add --expunge to bazel clean to make cuda_configure run again and update TF_CUDNN_VERSION. * Remove expunge and set CUDA and CUDNN version default in configure. * Update configure * Only set --action_env once * Update prints for default version. --- Commit 232e9d86d authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: tf_workspace() claims that the tf_repo_name argument is unused. temp_workaround_http_archive still requires it. This change silences the spurious message. PiperOrigin-RevId: 158089834 --- Commit cc1a02d37 authored by Francois Chollet<fchollet@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add fp16 support to convolutional layers that support it. PiperOrigin-RevId: 158086284 --- Commit 7d3fbba48 authored by Mustafa Ispir<ispir@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Extracted dnn estimator testing utils to be reused by dnn-linear-combined. 
Added tests for dnn part of dnn-linear-combined estimator. PiperOrigin-RevId: 158084898 --- Commit 9d12c629c authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Refactor the document and some polishment PiperOrigin-RevId: 158083952 --- Commit 134138299 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Corrected comment: import_scoped_metagraph does not return a Saver. PiperOrigin-RevId: 158082288 --- Commit a58553e4d authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add function in shape inference to try to infer output tensor content based on the input shapes of the op. In some cases (E.g: shape), knowing the shapes of the input is all that is necessary to infer the content of the output tensor. This improves shape inference. PiperOrigin-RevId: 158079306 --- Commit 0cc851c08 authored by Yuefeng Zhou<yuefengz@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Call maxcut algorithm in the model_based_cost_estimator. PiperOrigin-RevId: 158078511 --- Commit 7d76a90be authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add question marks next to items in the graph legend. PiperOrigin-RevId: 158076005 --- Commit 68fdb7628 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add DNNLinearCombinedClassifier. PiperOrigin-RevId: 158075939 --- Commit 3d52e4cb9 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix create_meta_graph to respect an empty collection_list. 
PiperOrigin-RevId: 158073112 --- Commit 54ccc3e5a authored by Mark Heffernan<meheff@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Add module-scoped HLO dataflow analysis. This is the first step to replacing TuplePointsToAnalysis with a global, module-scoped analysis. This dataflow analysis identifies all values and their defs and uses in the XLA graph. The analysis is currently unused. Follow up CLs will add buffer alias analysis using this dataflow analysis, and incrementally switch the transformation passes (for example, CopyInsertion) to use these new module-scoped analyses. PiperOrigin-RevId: 158067910 --- Commit 93c57c6e4 authored by Benoit Steiner<bsteiner@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Handle control flow logic properly: * Don't fold enter/exit nodes since that can interact badly with frames * Create proper control dependencies on switch nodes PiperOrigin-RevId: 158066691 --- Commit 9e6899720 authored by Jingyue Wu<jingyue@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [SE] Add cudnnTransformTensor to StreamExecutor. PiperOrigin-RevId: 158062553 --- Commit 827874c30 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: In the CUDA path of depthwise_conv2d, add a fast NCHW backward input convolution for images smaller than 16x16. PiperOrigin-RevId: 158061669 --- Commit bee26215c authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Speed up multinomial_op on CPU by using a vectorized Eigen expression and avoiding unnecessary casts. 
Benchmark with AVX+FMA enabled:
Run on <redacted> (12 X 3492 MHz CPUs); 2017-06-05T12:54:07.881672447-07:00
CPU: Intel Haswell with HyperThreading (6 cores) dL1:32KB dL2:256KB dL3:15MB
Benchmark                          Base (ns)   New (ns)  Improvement
--------------------------------------------------------------------
BM_Multinomial_cpu_1_10000_4          250817     172953       +31.0%
BM_Multinomial_cpu_1_10000_128        273834     187552       +31.5%
BM_Multinomial_cpu_1_10000_10000     1174175    1130778        +3.7%
BM_Multinomial_cpu_1_100000_4        2040741    1276761       +37.4%
BM_Multinomial_cpu_32_10000_4       10221765    4498666       +56.0%
BM_Multinomial_cpu_32_10000_128     10638159    4994754       +53.0%
BM_Multinomial_cpu_32_100000_4     100790019   44193314       +56.2%
BM_Multinomial_cpu_128_100000_1    431269640  182506078       +57.7%
PiperOrigin-RevId: 158061480 --- Commit 515b3ac67 authored by Justine Tunney<jart@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add Clutz to TensorBoard build This is so we can get JavaScript protobufs. This CL also improves the web_aspect and makes some peculiar Closure Compiler errors go away relating to externs. PiperOrigin-RevId: 158061198 --- Commit 0df6760fe authored by Benoit Steiner<bsteiner@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Added a test to make sure that graph properties for variables are properly reported PiperOrigin-RevId: 158053084 --- Commit 2ccfe8e76 authored by Benoit Steiner<bsteiner@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Added a new method to extract the graph properties from a cost graph without having to run the model. This will simplify the process of creating regression tests PiperOrigin-RevId: 158050327 --- Commit 27f1b80c2 authored by Alexandre Passos<apassos@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fixes memory leak in py_func when functions return unwrapped strings.
PiperOrigin-RevId: 158046530 --- Commit cf238e1f2 authored by Eugene Brevdo<ebrevdo@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix memory leak in python caused by @tf_should_use. The issue is that python's GC has trouble collecting objects with __del__ methods. The solution is two pronged: * Keep track of usage state outside of the class, via a dict mapping id(object) => state * Remove __del__ (this was the source: python's GC couldn't collect wrapped objects), and instead use weakref.finalize to emit warnings just as the object is being garbage collected. * Added tests for garbage collection [they were failing before I fixed the issue] PiperOrigin-RevId: 158042388 --- Commit e6f581863 authored by Bo Wang<david.b.wang@gmail.com> Committed by Rasmus Munk Larsen<rmlarsen@google.com>: New reader for LMDB databases (#9950) * Add LMDBReader op and test case * Add testcase to load LMDB from a folder * Add tensorflow/core/lib/lmdb/testdata/data.mdb * Add EOF test * Add license export * Blacklist the test data in pip_smoke_test.py * Address issues with respect to review * Add LICENSE to BUILD rules * Remove the prefix of LICENSE * Wrap key with compat.as_bytes() * Fixed a compilation flag * Improve BUILD rules * Support LMDB build in cmake * Fix BUILD file format with buildifier * Add fake unistd.h for lmdb to build on Windows * Avoid building lmdb tools which depends on unistd.h * Fix the string encoding issue in Python3 * Update lmdb library name in CMakeList.txt --- Commit cc411f938 authored by Yao Zhang<yaozhang@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: When converting the layout of Conv2DBackpropInput, we need to permute one of its inputs, which is a constant node. We permute a copy of this node, instead of the original node, because the original node may be used as input to other nodes.
This kind of sharing of const node could arise if the graph is pre-optimized by common subexpression elimination, which is part of the L1 optimizations in TensorFlow. PiperOrigin-RevId: 158037552 --- Commit 88bdb6fca authored by Dandelion Mané<dandelion@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Remove all remaining references to non-public TF modules from TensorBoard. I deleted the PluginAssetUtil tests because that code is deprecated. I'll later add manual testing for backcompat in the text plugin. PiperOrigin-RevId: 158037466 --- Commit 6c531eb2f authored by Francois Chollet<fchollet@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add file hash to Keras Boston Housing dataset to force cache update. PiperOrigin-RevId: 158036587 --- Commit afdc38cd3 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Remove deprecated resource handle functions in InferenceContext. PiperOrigin-RevId: 158034419 --- Commit 9f932e6ce authored by Derek Murray<mrry@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Avoid parsing a rendezvous key for Send/Recv ops outside a loop. For such ops, the rendezvous key will be constant, because `ctx->frame_iter()` will always evaluate to `{0, 0}`. Benchmarking reveals that this can save between 1 and 2 microseconds per Send or Recv op execution. The optimization applies to all cross-process, inter-device, and intra-device (host-to/from-device memory) Send/Recv ops. PiperOrigin-RevId: 158032522 --- Commit cc2dd4ac8 authored by Shanqing Cai<cais@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: tfdbg: dump debug data from different devices in separate directories Fixes: #7051 wherein TFDBG failed to load the data dump from a Session.run() involving multiple GPUs. The root cause of the bug was that TFDBG previously assumed that node names are unique across all partition graphs.
This is however not the case when multiple GPUs exist. The Send/Recv nodes in the partition graphs of the GPUs can have duplicate names. There will potentially be other cases like this in the future due to other reasons (e.g., distributed sessions and/or graph optimization). This CL relaxes this assumption, by dumping the GraphDef and tensor data from different devices into different sub-directories under the dump root directory. PiperOrigin-RevId: 158029814 --- Commit a5909d643 authored by Toby Boyd<tobyboyd@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fixed triggering create device multiple times PiperOrigin-RevId: 158025196 --- Commit 504a307b7 authored by Martin Wicke<wicke@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Make sure that Adam colocates ops with a consistent variable across workers. PiperOrigin-RevId: 158022292 --- Commit 69ba4d3d4 authored by Asim Shankar<ashankar@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix #10371 cpuinfo.get_cpu_info() doesn't seem to include the l2_cache_size key on some architectures. PiperOrigin-RevId: 158021008 --- Commit a51a9846c authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Performance-related tweaks: Don't copy loop variables; remove ineffective std::move casts. PiperOrigin-RevId: 158017670 --- Commit 009789f74 authored by Peter Hawkins<phawkins@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Allow 0-sized slices in DynamicSlice and DynamicUpdateSlice; add tests. PiperOrigin-RevId: 158015870 --- Commit 48a4853eb authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Miscellaneous cleanups PiperOrigin-RevId: 158012131 --- Commit 379ddde24 authored by Chris Song<sjhshy@gmail.com> Committed by Chris Song<sjhshy@gmail.com>: Fix misspells. 
--- Commit a0a76da97 authored by Lakshay Garg<lakshay.garg.1996@gmail.com> Committed by Lakshay Garg<lakshay.garg.1996@gmail.com>: Fixed typo in code --- Commit 7ffc35732 authored by Eugene Brevdo<ebrevdo@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add support for bools in matrix_diag, matrix_diag_part, matrix_set_diag, matrix_band_part. PiperOrigin-RevId: 157939272 --- Commit edf3d5dbe authored by Darren Garvey<darren.garvey@gmail.com> Committed by Darren Garvey<darren.garvey@gmail.com>: configure: Fix default path when enabling MPI. Correct showing what the default path is when mpi is installed. --- Commit aad2e3daf authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: In the CUDA path of depthwise_conv2d, add a fast NCHW forward convolution for images smaller than 16x16. PiperOrigin-RevId: 157915637 --- Commit 5cf08d9cb authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Drop blockDim.y for the equivalent in_cols, and slightly improve naming (use 'pixels' instead of 'size' for height*width numbers). PiperOrigin-RevId: 157906773 --- Commit 563f05ff6 authored by Eugene Brevdo<ebrevdo@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [tf contrib seq2seq] Expand tile_batch to handle nested structures. This allows it to properly tile the initial wrapper state when using BeamSearchDecoder with AttentionWrapper. Unit tests updated to show this use. PiperOrigin-RevId: 157903115 --- Commit 1234e2dda authored by Justine Tunney<jart@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix Plottable definition On Mac OS the build directory in the Node package conflicts with BUILD. 
PiperOrigin-RevId: 157899970 --- Commit bb7a8d8e7 authored by Benoit Steiner<bsteiner@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Don't use the _output_shape attribute in the op_level_cost_estimator since there is no guarantee that it will be present or accurate. PiperOrigin-RevId: 157898989 --- Commit 6f4204c3d authored by Justine Tunney<jart@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix TensorBoard SHA256 in cmake PiperOrigin-RevId: 157897958 --- Commit c9d2f432b authored by Justine Tunney<jart@google.com> Committed by Justine Tunney<jart@google.com>: Fix TensorBoard SHA256 in cmake --- Commit 1c70fb686 authored by Jianwei Xie<xiejw@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add training test for multi classes (n>2) linear classifier. PiperOrigin-RevId: 157896002 --- Commit 675d36be0 authored by Yao Zhang<yaozhang@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add fused batch norm to tf.layers. PiperOrigin-RevId: 157893874 --- Commit f37d0ea47 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Internal change -- first draft docs PiperOrigin-RevId: 157891937 --- Commit 9b8f6113b authored by Zongheng Yang<zongheng@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: tensor_bundle: fix that the read path forgets to cache file handles. In a case where a reader is geographically far from the file, this change achieves a speedup of end-to-end checkpoint restore by 5.8x. PiperOrigin-RevId: 157889659 --- Commit 0c92dada6 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Use inplace Cholesky factorization and solves to speed up and reduce memory usage in matrix_solve_ls. Check success before copying outputs in cholesky_op. 
PiperOrigin-RevId: 157887564 --- Commit a4caeb2ea authored by William Chargin<wchargin@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Extract the graphs dashboard to a plugin This completes the great plugin migration! The graphs plugin is somewhat different from the plugins considered so far. First, it exposes two kinds of data: graph data and run metadata. We elect to put both sources of data under the domain of the graphs plugin for now, because it's not clear that the run metadata would be useful for anything else. Second, the graph data really has no use for "tags": a run either has an associated graph or it does not. Thus, we expose an endpoint /data/plugin/graphs/runs that is different in format from the /tags routes exposed by other plugins (it returns just a list instead of a run-to-tag mapping). This change removes a bunch of tests from application_test.py. The tests cover the compression behavior of the graph endpoint, but the graph endpoint doesn't have any special logic in the way of compression. Thus, the tests are, apparently, testing that werkzeug (or whatever is relevant here) provides good compression defaults. This isn't necessarily a bad idea, but it shouldn't be coupled to the graph tests. To get test data that includes run metadata, you can run this script: https://raw.githubusercontent.com/tensorflow/tensorflow/326942394e69074d50d5889218a24c9371eff259/tensorflow/examples/tutorials/mnist/mnist_with_summaries.py PiperOrigin-RevId: 157884714 --- Commit 05a6a13f7 authored by Gunhan Gulsoy<gunan@google.com> Committed by gunan<gunan@google.com>: Make sure all writer caches are closed before deleting directories in dnn_test. --- Commit d0e761f8d authored by Gunhan Gulsoy<gunan@google.com> Committed by gunan<gunan@google.com>: Disable another test that uses matrix_set_diag on windows. 
--- Commit 8939b8562 authored by Derek Murray<mrry@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [tf.contrib.data] Re-implement IteratorGetNext as an AsyncOpKernel. This prevents the op from consuming an inter-op thread pool thread when blocked, and fixes a potential deadlock when many IteratorGetNext ops are blocked. Fixes #10369. PiperOrigin-RevId: 157878885 --- Commit 9e25c68ad authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add loss_only_head to hold additional loss terms for multi_head setup PiperOrigin-RevId: 157875934 --- Commit 7cdcd0cca authored by Benoit Steiner<bsteiner@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Filter more op types that don't benefit from constant folding. PiperOrigin-RevId: 157875168 --- Commit 366990d92 authored by Kay Zhu<kayzhu@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Fix a subtle issue in copy_insertion due to the interaction between copy overriding logic and RecordIndicesToColocatingBuffers: - When building instructions ShapeTree to be copy overridden, it is possible that we create a single kCopy for two identical instructions. An example can be: %tuple.19 = tuple(%constant.4, %constant.1793, %constant.1793) where it is used in a while.init operand, and constant.1793 is read-only within the loop and also used by another while loop. The copy overriding pass will then create the following (logical, not finalized) tuple: %tuple.19 = tuple(%constant.4, %copy.5, %copy.5) - In the subsequent pass RecordAmbiguousOrNonDistinctIndices, to add copies to ensure point_to set is distinct, the duplicate %copy.5 are ignored because they are not yet finalized, and these indices (1 and 2 in the example) are still marked as to-be copied. Therefore distinctiveness is lost. This fix applies to the override building stage, to explicitly avoid creating shared copies for non-distinct buffers. 
PiperOrigin-RevId: 157872231 --- Commit f4b8d21b8 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Change function parameters to references to avoid copying, or otherwise move from function parameters when moving reduces the amount of copying. PiperOrigin-RevId: 157867333 --- Commit 3eee61caa authored by Drew Hintz<pushespretn@gmail.com> Committed by GitHub<noreply@github.com>: fix quotes in example code from ? to " --- Commit 4905c0eae authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Remove TODO - the new tolerance is okay to keep. PiperOrigin-RevId: 157861020 --- Commit 55f6b6ff1 authored by David Soergel<soergel@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add explicit SparseTensor support to SignatureDef. PiperOrigin-RevId: 157860466 --- Commit 79099d677 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Removes default thresholds from BinaryLogisticHead and adds predict and evaluate tests for DNNClassifier. PiperOrigin-RevId: 157856471 --- Commit 54595f0f3 authored by Jianwei Xie<xiejw@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Adds the training test for LinearClassifier with n_classes=2. PiperOrigin-RevId: 157855473 --- Commit cd6c02985 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add 'streaming_curve_points' metric which returns curve [ROC, PR] approximation at specified number of points. PiperOrigin-RevId: 157851535 --- Commit 0f2db7391 authored by Peter Hawkins<phawkins@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [TF:XLA] Split union-find implementation in mark_for_compilation_pass.cc into a separate library, make it more generic. 
PiperOrigin-RevId: 157850985 --- Commit d5421cf58 authored by Justin Lebar<jlebar@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add additional concat test. PiperOrigin-RevId: 157844113 --- Commit f661128db authored by Geoffrey Irving<geoffreyi@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Remove unused overloads of SummarizeGraphDef and EqualGraphDef PiperOrigin-RevId: 157843404 --- Commit a56d59a84 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Set flow to a value during TensorArray creation, Re-enable tensor_array_ops_test in msan. PiperOrigin-RevId: 157841785 --- Commit edcc5cc13 authored by Justine Tunney<jart@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add manual test runner for vz_sorting PiperOrigin-RevId: 157841098 --- Commit 3f6404f20 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Assign a max height of 800px to images in the image dashboard. The user could always expand to actual dimensions if need be. PiperOrigin-RevId: 157838046 --- Commit c6ea6972a authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Remove debugging LOG(INFO) from previous change. PiperOrigin-RevId: 157837305 --- Commit 07d39f28e authored by freedom" Koan-Sin Tan<koansin.tan@gmail.com> Committed by Benoit Steiner<benoitsteiner@users.noreply.github.com>: make gcc-5 on Ubuntu 16.04 happy (#10385) gcc-5 complains of ambiguity and refuses to go when doing something like 'bazel build -c opt tensorflow/...' --- Commit ac66be783 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Minor cleanup: Remove unused BUILD dependencies and unnecessary code. PiperOrigin-RevId: 157837211 --- Commit 4161ccc8e authored by A. 
Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Adjust tolerance on dirichlet_multinomial test. PiperOrigin-RevId: 157834660 --- Commit 43c0f52f1 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix off-by-one error in BoolVector(begin, end) constructor. PiperOrigin-RevId: 157833086 --- Commit 419d437ba authored by Lakshay Garg<lakshay.garg.1996@gmail.com> Committed by Lakshay Garg<lakshay.garg.1996@gmail.com>: Fixed typo in code comment --- Commit 07710014d authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix device colocation for KMeans in case of multiple parameter servers. PiperOrigin-RevId: 157795360 --- Commit b659bc39f authored by Justine Tunney<jart@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Simplify TensorBoard build - Remove tensorboard_typescript_genrule - Remove tensorboard_typescript_bundle - Introduce ts_web_library Skylark rule which supports seamless TypeScript compilation. - Use Closure Compiler in semi-advanced mode to compile JavaScript. This is done in a way that preserves <script> tag placement, which causes pages to load faster and avoid FOUC, thereby making it a better solution than the existing vulcanize. PiperOrigin-RevId: 157794795 --- Commit 0503ce09c authored by Benoit Steiner<bsteiner@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Wipe out previous shape inference result when importing a grappler item Run graph optimizations last: since they can be expensive it's best to filter invalid items first. PiperOrigin-RevId: 157792834 --- Commit 9ae941c4a authored by Benoit Steiner<bsteiner@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Turn reductions along an empty set of dimensions into identity nodes. 
PiperOrigin-RevId: 157792209 --- Commit 69075f354 authored by Yangzihao Wang<yangzihao@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add functional support for cudnnConvolutionBiasActivationForward(). PiperOrigin-RevId: 157788425 --- Commit 7d7a40309 authored by William Chargin<wchargin@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Extract the distributions dashboard to a plugin This continues the great plugin migration. The distributions plugin was similar to the histograms plugin, but it also purported to allow CSV download like the scalars plugin. However, the existing implementation of this was flawed, and would always yield a 500 on current prod [1] (unless there were actually no data). This indicates that no one is actually using it---probably because there isn't a relevant button on the frontend, anyway!---so I just removed it. This also changes most frontend occurrences of "compressedHistograms" to "distributions" while we're at it. [1]: Due to the reference `value.rank_in_bps` in the handler `_serve_compressed_histograms`; this field does not exist and throws an `AttributeError`. PiperOrigin-RevId: 157787156 --- Commit 23cdf96b8 authored by Brennan Saeta<saeta@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Re-enable session_test.py A number of CLs have split up session_test.py to be a bit smaller. As a result, this CL will re-enable the session_test to see if it remains flaky. PiperOrigin-RevId: 157786407 --- Commit d741d81c5 authored by Dandelion Mané<dandelion@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Expose tf.test.StubOutForTesting in the tf testing api Also redirect TensorBoard usage to use that endpoint. This is part of my ongoing effort to have TensorBoard only depend on TensorFlow via its public api, so that it can be split into a project with a fast external build. 
PiperOrigin-RevId: 157784552 --- Commit 40411cd5c authored by Dandelion Mané<dandelion@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Refactor projector plugin to only use tf public methods. Remove all reference to the PluginAsset system, which is deprecated. Part of an ongoing effort to have TensorBoard only consume the public TensorFlow api. PiperOrigin-RevId: 157784016 --- Commit a65a70ea5 authored by Gunhan Gulsoy<gunan@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix pip tests under contrib/text PiperOrigin-RevId: 157783952 --- Commit fb4bc806a authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix flakiness in GpuMultiSessionMemoryTest. PiperOrigin-RevId: 157781368 --- Commit f7de292df authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Update placeholder nodes' shapes in the GraphDef to reflect manually specified values for incomplete placeholder shapes. Previously, these overrides were only specified in the feed nodes, which improves estimates when using dynamic shapes but not when using static shapes. With this change, static shapes also benefit. PiperOrigin-RevId: 157780800 --- Commit eebd44123 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add a frontend method for retrieving numeric alerts from the debugger plugin. This route responds with a list of alerts (occurrences of bad values) in ascending timestamp order. PiperOrigin-RevId: 157780270 --- Commit 5bc685d7f authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] If an op has a single "large" operand, we want to fuse this op into some of its consumers, even if we can't fuse into all of them. 
PiperOrigin-RevId: 157779106 --- Commit 2ee09b873 authored by Mark Heffernan<meheff@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Various improvements to ShapeTree. Add support for holding non-copyable types, operator==, and a CopySubtreeFrom method for copying a subtree from one ShapeTree to another. PiperOrigin-RevId: 157777636 --- Commit 4f3ae7699 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add beam_search kernels used by BeamSearchDecoder to tensorflow.contrib. PiperOrigin-RevId: 157775011 --- Commit 6b16c33b3 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Make audio-related logic use the audio plugin. Previously, fetching audio and related data from TensorBoard used handlers within application.py. We now remove those handlers in favor of routes offered by the audio plugin. ML Dash is updated as well. PiperOrigin-RevId: 157774953 --- Commit 8032e1f75 authored by Geoffrey Irving<geoffreyi@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Make function instantiation use std::vector<NodeDef> instead of GraphDef It's about to turn into std::vector<NodeInfoPtr>; this change gets us partway there. RELNOTES: n/a PiperOrigin-RevId: 157771141 --- Commit 2e44be35d authored by Vinu Rajashekhar<vinuraja@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Adds a protected DeleteResourceMgr(...) method in Device. PiperOrigin-RevId: 157770378 --- Commit cc346e690 authored by Benoit Steiner<bsteiner@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Strip the :x suffix when generating control inputs from input names PiperOrigin-RevId: 157770257 --- Commit d6fe47af5 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Use tensorflow::StringPiece in literal_util. 
Use template for RepeatedField assignment. PiperOrigin-RevId: 157765477 --- Commit 7866fa01b authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: This change significantly reduces time and resources used to load large TensorFlow graphs. For a real-world large graph (13k nodes, 20k edges), this change: * reduces all heap allocations by 19% * reduces retained (final) heap allocations by 2.2% * reduces CPU time by 11.2% In most TF graphs, the set of unique values set to Node::assigned_device_name() is quite small. This change adds an interning table to the Graph object, which contains all of the unique values used for Node::set_assigned_device_name(), as well as a look-up table. This is the main source of the reduction in retained heap memory; nearly all nodes are assigned to just one or two unique devices. This change removes the "string assigned_device_name_" field from the Node class, and replaces it with "int assigned_device_name_index_". However, because you need both the index and the name table to get the actual value, the Node::assigned_device_name() accessor needs access to the parent Graph. This requires adding a "Graph* graph_" field to the Node class. In the future, if all users of this property are converted to use Graph::assigned_device_name(Node*), then the Node::graph_ field can be deleted, and the space reclaimed. However, doing so is out of the scope of this CL, and even with this new pointer field, the Node class is smaller than it was before, so this is still a net win. The placement algorithm in simple_placer.cc is one of the main accessors of the Node::assigned_device_name property. This CL contains significant changes to simple_placer.cc, which directly take advantage of the fact that the property is an index into a name table, rather than treating it simply as a string. 
Many temporary allocations are also removed, which is the main source of the reduction in total heap allocations. This CL also contains a few changes that remove short-lived allocations in unrelated code, such as the changes in op.cc/h, costmodel.cc, etc. It is extremely easy in C++ to accidentally allocate memory, especially when implicit conversions and copy constructors allocate memory. All of the changes in this CL were motivated by empirical measurement, using CPU profiling and heap profiling. PiperOrigin-RevId: 157762909 --- Commit fdffafbc1 authored by Benoit Steiner<bsteiner@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add QueueDequeueUpTo to the list of dequeue ops PiperOrigin-RevId: 157760201 --- Commit 7ad0d0698 authored by Mustafa Ispir<ispir@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add type error to start_queue_runners if given session is not a `tf.Session`. Due to semver, we suppress the error if a MonitoredSession is provided. PiperOrigin-RevId: 157748375 --- Commit 7106f9fac authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Implemented an initial version of virtual scheduler unit test. PiperOrigin-RevId: 157746305 --- Commit b020db0c6 authored by Andrew Harp<andrewharp@google.com> Committed by Andrew Harp<andrewharp@google.com>: revert public visibility --- Commit 5b05728c2 authored by Andrew Harp<andrewharp@google.com> Committed by Andrew Harp<andrewharp@google.com>: visibility workaround 3 --- Commit 15a740ebb authored by Mustafa Ispir<ispir@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Update and Move DNNLinearCombinedRegressor to estimator/canned. PiperOrigin-RevId: 157744087 --- Commit d29bbeca3 authored by Dandelion Mané<dandelion@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix outdated code ref in TensorBoard README, add link to SO question. 
PiperOrigin-RevId: 157743374 --- Commit 9fc164225 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix index_table_from_file to allow vocabulary_file be a Tensor PiperOrigin-RevId: 157740677 --- Commit 0aa3e0194 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Internal change PiperOrigin-RevId: 157740660 --- Commit 02ac85399 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Introduce new class Literal to replace protobuf Literal. This renames the existing Literal message to LiteralProto and introduces a new C++ class named Literal to replace it. The LiteralProto is only used at RPC boundaries, or when protobuf-specific functionality is required. The Literal class offers a 'ToProto' function to generate a new LiteralProto message when necessary. Currently, all the static functions in class LiteralUtil, just forward to their counterparts in class Literal. This will change in a future CL. Class Literal implements all the buffers as std::vectors. The only exception is preds(), which given the std::vector<bool> representation, makes it unusable for the semantics we require (it's not possible to get the address of the underlying vector, for instance). The CL adds a BoolVector class to work around that issue. In future CLs, the std::vector representation may be changed to something more efficient, if needed. PiperOrigin-RevId: 157739125 --- Commit 207203253 authored by gunan<gunan@google.com> Committed by GitHub<noreply@github.com>: Python 3.6 support on windows. (#10356) * Python 3.6 support on windows. * Fix typo in README.md * Make environment configurable for windows gpu build. --- Commit 2b75a9a6e authored by A. 
Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Go: Update generated wrapper functions for TensorFlow ops. PiperOrigin-RevId: 157734029 --- Commit f60b6bdcb authored by Mustafa Ispir<ispir@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add a warning to documentation of MonitoredSession. PiperOrigin-RevId: 157728225 --- Commit eb10a4c49 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Preallocate vector storage when the ultimate vector size is known in advance PiperOrigin-RevId: 157724431 --- Commit ce32228c4 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add release notes for Intel MKL integration. PiperOrigin-RevId: 157722003 --- Commit a23255bc0 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Adds missing group OP to benchmark PiperOrigin-RevId: 157716500 --- Commit d3e840a6c authored by Asim Shankar<ashankar@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Disable writing of compressed checkpoints. Snappy compression (and decompression) was enabled after the 1.1 release (in commit 63b2f999d3f22cfe915b89103faa1b0a1b1b7617). This means that checkpoints produced by the 1.2.0 release candidates will cause TensorFlow 1.1 (and prior) binaries to crash as they CHECK fail when trying to load snappy-compressed tables. To ease transition, disable writing of compressed checkpoints in 1.2.0 for now. Reconsider this in the next release. PiperOrigin-RevId: 157675189 --- Commit 6db400bbc authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Refactoring Python op code generation. PiperOrigin-RevId: 157675126 --- Commit d9620cab8 authored by A. 
Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add flag to determine whether to do L1 optimizations and inline functions. Default is to do them. In tf_optimizer don't inline or do l1 optimizations. PiperOrigin-RevId: 157673614 --- Commit 25bb504cc authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Make a plugin that serves data for the audio dashboard. Subsequent changes will make TensorBoard use this audio plugin instead of the previous handlers for audio-related data. PiperOrigin-RevId: 157673132 --- Commit 24623653b authored by James Qin<jamesqin@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix graph text format serialization PiperOrigin-RevId: 157669530 --- Commit 3aed1735c authored by Andrew Harp<andrewharp@google.com> Committed by Andrew Harp<andrewharp@google.com>: visibility workaround 2 --- Commit fea90f89d authored by Andrew Harp<andrewharp@google.com> Committed by Andrew Harp<andrewharp@google.com>: visibility workaround --- Commit 732a6b1ae authored by Justine Tunney<jart@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Upgrade TypeScript to v2.3.4 PiperOrigin-RevId: 157667511 --- Commit 95d90ab2e authored by Luke Iwanski<luke@codeplay.com> Committed by Benoit Steiner<benoitsteiner@users.noreply.github.com>: [OpenCL] Fixes Split op (#10322) * [OpenCL] Fixes Split op Split should always go through SYCL device * [OpenCL] Removes half from registered types --- Commit 963441400 authored by Luke Iwanski<luke@codeplay.com> Committed by Benoit Steiner<benoitsteiner@users.noreply.github.com>: [OpenCL] Extends softmax op to cover double (#10323) --- Commit a702863e8 authored by Luke Iwanski<luke@codeplay.com> Committed by Benoit Steiner<benoitsteiner@users.noreply.github.com>: [OpenCL] Extends tile ops to int16 and int32 (#10328) * [OpenCL] Extends tile ops to int16 and int32 * [OpenCL] 
Extends tile_ops to cover bool, uint8, int16, int64 --- Commit 75385814f authored by cxx<cxxgtxy@gmail.com> Committed by cxx<cxxgtxy@gmail.com>: Fix comments error in mnist_replica.py where only one ps is used with two workers by default. --- Commit 23364e2c6 authored by Andrew Harp<andrewharp@google.com> Committed by Andrew Harp<andrewharp@google.com>: buildifier fix --- Commit e5088cb82 authored by Yao Zhang<yaozhang@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix discrepancy between measured and analytical cost graph. Use tf_cuda_library for utils. PiperOrigin-RevId: 157660745 --- Commit 787381ca5 authored by Brennan Saeta<saeta@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Split up session_test.py -> session_clusterspec_prop_test.py session_test.py has gotten very large. Additionally, recently it has become flaky. In order to both (1) improve overall code health, and (2) to facilitate root-causing the test flakiness, this CL begins to split apart session_test into focused subsets. I've suffixed the scoping of the session_test in order to preserve filesystem sort-order grouping. PiperOrigin-RevId: 157658981 --- Commit b09932d74 authored by Benoit Steiner<bsteiner@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Added PlaceholderWithDefault to the list of known placeholder types Use PartialTensorShape instead of TensorShapes to better handle partially known shapes PiperOrigin-RevId: 157657664 --- Commit 0462416f6 authored by Dandelion Mané<dandelion@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add make_ndarray, tensor_proto, and MetaGraphDef to tf api. Since TensorProtos are part of the TensorFlow API, it makes sense to also include the methods that generate and parse them. Similarly, we write out MetaGraphDef protos in the summary writer, so we should provide the proto as well. 
This is part of an ongoing effort to have TensorBoard only consume TensorFlow methods through the public api. PiperOrigin-RevId: 157657564 --- Commit 458f94c12 authored by Wei Ho<weiho@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Open-source skip-gram ops PiperOrigin-RevId: 157655970 --- Commit faac0331c authored by Justine Tunney<jart@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Introduce tensorboard_zip_file build rule This rule can depend on web_library or tensorboard_html_binary. In both cases it will create a .zip file containing all the transitive web server paths. This can be used to deploy static assets to web servers. A small change was also made to Vulcanize to support path overriding. PiperOrigin-RevId: 157655047 --- Commit 7ed44f4c9 authored by Brennan Saeta<saeta@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Split up session_test.py -> session_partial_run_test.py session_test.py has gotten very large. Additionally, recently it has become flaky. In order to both (1) improve overall code health, and (2) to facilitate root-causing the test flakiness, this CL begins to split apart session_test into focused subsets. I've suffixed the scoping of the session_test in order to preserve filesystem sort-order grouping. PiperOrigin-RevId: 157651813 --- Commit 3c7ac46ae authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Teach Executable to do its own profiling (patch 4/4). This CL removes the xla::Service stub for ExecuteOnStreamWrapper so the users call the xla::Executable version directly, and simplifies the function API to simply accept "arguments" as a parameter (with a templated type) rather than requiring the user to capture it into a lambda around the relevant Executable::ExecuteOnStream method. 
PiperOrigin-RevId: 157651740 --- Commit 626f95ab9 authored by Peter Hawkins<phawkins@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [TF:XLA] Don't enforce that all nodes in an encapsulated subgraph are on the same device. Use the assigned device rather than the user-requested device when converting a Graph to a FunctionDef. PiperOrigin-RevId: 157648977 --- Commit 414470329 authored by Jacques Pienaar<jpienaar@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Guard stream pool with mutex. A data race can occur while populating the map. PiperOrigin-RevId: 157647183 --- Commit ccdb30763 authored by Eugene Brevdo<ebrevdo@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Additional colocation options and bugfixes for TensorArray * colocate_with is now set properly when a TensorArray is passed through a while_loop * added a new argument, "colocate_with_first_write" (default: True; this is the current behavior). If False, the TensorArray is simply placed on the device from the context it's constructed in, and no colocation constraints are added. PiperOrigin-RevId: 157643133 --- Commit 03fc7022b authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Update ops-related pbtxt files. PiperOrigin-RevId: 157642677 --- Commit 41b87d6ce authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add a new attribute narrow_range to FakeQuant* operations. It quantizes into range [1; 255] instead of [0; 255]. PiperOrigin-RevId: 157641054 --- Commit c048e2938 authored by Alexandre Passos<apassos@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Adds support to non-placeholder inputs in _graph_to_function_def. Specifically, supports input ops with more than one output tensor. 
PiperOrigin-RevId: 157640908 --- Commit d310de4fa authored by Brennan Saeta<saeta@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Split up session_test.py -> session_list_devices_test.py session_test.py has gotten very large. Additionally, recently it has become flaky. In order to both (1) improve overall code health, and (2) to facilitate root-causing the test flakiness, this CL begins to split apart session_test into focused subsets. I've suffixed the scoping of the session_test in order to preserve filesystem sort-order grouping. PiperOrigin-RevId: 157640788 --- Commit 8e868cf6a authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Remove unused arguments to call_cpp_shape_fn. PiperOrigin-RevId: 157640125 --- Commit 9ddbf31fe authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Use unnamed namespace to effect internal linkage, replace string constructors with array-deducing helper function PiperOrigin-RevId: 157636308 --- Commit 88ffe6276 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Increase cholesky_op_test to medium, bump shard_count 1 more. PiperOrigin-RevId: 157635774 --- Commit bef563dc8 authored by Benjamin Kramer<kramerb@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Don't add constraints for computations we're not currently looking at. TuplePointsToAnalysis is computed globally per module, so we add all unconstrained buffers in that module, even if it's outside of the computation we're currently running on. Then we proceed to propagate default layouts to all those buffers and then throw the constraints away because they don't affect any instruction in the current computation. PiperOrigin-RevId: 157635564 --- Commit a980aead8 authored by A. 
Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Use test_adjusted_name when making the mangled_test_name in run_and_gather_logs_lib.py, to avoid duplicate file names when the same test is run on multiple GPUs. PiperOrigin-RevId: 157630193 --- Commit 0a84cfd58 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Update ops-related pbtxt files. PiperOrigin-RevId: 157629497 --- Commit 6882effb8 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Make single-parameter constructors explicit PiperOrigin-RevId: 157628970 --- Commit 0b8070253 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Support negative axis for Split op PiperOrigin-RevId: 157628162 --- Commit 289e7bf5b authored by gunan<gunan@google.com> Committed by GitHub<noreply@github.com>: Fixes and improvements to cmake windows build. (#10354) * Disable linalg ops tests on windows. * Do not print the full source code path for logs on windows. --- Commit bc236cfc3 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Passes classification head to LinearClassifier. PiperOrigin-RevId: 157624020 --- Commit cebd7e246 authored by Luke Iwanski<luke@codeplay.com> Committed by Shanqing Cai<cais@google.com>: [OpenCL] Cleans debug ops (#10334) * [OpenCL] Cleans debug ops * Acts on feedback from #10334#discussion_r119452513 * Acts on #10334#discussion_r119459463 --- Commit fd6c3c4f1 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fixes flaky test in dnn_linear_combined_test. 
PiperOrigin-RevId: 157622951 --- Commit c9cc388dc authored by Asim Shankar<ashankar@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Avoid CHECKs in BundleReader, propagate errors instead. Motivation: We'd like to evolve the checkpoint format over time (e.g., enable different types of compression). Without this change, a TensorFlow version that encounters a format that it doesn't understand would CHECK fail with an unhelpful error message. With this, it propagates a clearer error message up, giving the user some hints about what could be wrong. I don't have a unittest for this - I thought about writing a bundle and then strategically corrupting the bytes on disk before reading it back, but that seems a bit much. The intention of this change is to enable graceful reporting of forward compatibility breakages. Ideas for an appropriate unittest are appreciated. PiperOrigin-RevId: 157620358 --- Commit ee05b8b69 authored by Wei Ho<weiho@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix to remove TF op usage outside of the initializer fn (due to deferred execution of initializer fn, this prevent issues with graph mismatch). PiperOrigin-RevId: 157620177 --- Commit e8d17ea8c authored by Benoit Steiner<bsteiner@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Materialize shapes that are known at graph construction time into constants that can be folded PiperOrigin-RevId: 157619380 --- Commit dc0427d48 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Directly depend on the used libraries Do not rely on transitive dependencies. PiperOrigin-RevId: 157618184 --- Commit 964d1a509 authored by Yuan Yu<yuanbyu@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix a bug that an erroneous control edge can be introduced when loops are nested in control dependency context. 
PiperOrigin-RevId: 157616919 --- Commit 2de94bbb8 authored by Eli Bendersky<eliben@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Add an option to set the "generate HLO graph" regex without a flag. Pipes the option through xla.proto ExecutionOptions, to HloModuleConfig, which can then be accessed throughout the compiler. PiperOrigin-RevId: 157615458 --- Commit d3c0482e6 authored by My name is<raviqqe@gmail.com> Committed by gunan<gunan@google.com>: Fix a typo in export_output.py (#9975) --- Commit 0c75d9f52 authored by ddurham2<ddurham@davyandbeth.com> Committed by gunan<gunan@google.com>: Adding lost documentation to tf.abs from the old tf.complex_abs when it learned how to work on complex data. (#9954) --- Commit 84661fa73 authored by Benoit Steiner<bsteiner@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Propagate control dependencies during constant folding PiperOrigin-RevId: 157610040 --- Commit a3520340e authored by gunan<gunan@google.com> Committed by GitHub<noreply@github.com>: Improve windows bazel python test suite. (#10305) * Improve windows bazel python test suite. - Create new tags, no_windows and no_windows_gpu - Instead of a separate maintained list, use bazel tags to exclude tests. - Tag all the python tests that are known to have issues in windows. * Also blacklist neon_depthwise_conv_ops_test in windows. * Only build tests in CPU windows tests. * Only build tests in GPU windows tests. * Also disable session_test on windows. * Only run py tests on windows, and only build tests that are not disabled. --- Commit a6f284ca4 authored by Jianwei Xie<xiejw@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Adds integration tests for LinearRegressor. PiperOrigin-RevId: 157604107 --- Commit d21bf7d75 authored by Francois Chollet<fchollet@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Backport changes from Github master. 
PiperOrigin-RevId: 157603238 --- Commit 43bfc138c authored by Shanqing Cai<cais@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix OSS compilation error in tfprof_main.cc PiperOrigin-RevId: 157602449 --- Commit 904a3d075 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fixing issue with cuda compilation related to missing include (exception is only thrown when running with sandboxing on) PiperOrigin-RevId: 157602401 --- Commit f59203c98 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Shard cholesky_op_test. PiperOrigin-RevId: 157601172 --- Commit 3fdbb5579 authored by Amit Patankar<amitpatankar@google.com> Committed by Amit Patankar<amitpatankar@google.com>: Merging rc1 back into master. --- Commit be5d98a8b authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Adds integration tests for DNNClassifier. PiperOrigin-RevId: 157592010 --- Commit a05de6cd2 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Change reporting feature importances in RandomForestEstimator to run at the end of training, instead of part of the inference graph. PiperOrigin-RevId: 157591575 --- Commit e96f1142f authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Remove unnecessary casts PiperOrigin-RevId: 157591439 --- Commit 5f8571a6b authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix missing namespace comments PiperOrigin-RevId: 157591364 --- Commit eeb0b4067 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Go: Update generated wrapper functions for TensorFlow ops. 
PiperOrigin-RevId: 157573997 --- Commit 7f9674217 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Update ops-related pbtxt files. PiperOrigin-RevId: 157573723 --- Commit 473a590c9 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Allow complex valued input for Cholesky decomposition. PiperOrigin-RevId: 157572536 --- Commit 2d1860859 authored by Blake Hechtman<blakehechtman@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix test name in array_elementwise_ops_test. PiperOrigin-RevId: 157552402 --- Commit a7fff05e0 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: tfprof multi-step profiling. This allows users to fill in RunMetadata across different steps. 1. It is useful for RL model which runs a subset of graph each step. 2. It also gets averages of multi-step stats. 
PiperOrigin-RevId: 157552388 --- Commit fe589d9e7 authored by Luke Iwanski<luke@codeplay.com> Committed by Benoit Steiner<benoitsteiner@users.noreply.github.com>: [OpenCL] Implementation improvements (#9117) * OpenCL Improvements * Registers Scatter and ScatterNd Ops for SYCL * Registers Stack op for SYCL * Fixes No sycl buffer found error for debug ops * Registers MatMul and Transpose Ops to SYCL device for double * Extends analyzer_cli_test.py test to cover SYCL * Fixes Transpose Op for double when on SYCL * Bumps Eigen version to fix double precision issue on SYCL * Extends SessionDebugTestBase to cover SYCL * Register SYCL implementations for random ops * Avoid functions that might not be defined on SYCL device (#51) * Avoid functions that might not be defined on SYCL device * Simplify by using Eigen math functions * OpenCL improvements - Bumps Eigen Version - Refactors Ops registration - Introduces workaround for Const Op related to the difference between CUDA which uses pointers and OpenCL that uses buffers/accessors - Extends memory types to cover DEVICE_SYCL as well - Introduces GetSYCLDevice() method that returns list of supported devices with GPU device having the highest priority ( doesn't include blacklisted devices ) - ::internal::Transpose -> tensorflow::internal::Transpose in order to avoid compilation reported error - re-introduces fix for bugged string replacement causing a lot of compilation warnings -c -> --include - Adds sycl_runtime to bazels ARRAY_DEPS - Replicates TF_CALL_GPU_PROXY_TYPES for SYCL * [OpenCL] Fixes an issue caused by switch to aligned allocator for sycl buffer (#53) * [Build] Use gcc/g++ as a host compiler to avoid #8394 (#54) * [OpenCL] Fixes Scatter Op * Fix testSimple and testConst in stack_op_test (#3) * Fix testSimple and testConst in stack_op_test * Create a specialisation of DoParallelConcatUpdate for SyclDevice and register it * Guard all code in TENSORFLOW_USE_SYCL * Do not use sycl device for int32 * Registration of 
the Sycl version is now looking like the one for the GPU * Remove added empty line * Register batch normalization kernels for OpenCL (#61) * [OpenCL] RandomGamma has no GPU friendly implementation (#57) * [OpenCL] Compatibility fixes for TensorFlow 1.1.0-rc1 * [OpenCL] Implements BatchMatmul Op for SYCL * Lowercase the device name when GPU or SYCL returned * [OpenCL] kernel_estimator_test.py assertEqual-> assertAlmostEqual due to floating point representation on the device * [Eigen] Version bump * GPU device name string manipulation is not needed anymore * [OpenCL] Adds SYCL to device backwards compatibility * [OpenCL] Extends core_rnn_test.py to run for SYCL device * [OpenCL] Minor optimizations for build script * [OpenCL] Enables skip folder list in build script * [OpenCL] Fixes ApplyAdamOp for Sycl device * [OpenCL] SYCL device improvements * [OpenCL] Fixes debug_ops's SEGFAULT for SYCL device * [Build] Adds hexagon to skipped folders list * [OpenCL] Removes EnterLameDuckMode from SYCL device and allocator * [OpenCL] Registers Unique Op for SYCL device * [OpenCL][Temporary] Disables tests for SYCL target due to features not being implemented yet Tests affected: - tensorflow/contrib/memory_stats/python/kernel_tests/memory_stats_ops_test.py - tensorflow/contrib/rnn/python/kernel_tests/core_rnn_test.py - tensorflow/python/kernel_tests/conv_ops_test.py - tensorflow/python/kernel_tests/depthwise_conv_op_test.py - tensorflow/python/kernel_tests/pooling_ops_3d_test.py - tensorflow/python/kernel_tests/pooling_ops_test.py - tensorflow/python/kernel_tests/scatter_nd_ops_test.py - tensorflow/python/training/adam_test.py - tensorflow/python/training/localhost_cluster_performance_test.py - tensorflow/python/training/training_ops_test.py * [OpenCL][Temporary] Disables failing tests for SYCL in order to establish regression baseline Tests affected: - tensorflow/python/debug/cli/analyzer_cli_test.py - tensorflow/python/debug/lib/session_debug_testlib.py - 
tensorflow/python/debug/lib/stepper_test.py - tensorflow/python/kernel_tests/unstack_op_test.py - tensorflow/python/ops/image_ops_test.py * [OpenCL] Take options.config.device_count() into consideration * [OpenCL] Fixes compilation warning * [OpenCL] device:SYCL:0 -> sycl:0 * [OpenCL] Removes unwanted flags in building script Removes flags given to computecpp that enable SIMD instructions Removes duplicate flags * bool -> const bool * [OpenCL] sycl in test_util.gpu_device_name() -> is_sycl_enabled() * [OpenCL][Temporary] Disables failing tests for SYCL in order to establish regression baseline Test affected: - tensorflow/contrib/stateless/python/kernel_tests/stateless_random_ops_test.py * Imports test_util from tensorflow.python.framework * [OpenCL] Fixes formatting in Python code * [OpenCL] Extends session_test.py to cover SYCL device * [OpenCL] Cleans singleton class * [OpenCL] Keeping CUDA happy * [OpenCL][Temporary] Disables failing tests for SYCL in order to establish regression baseline Test affected: - tensorflow/contrib/rnn/python/kernel_tests/core_rnn_cell_test.py - tensorflow/contrib/seq2seq/python/kernel_tests/beam_search_ops_test.py * Added support for building with SYCL on ARM. * Acts on the review feedback from: - #9117#discussion_r113608975 - #9117#discussion_r113609173 * [OpenCL] Fixes scatter_nd_op_test * Fixes auto-merge mistake * [OpenCL] struct SyclDevice -> class SyclDevice * Revert "[OpenCL] struct SyclDevice -> class SyclDevice" This reverts commit addd43348c374a5379f67bb1e5ad084715722fc2. * [OpenCL] Reverting refactoring commit. As requested in the review #9117#issuecomment-298454466 This change set will be re-introduced in smaller chunks. * Revert "[OpenCL] device:SYCL:0 -> sycl:0" This reverts commit cf16e60340b62d16c3764d71b716fe03d35f87a9. * Revert "[OpenCL] Adds SYCL to device backwards compatibility" This reverts commit b8401b5164199b7a169be1c1d8dea5001195c390. 
* Acts on the feedback from #9117#discussion_r115036905 * control_flow_ops_py_test.py expects device name to be lower cased * Acts on the feedback from #9117#discussion_r115037222 * Removes debug print * Removes not needed partial specialisation * [OpenCL] Registers ScatterNdFunctor for SYCL device * [OpenCL] Make it compile * [OpenCL] Follow gpu_device changes * [OpenCL] Adds cxx_builtin_include_directory for python lib Fixes bazels missing undeclared inclusions that appeared after merge with TensorFlow upstream * [OpenCL] Fixes Constant Op * [OpenCL] gXX-4.8 -> gXX * [OpenCL] Removes -D_GLIBCXX_USE_CXX11_ABI=0 as it breaks default compiler setup for Ubuntu 16.04 * Revert "[OpenCL] kernel_estimator_test.py assertEqual-> assertAlmostEqual due to floating point representation on the device" This reverts commit 06c50c0a485f40c30a436f02c3fa7794e370c49d. * [OpenCL] CPU allocator is a singleton we should not delete it --- Commit 7aac2395c authored by Blake Hechtman<blakehechtman@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Merge a copies of copies. PiperOrigin-RevId: 157549434 --- Commit 37d9d5f0e authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add some routines for managing summaries to slim. PiperOrigin-RevId: 157541902 --- Commit d58cd2962 authored by Justine Tunney<jart@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix weblas license mirror URL PiperOrigin-RevId: 157537115 --- Commit 5c13ee13b authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Make images-related logic use the images plugin. Previously, fetching images and related data from TensorBoard used handlers within application.py. We now remove those handlers in favor of routes offered by the images plugin. ML Dash is updated as well. PiperOrigin-RevId: 157536471 --- Commit 60394a3d1 authored by A. 
Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Reduce size of the no-winograd tests, but still large enough that ShouldIncludeWinogradNonfusedAlgo returns true. PiperOrigin-RevId: 157535386 --- Commit 9501c4104 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Replace protobuf CopyFrom with assignment PiperOrigin-RevId: 157534272 --- Commit 96698f7fd authored by Eugene Brevdo<ebrevdo@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [tf contrib seq2seq] Improve BeamSearchDecoder's ability to handle unknown shapes. Updated unit tests to contain inputs of unknown shape (at graph build time). Found an issue in the gather helper that stops it from properly propagating the batch size of the output shape. This caused problems with tf.while_loop. Fixed. PiperOrigin-RevId: 157533937 --- Commit 5c73d0102 authored by Neal Wu<wun@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Batch norm docs fix applied to _fused_batch_norm as well PiperOrigin-RevId: 157530527 --- Commit abd4aa49a authored by Jonathan Hseu<jhseu@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix docs for tf.abs() and tf.pow(). PiperOrigin-RevId: 157528475 --- Commit dd5ad6917 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Declarations of operators to support batch norm in xla PiperOrigin-RevId: 157527596 --- Commit bbeaa1307 authored by Jianwei Xie<xiejw@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix the expand_dim for label and weight for classifier heads. PiperOrigin-RevId: 157524909 --- Commit 346021ab4 authored by A. 
Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Cleanup: Use C++ casts, remove redundant casts, use CHECK_OK PiperOrigin-RevId: 157522142 --- Commit e405b0f6b authored by Francois Chollet<fchollet@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Refactoring of layer name autogeneration, to remove a graph serialization warning. PiperOrigin-RevId: 157520123 --- Commit 5784e1e35 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add HasOutputProperties to check for pruned ops; Return device name instead of casting it to a short name (GPU:0/CPU:0); VLOG(2) when printing op device placement since it is a lot of output. PiperOrigin-RevId: 157519077 --- Commit 2994444bf authored by Peter Hawkins<phawkins@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Issue a more user-friendly error message if a variable's initializer is from inside a control-flow scope, such as tf.cond() or tf.while_loop(). Fixes #8604. PiperOrigin-RevId: 157516279 --- Commit da2daf068 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Remove unused using declarations PiperOrigin-RevId: 157513772 --- Commit 8b2e8b566 authored by Derek Murray<derek.murray@gmail.com> Committed by gunan<gunan@google.com>: Exclude Python test files from CMake PIP package. (#10302) * Exclude *_test.py files from the CMake-built PIP package. * Add stray _test.py file to the PIP package. * Nit. Convert tabs to spaces in tf_python.cmake --- Commit 2249a4ea8 authored by Dan Ringwalt<ringwalt@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix control reaching the end of ProjectiveGenerator. PiperOrigin-RevId: 157510013 --- Commit 040e2e20f authored by A. 
Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Remove unneeded check for has properties in grappler. PiperOrigin-RevId: 157507665 --- Commit 684006955 authored by Yun Peng<pcloudy@google.com> Committed by gunan<gunan@google.com>: Windows: Remove session_test from bazel_test_lib.sh (#10274) It was disabled in 49b17146d2e4f04192d16ed67574142de167f3a1 --- Commit 890a0a407 authored by Gunhan Gulsoy<gunan@google.com> Committed by Gunhan Gulsoy<gunan@google.com>: Upgrade TF ci build and docker files to use bazel 0.5.0 --- Commit 46db634e5 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Only run the no-winograd tests once each. Only run the no-winograd tests on GPU; this also fixes timeouts in asan and msan. PiperOrigin-RevId: 157505317 --- Commit a6cd4e735 authored by Dandelion Mané<dandelion@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Remove all TB build references that circumvent TF's public API. This doesn't actually remove all the code references, lots of code references continue to work despite the BUILD references being removed. I think this is because depending on the public api transitively makes all of TensorFlow's guts available too. PiperOrigin-RevId: 157502987 --- Commit dcc3cdce8 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Remove redundant get() calls and string conversions PiperOrigin-RevId: 157497932 --- Commit af2b9d875 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix the trace inputs functionality of the graph explorer. After migrating to d3 v4, the graph can no longer directly index into d3.Selections to obtain elements. Instead, we must use the nodes method of d3.Selection to generate an array of selected elements. 
PiperOrigin-RevId: 157493509 --- Commit 5cf484584 authored by Jacques Pienaar<jpienaar@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Small test that performs A*B+A and A*B+B. PiperOrigin-RevId: 157492992 --- Commit b2355913b authored by Androbin<robin.richtsfeld@gmail.com> Committed by drpngx<drpngx@users.noreply.github.com>: remove some invalid entries (#10294) I noticed that some entries don't exist (anymore). This seems to be some kind of a consistency issue. More specifically: `tensorflow/contrib/ios_examples/camera/data` `tensorflow/contrib/session_bundle/testdata/saved_model_half_plus_two` `tensorflow/contrib/session_bundle/testdata/saved_model_half_plus_two/variables` This is the continuation of PR #10264 --- Commit 367ec84f8 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add SampleEmbeddingHelper to do sampling at inference time PiperOrigin-RevId: 157487623 --- Commit a3ba225d5 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add BatchMatMul execution cost prediction PiperOrigin-RevId: 157487507 --- Commit 34a29fc3b authored by Eric Liu<ioeric@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [TF:XLA] preserve metadata when replacing HLO instructions. The motivation is to add metadata for HLO instructions that are created to replace existing HLO instructions during optimizations. The assumption is that the old instruction and the new instruction would perform the same function, and that they would be correlated to the same TF op. This might not always be correct since HLO optimizations can cross TF op boundaries. But still this seems to be better than nothing. Note that this still doesn't fully resolve missing OpMetadata after HLO optimizations; new instructions might be added without using ReplaceInstruction. 
PiperOrigin-RevId: 157484394 --- Commit 092a7b6e6 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Disable keras lstm test in tsan. PiperOrigin-RevId: 157484268 --- Commit 7280dafca authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Use "empty" member function to test for emptiness PiperOrigin-RevId: 157483181 --- Commit 6c3b15915 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Expands integration tests in dnn_test. PiperOrigin-RevId: 157476608 --- Commit 727193b1f authored by Androbin<robin.richtsfeld@gmail.com> Committed by drpngx<drpngx@users.noreply.github.com>: add missing import for `signal` package (#10264) * add missing import for `signal` package * add missing dependency for `signal` package * Update tf_python.cmake --- Commit 21461213d authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Remove unused BUILD dependencies PiperOrigin-RevId: 157473460 --- Commit 4788ca2be authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix handling of Infinity/NaN in line chart domain Test Plan: - Use the script listed below to generate data that has enough infinities for these values to not be treated as outliers. - Load the data into TensorBoard (`--logdir /tmp/infbug`) and look at the scalars plot; also look at the console. - Before this change, the chart is completely blank, and there is a console warning: "QuantitativeScales cannot take NaN or Infinity as a domain value. Ignoring." - After this change, there is no console output, and the chart appears as intended: a reasonable domain is shown, and the infinities just shoot off the chart. 
Generating script:

```py
import tensorflow as tf

LOGDIR = '/tmp/infbug'
STEPS = 134

def main():
  x = tf.Variable(3.1415)
  y = x.assign_add(x)
  tf.summary.scalar('y', y)
  summ = tf.summary.merge_all()

  sess = tf.Session()
  writer = tf.summary.FileWriter(LOGDIR)
  writer.add_graph(sess.graph)
  sess.run(tf.global_variables_initializer())
  for step in xrange(STEPS):
    writer.add_summary(sess.run(summ), step)
  writer.close()

if __name__ == '__main__':
  main()
```

PiperOrigin-RevId: 157472340 --- Commit 49476a62c authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Remove unused namespace aliases PiperOrigin-RevId: 157468609 --- Commit d83074847 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Use "nullptr" for null pointer values PiperOrigin-RevId: 157468186 --- Commit b73fea6e2 authored by Tim Harley<tharley@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Refactor `tf.Operation.traceback` implementation in to methods of tf.Graph. Adds an `_extract_frame_info` method to allow derived classes to extend the information available in each op traceback, if desired. The default result of `tf.Operation.traceback` is unchanged. Also fixes a poorly scoped `pylint disable=line-too-long`, so adds the necessary enable/disable blocks to silence pylint for the offending docstrings. PiperOrigin-RevId: 157466174 --- Commit f7ca8db7d authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [XLA] Improve shape inference error messages for DynamicSlice/DynamicUpdateSlice. PiperOrigin-RevId: 157461335 --- Commit 8c2a079ec authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Adding a slot / accumulator warmstart initializer that overrides the provided partitioner at call time with one passed at construction time. 
This is intended to be used for slot Variables (such as accumulators) associated with Optimizers, since these Variables are created in a fashion that relies on replicating the exact shape of the associated primary variables (see slot_creator). PiperOrigin-RevId: 157453498 --- Commit 73d10599f authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Default CUDNN_HOME to CUDA_TOOLKIT_TARGET_DIR. The cuDNN distro is most naturally installed in the same directory as the CUDA SDK, so try to find it there if the user doesn't specify any other directory. PiperOrigin-RevId: 157436253 --- Commit eb7cf9331 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Go: Update generated wrapper functions for TensorFlow ops. PiperOrigin-RevId: 157429266 --- Commit 346dcc0a4 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Update ops-related pbtxt files. PiperOrigin-RevId: 157429078 --- Commit 3d5ede131 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Update documentation for sparse_matmul op to reflect gradient calculation. PiperOrigin-RevId: 157428135 --- Commit 822d64f0c authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix embedding_lookup() bug where normalization did not work with ids of rank != 1. PiperOrigin-RevId: 157422220 --- Commit 8cad6b824 authored by Jianwei Xie<xiejw@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Improve the error message for live set memory check. PiperOrigin-RevId: 157415647 --- Commit 34dcd5b49 authored by Eugene Brevdo<ebrevdo@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: [tf contrib seq2seq] Bugfixes to BeamSearchDecoder Implementation by Cinjon Resnick. 
He can't push this since he's traveling. I just copied the fix and added some small syntax tweaks to make the unit tests pass. More comprehensive unit tests will come in the near future. Fixes at least part of #9904. BeamSearchDecoder: 1. Fix the bug where we don't pass the next cell state through. 2. Gather the cell state (and attention if that's a part of the model as an AttentionWrapper on the cell) according to the next_beam_ids. PiperOrigin-RevId: 157415564 --- Commit f7ae1461c authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix oversampling in the GPU version of multinomial due to an error in generating gumbel noise. -log(-log(U)) gives infinity if U draws a hard 0. Adds a tiny offset to U (2e-30) to avoid log(U) = -inf. The CPU sampling algorithm depends on the order of the logits which is undesirable and can also oversample the first logit if it is smaller than the smallest random float larger than 0 (~1e-7). Switching to double precision internally mitigates these problems, although it doesn't fix them. Slowdown is ~35% in the worst case. Also adds various tests that we would like the sampling to pass. 
CPU Benchmark before:
32   10000   1    0.060    0.069   0.87
32   10000   4    0.229    0.074   3.10
32   10000   32   2.180    0.059   37.09
32   100000  1    0.430    0.480   0.90
32   100000  4    2.322    0.449   5.17
32   100000  32   31.508   0.471   66.96
128  10000   1    0.168    0.235   0.71
128  10000   4    0.965    0.246   3.93
128  10000   32   7.989    0.225   35.51
128  100000  1    1.681    1.539   1.09
128  100000  4    9.012    1.573   5.73
128  100000  32   126.222  1.626   77.60

CPU Benchmark after:
32   10000   1    0.054    0.112   0.48
32   10000   4    0.206    0.093   2.21
32   10000   32   1.826    0.091   20.12
32   100000  1    0.292    0.636   0.46
32   100000  4    2.086    0.606   3.44
32   100000  32   28.496   0.633   45.03
128  10000   1    0.125    0.266   0.47
128  10000   4    0.759    0.258   2.94
128  10000   32   7.362    0.254   29.03
128  100000  1    1.550    2.181   0.71
128  100000  4    8.712    2.222   3.92
128  100000  32   122.585  2.213   55.39

PiperOrigin-RevId: 157414849 --- Commit 62cf561f1 authored by Jianwei Xie<xiejw@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add numpy_input_fn integration for LinearRegressor and fix the expand_dim for label and weight. PiperOrigin-RevId: 157405237 --- Commit 40c7e0dd7 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Go: Update generated wrapper functions for TensorFlow ops. PiperOrigin-RevId: 157402364 --- Commit 2726c00ce authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Update ops-related pbtxt files. PiperOrigin-RevId: 157402063 --- Commit e9d2fba8f authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix comment describing ignore_longer_outputs_than_inputs. PiperOrigin-RevId: 157400110 --- Commit 5f097217f authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: An initial step of eliminating all implicit broadcast at the HLO level. Guard the shape inference for binary ops behind a flag. 
PiperOrigin-RevId: 157373647 --- Commit e78e5ec8a authored by Yangzihao Wang<yangzihao@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Set winograd nonfused flag to be true by default. Disable winograd nonfused conv for certain input params to avoid a known bug in cuDNNv5 and cuDNNv6. PiperOrigin-RevId: 157352847 --- Commit 3f9b69a50 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: In the CUDA path of depthwise_conv2d, add a fast variant for forward convolution when the input images are smaller than 16x16. PiperOrigin-RevId: 157347823 --- Commit 848123e61 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix incorrect condition to instantiate depthwise_ops introduced in commit 15d9f00fa. The change should have excluded depthwise_conv2d for doubles on windows debug builds, but it excluded it for all windows and all debug builds. PiperOrigin-RevId: 157345929 --- Commit 060d67b34 authored by Taehoon Lee<taehoonlee@snu.ac.kr> Committed by Taehoon Lee<taehoonlee@snu.ac.kr>: Fix typos --- Commit 409419bcc authored by Mark Daoust<markdaoust@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: add closing code quotes PiperOrigin-RevId: 157339360 --- Commit d20d0a623 authored by Jonathan Hseu<jhseu@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Fix the contrib estimator_test by updating the global step in all the appropriate spots. PiperOrigin-RevId: 157328239 --- Commit d1144d3a9 authored by Juang, Yi-Lin<b02901026@ntu.edu.tw> Committed by Juang, Yi-Lin<b02901026@ntu.edu.tw>: Fix typos --- Commit fa8bb43b1 authored by lanhin<lanhin1@gmail.com> Committed by lanhin<lanhin1@gmail.com>: Fixed a comment typo in GraphView:InitializeNode(), executor.cc. 
--- Commit 9f13ae93f authored by Asim Shankar<ashankar@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Java: Update Maven release to 1.2.0-rc1 PiperOrigin-RevId: 157294719 --- Commit c8256769c authored by Gunhan Gulsoy<gunan@google.com> Committed by Gunhan Gulsoy<gunan@google.com>: Address comments and sanity check failures. --- Commit 344225a60 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Update ops-related pbtxt files. PiperOrigin-RevId: 157292254 --- Commit eb2f6d041 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: VLOG(2) instead of VLOG(1) for detailed op printouts. PiperOrigin-RevId: 157291238 --- Commit b4466279a authored by Shanqing Cai<cais@google.com> Committed by TensorFlower Gardener<gardener@tensorflow.org>: tfdbg: add runtime shape and dtype info to DebugNumericSummary PiperOrigin-RevId: 157291215 --- Commit 4fb2425f8 authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: Add GraphOptimizer to Grappler item builder to do L1 optimizations and inlining. Op Counts Comparison (BNMT) Counts: Profile vs Grappler Op: Add, 968 vs 965 Op: AddN, 2228 vs 2228 Op: ApplyGradientDescent, 84 vs 84 Op: BatchMatMul, 998 vs 998 Op: Identity, 142 vs 105 Op: MatMul, 63 vs 63 Op: Mul, 10318 vs 10306 Op: OneHot, 1 vs 1 Op: Reshape, 8421 vs 8422 Op: Select, 488 vs 488 Op: Shape, 8132 vs 8131 Op: Sigmoid, 942 vs 942 Op: Softmax, 19 vs 19 Op: StridedSlice, 58 vs 74 Op: Sub, 1398 vs 1394 Op: Tanh, 333 vs 333 Op: Tile, 21 vs 21 Op: Transpose, 39 vs 39 PiperOrigin-RevId: 157288420 --- Commit 8918fa9ef authored by A. Unique TensorFlower<gardener@tensorflow.org> Committed by TensorFlower Gardener<gardener@tensorflow.org>: BEGIN_PUBLIC Automated g4 rollback of changelist 157272843 PiperOrigin-RevId: 158534336
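
The multinomial fix in commit f7ae1461c (listed in the merge above) hinges on the expression `-log(-log(U))` becoming infinite when `U` is exactly 0. The following is a minimal pure-Python sketch of that behaviour, not the GPU kernel from the commit. The names `gumbel_noise` and `sample_index` are hypothetical; only the formula and the `2e-30` offset are taken from the commit message.

```python
import math
import random

OFFSET = 2e-30  # offset quoted in the commit message


def gumbel_noise(u, offset=0.0):
    """Return -log(-log(u + offset)) for u in [0, 1).

    Python's math.log(0.0) raises ValueError, so the IEEE result for a
    hard zero (log(0) = -inf, hence -log(+inf) = -inf) is modelled
    explicitly.
    """
    shifted = u + offset
    if shifted == 0.0:
        return -math.inf
    return -math.log(-math.log(shifted))


def sample_index(logits, rng, offset=OFFSET):
    """Gumbel-max sampling: argmax over logits plus independent noise."""
    scores = [logit + gumbel_noise(rng.random(), offset) for logit in logits]
    return max(range(len(logits)), key=scores.__getitem__)


if __name__ == "__main__":
    print(gumbel_noise(0.0))          # infinite without the offset
    print(gumbel_noise(0.0, OFFSET))  # finite with the offset
    print(sample_index([0.1, 0.5, 0.4], random.Random(0)))
```

With the offset applied, a draw of exactly 0 yields a finite noise value, so no logit's score is pushed to an infinity by the noise term alone.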
* Add bazel version check to .configure (A. Unique TensorFlower, 2017-06-07)
| | | | PiperOrigin-RevId: 158294569
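
The commit above does not describe how the version check is implemented. Purely as an illustration, a minimum-version comparison might be sketched in Python as follows. `MIN_BAZEL_VERSION`, `parse_version`, and `check_bazel_version` are hypothetical names, and the minimum shown is a placeholder rather than the value the configure script requires.

```python
import re

MIN_BAZEL_VERSION = "0.4.5"  # placeholder, not taken from the commit


def parse_version(text):
    """Turn a string such as '0.5.1' into the tuple (0, 5, 1)."""
    return tuple(int(part) for part in re.findall(r"\d+", text)[:3])


def check_bazel_version(current, minimum=MIN_BAZEL_VERSION):
    """Return True when `current` is at least `minimum`."""
    return parse_version(current) >= parse_version(minimum)


if __name__ == "__main__":
    print(check_bazel_version("0.10.0"))
    print(check_bazel_version("0.4.4"))
```

Comparing integer tuples rather than raw strings matters here: as strings, "0.10.0" sorts before "0.4.5", which would wrongly reject a newer release.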
* When GPU is configured, do not require --config=cuda. (Gunhan Gulsoy, 2017-06-06)
| | | | | | Also fix indentation in configure. PiperOrigin-RevId: 158232959
* Merge changes from github. (Benoit Steiner, 2017-05-11)
| | | | PiperOrigin-RevId: 155709893
* Merge changes from github. (Dan Ringwalt, 2017-05-05)
| | | | Change: 155209832
* Use $opt defined in the loop rather than raw string. (Patrick Nguyen, 2017-05-04)
| | | | | Fixes #9651. Change: 155089799
* Fix issue related to empty bazel.rc file. (A. Unique TensorFlower, 2017-05-02)
| | | | Change: 154840138
* Removing python_config.sh, moved functionality asking users for input to ↵ (A. Unique TensorFlower, 2017-04-27)
| | | | | | root configure moved lib checks to python_configure.bzl Change: 154412830
* Merge changes from github. (Shanqing Cai, 2017-04-22)
| | | | Change: 153925676
* Automated rollback of change 153736477 (A. Unique TensorFlower, 2017-04-21)
| | | | Change: 153825726
* Automated rollback of change 153709951 (Shanqing Cai, 2017-04-20)
| | | | Change: 153736477
* Adding python_configure.bzl to (partially) replace python_config.sh (A. Unique TensorFlower, 2017-04-20)
| | | | Change: 153709951
* Fix ./configure script (A. Unique TensorFlower, 2017-04-05)
| | | | | | 1. Add %workspace% in .bazelrc file when using import statement 2. Write action_env into bazelrc file for required environment variables for OpenCL support Change: 152290700
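
The two points in the commit above concern lines written into a bazelrc file. A rough Python sketch of generating such lines is shown below. It is not the configure script's own code: the helper names, the example path, and the `EXAMPLE_TOOLKIT_PATH` variable are all assumptions made for illustration, while `import %workspace%/...` and `--action_env` are standard Bazel syntax.

```python
def bazelrc_import_line(relative_path):
    """Build an import line anchored at the workspace root."""
    return "import %workspace%/{}".format(relative_path.lstrip("/"))


def action_env_line(name, value):
    """Build a line that passes one environment variable to build actions."""
    return 'build --action_env={}="{}"'.format(name, value)


if __name__ == "__main__":
    # Both arguments below are illustrative placeholders.
    print(bazelrc_import_line("tools/bazel.rc"))
    print(action_env_line("EXAMPLE_TOOLKIT_PATH", "/opt/example"))
```

Anchoring the import at `%workspace%` makes the path resolve relative to the workspace root instead of the directory Bazel happens to be invoked from.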
* Merge changes from github. (A. Unique TensorFlower, 2017-04-04)
| | | | Change: 152200430
* Add cuda_clang build configuration that allows to use clang as a CUDA compiler. (A. Unique TensorFlower, 2017-03-30)
| | | | Change: 151705528
* Merge changes from github. (Martin Wicke, 2017-03-23)
| | | | Change: 151046259
* configure: Add option to restrict bazel targets fetched. (Asim Shankar, 2017-03-13)
| | | | | | Use that option to restrict the Windows C and Java library release builds to the 4 targets that are actually built. Change: 149990258
* Merge changes from github. (Dandelion Mané, 2017-03-10)
| | | | Change: 149800363
* Merge changes from github. (Vijay Vasudevan, 2017-02-17)
| | | | Change: 147897309
* Merge changes from github. (Benoit Steiner, 2017-02-08)
| | | | Change: 146918929
* Add xla target for tensorflow tests that request them. (Eugene Brevdo, 2017-01-27)
| | | | Change: 145856327
* Delete downloaded BUILD files for makefile dependencies immediately after ↵ (Andrew Harp, 2017-01-27)
| | | | | | extracting and also at the beginning of ./configure. This will prevent bazel from erroneously parsing contrib/makefile/downloads during configuration if a makefile build has previously been performed. Change: 145838744
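
The cleanup described in the commit above can be illustrated with a short Python sketch. It assumes the files in question are literally named `BUILD` and that the caller passes the downloads directory explicitly. `remove_build_files` is a hypothetical helper, not the code from the commit.

```python
import os
import tempfile


def remove_build_files(downloads_dir):
    """Delete every file named BUILD under downloads_dir.

    Returns the sorted list of removed paths so callers can log them.
    """
    removed = []
    for root, _dirs, files in os.walk(downloads_dir):
        for name in files:
            if name == "BUILD":
                path = os.path.join(root, name)
                os.remove(path)
                removed.append(path)
    return sorted(removed)


if __name__ == "__main__":
    # Demonstrate on a throwaway directory rather than a real checkout.
    with tempfile.TemporaryDirectory() as tmp:
        os.makedirs(os.path.join(tmp, "dep"))
        open(os.path.join(tmp, "dep", "BUILD"), "w").close()
        print(remove_build_files(tmp))
```

Removing the files both right after extraction and at the start of configuration, as the commit describes, means a stale download directory cannot be picked up by a later Bazel run.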
* Automated rollback of change 145703555 (Eugene Brevdo, 2017-01-27)
| | | | Change: 145809900
* Make LSTMCell use Defuns to speed up static graph builds, add compiled flag. (Eugene Brevdo, 2017-01-26)
| | | | Change: 145703555
* Merge changes from github. (Shanqing Cai, 2017-01-23)
| | | | Change: 145363673
* In ./configure, only ask about jemalloc when running on Linux. (Jonathan Hseu, 2017-01-17)
| | | | | | | | Also add functions to test whether we're running on Linux or macOS, and convert every check over to them. Fixes #6869 Change: 144761341
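
The platform helpers mentioned in the commit above could look roughly like this in Python. This is an illustrative sketch, not the functions added to the configure script. The names `is_linux`, `is_macos`, and `should_ask_about_jemalloc` are assumptions, and the optional `system` argument exists only to make the helpers easy to test.

```python
import platform


def is_linux(system=None):
    """Return True when the (given or detected) platform is Linux."""
    return (system or platform.system()) == "Linux"


def is_macos(system=None):
    """Return True when the (given or detected) platform is macOS."""
    return (system or platform.system()) == "Darwin"


def should_ask_about_jemalloc(system=None):
    """Only prompt for jemalloc on Linux, as the commit describes."""
    return is_linux(system)


if __name__ == "__main__":
    print(platform.system(), should_ask_about_jemalloc())
```

Routing every platform check through a pair of helpers keeps the comparison strings in one place instead of scattering them across the script.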
* Merge changes from github. (Patrick Nguyen, 2017-01-17)
| | | | Change: 144729490
* Create --config opt for compiling fully optimized binaries. (Gunhan Gulsoy, 2017-01-16)
| | | | Change: 144609556
* Enable customization of architecture dependent compiler optimizations via ↵ (Martin Wicke, 2017-01-13)
| | | | | | configure, defaulting to -march=native. Change: 144470919
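
The defaulting behaviour named in the commit above (fall back to `-march=native` when the user supplies nothing) can be sketched in Python as follows. `resolve_opt_flags` is a hypothetical helper and the prompt handling is omitted. Only the default value comes from the commit message.

```python
DEFAULT_OPT_FLAGS = "-march=native"  # default named in the commit message


def resolve_opt_flags(user_input, default=DEFAULT_OPT_FLAGS):
    """Return the optimization flags as a list.

    Empty or whitespace-only input selects the default.
    """
    text = user_input.strip() or default
    return text.split()


if __name__ == "__main__":
    print(resolve_opt_flags(""))
    print(resolve_opt_flags("-mavx2 -mfma"))
```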
* Switch open-source to use jemalloc for CPU Tensor memory allocation, gRPC, ↵ (Jonathan Hseu, 2017-01-11)
| | | | | | | | and other places where we call malloc/free. - Only enabled on Linux for now. - Added as a ./configure option defaulting to enabled. Change: 144266237
* Initial open-source release of XLA: Accelerated Linear Algebra. (Peter Hawkins, 2017-01-09)
| | | | | | | XLA is a compiler-based linear algebra execution engine that targets CPUs, GPUs and custom accelerators. XLA is still experimental; we are releasing it early to get the community involved. Change: 143990941
* Merge changes from github. (Shanqing Cai, 2017-01-08)
| | | | Change: 143922699
* Merge changes from github. (Rohan Jain, 2017-01-05)
| | | | Change: 143710639
* Merge changes from github. (A. Unique TensorFlower, 2017-01-02)
| | | | Change: 143412147
* Remove default support for Google Cloud Platform that was introduced recently. (A. Unique TensorFlower, 2016-12-28)
| | | | | | | Some users do not want or need this support and it significantly increases their dependencies and binary sizes. After this change, users must explicitly depend on :gcs_file_system to enable this support. Change: 143129406
* Make TensorFlow build libcurl from scratch (Justine Tunney, 2016-12-27)
| | | | | | | Google Cloud Platform support is now available for everyone by default. Fixes #5349 Change: 143077241
* Merge changes from github. (Jonathan Hseu, 2016-12-22)
| | | | Change: 142805270