| Commit message (Collapse) | Author | Age |
|\ |
|
| |
| |
| |
| | |
As proposed by static analysis tool:
https://github.com/koalaman/shellcheck/wiki/SC2155
|
| |
| |
| |
| | |
As proposed by static analysis tool:
https://github.com/koalaman/shellcheck/wiki/SC2027
|
| | |
|
| |
| |
| |
| | |
As proposed by static analysis tool:
https://github.com/koalaman/shellcheck/wiki/SC2207
|
| |
| |
| |
| | |
As proposed by static analysis tool:
https://github.com/koalaman/shellcheck/wiki/SC2166
|
| |
| |
| |
| | |
PiperOrigin-RevId: 158294569
|
|\| |
|
| |
| |
| |
| |
| |
| | |
Also fix indentation in configure.
PiperOrigin-RevId: 158232959
|
| |
| |
| |
| |
| |
| | |
* MKL_INSTALL_PATH should not be clear when given
* fix overwrite by default
|
|\ \
| | |
| | | |
configure: Fix default path when enabling MPI.
|
| | |
| | |
| | |
| | | |
Reverting #8880 (see #10236)
unnecessary since bazelbuild/bazel#2759 was merged
|
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | | |
* Update docker to cudnn6.
* Update Dockerfile.gpu
* Add --expunge to bazel clean to make cuda_configure run again and update TF_CUDNN_VERSION.
* Remove expunge and set CUDA and CUDNN version default in configure.
* Update configure
* Only set --action_env once
* Update prints for default version.
|
|/ /
| |
| |
| | |
Correct showing what the default path is when mpi is installed.
|
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| | |
* OpenCL Improvements
* Registers Scatter and ScatterNd Ops for SYCL
* Registers Stack op for SYCL
* Fixes No sycl buffer found error for debug ops
* Registers MatMul and Transpose Ops to SYCL device for double
* Extends analyzer_cli_test.py test to cover SYCL
* Fixes Transpose Op for double when on SYCL
* Bumps Eigen version to fix double precision issue on SYCL
* Extends SessionDebugTestBase to cover SYCL
* Register SYCL implementations for random ops
* Avoid functions that might not be defined on SYCL device (#51)
* Avoid functions that might not be defined on SYCL device
* Simplify by using Eigen math functions
* OpenCL improvements
- Bumps Eigen Version
- Refactors Ops registration
- Introduces workaround for Const Op related to the difference between
CUDA which uses pointers and OpenCL that uses buffers/accessors
- Extends memory types to cover DEVICE_SYCL as well
- Introduces GetSYCLDevice() method that returns list of supported devices
with GPU device having the highest priority ( doesn't include blacklisted devices )
- ::internal::Transpose -> tensorflow::internal::Transpose in order to
avoid compilation reported error
- re-introduces fix for bugged string replacement causing a lot of compilation
warnings -c -> --include
- Adds sycl_runtime to bazels ARRAY_DEPS
- Replicates TF_CALL_GPU_PROXY_TYPES for SYCL
* [OpenCL] Fixes an issue caused by switch to aligned allocator for sycl buffer (#53)
* [Build] Use gcc/g++ as a host compiler to avoid https://github.com/tensorflow/tensorflow/issues/8394 (#54)
* [OpenCL] Fixes Scatter Op
* Fix testSimple and testConst in stack_op_test (#3)
* Fix testSimple and testConst in stack_op_test
* Create a specialisation of DoParallelConcatUpdate for SyclDevice and
register it
* Guard all code in TENSORFLOW_USE_SYCL
* Do not use sycl device for int32
* Registration of the Sycl version is now looking like the one for the GPU
* Remove added empty line
* Register batch normalization kernels for OpenCL (#61)
* [OpenCL] RandomGamma has no GPU friendly implementation (#57)
* [OpenCL] Compatibility fixes for TensorFlow 1.1.0-rc1
* [OpenCL] Implements BatchMatmul Op for SYCL
* Lowercase the device name when GPU or SYCL returned
* [OpenCL] kernel_estimator_test.py assertEqual-> assertAlmostEqual due to floating point representation on the device
* [Eigen] Version bump
* GPU device name string manipulation is not needed anymore
* [OpenCL] Adds SYCL to device backwards compatibility
* [OpenCL] Extends core_rnn_test.py to run for SYCL device
* [OpenCL] Minor optimizations for build script
* [OpenCL] Enables skip folder list in build script
* [OpenCL] Fixes ApplyAdamOp for Sycl device
* [OpenCL] SYCL device improvements
* [OpenCL] Fixes debug_ops's SEGFAULT for SYCL device
* [Build] Adds hexagon to skipped folders list
* [OpenCL] Removes EnterLameDuckMode from SYCL device and allocator
* [OpenCL] Registers Unique Op for SYCL device
* [OpenCL][Temporary] Disables tests for SYCL target due to features not being implemented yet
Tests affected:
- tensorflow/contrib/memory_stats/python/kernel_tests/memory_stats_ops_test.py
- tensorflow/contrib/rnn/python/kernel_tests/core_rnn_test.py
- tensorflow/python/kernel_tests/conv_ops_test.py
- tensorflow/python/kernel_tests/depthwise_conv_op_test.py
- tensorflow/python/kernel_tests/pooling_ops_3d_test.py
- tensorflow/python/kernel_tests/pooling_ops_test.py
- tensorflow/python/kernel_tests/scatter_nd_ops_test.py
- tensorflow/python/training/adam_test.py
- tensorflow/python/training/localhost_cluster_performance_test.py
- tensorflow/python/training/training_ops_test.py
* [OpenCL][Temporary] Disables failing tests for SYCL in order to establish regression baseline
Tests affected:
- tensorflow/python/debug/cli/analyzer_cli_test.py
- tensorflow/python/debug/lib/session_debug_testlib.py
- tensorflow/python/debug/lib/stepper_test.py
- tensorflow/python/kernel_tests/unstack_op_test.py
- tensorflow/python/ops/image_ops_test.py
* [OpenCL] Take options.config.device_count() into consideration
* [OpenCL] Fixes compilation warning
* [OpenCL] device:SYCL:0 -> sycl:0
* [OpenCL] Removes unwanted flags in building script
Removes flags given to computecpp that enable SIMD instructions
Removes duplicate flags
* bool -> const bool
* [OpenCL] sycl in test_util.gpu_device_name() -> is_sycl_enabled()
* [OpenCL][Temporary] Disables failing tests for SYCL in order to establish regression baseline
Test affected:
- tensorflow/contrib/stateless/python/kernel_tests/stateless_random_ops_test.py
* Imports test_util from tensorflow.python.framework
* [OpenCL] Fixes formatting in Python code
* [OpenCL] Extends session_test.py to cover SYCL device
* [OpenCL] Cleans singleton class
* [OpenCL] Keeping CUDA happy
* [OpenCL][Temporary] Disables failing tests for SYCL in order to establish regression baseline
Test affected:
- tensorflow/contrib/rnn/python/kernel_tests/core_rnn_cell_test.py
- tensorflow/contrib/seq2seq/python/kernel_tests/beam_search_ops_test.py
* Added support for building with SYCL on ARM.
* Acts on the review feedback from:
- https://github.com/tensorflow/tensorflow/pull/9117#discussion_r113608975
- https://github.com/tensorflow/tensorflow/pull/9117#discussion_r113609173
* [OpenCL] Fixes scatter_nd_op_test
* Fixes auto-merge mistake
* [OpenCL] struct SyclDevice -> class SyclDevice
* Revert "[OpenCL] struct SyclDevice -> class SyclDevice"
This reverts commit addd43348c374a5379f67bb1e5ad084715722fc2.
* [OpenCL] Reverting refactoring commit.
As requested in the review https://github.com/tensorflow/tensorflow/pull/9117#issuecomment-298454466
This change set will be re-introduced in smaller chunks.
* Revert "[OpenCL] device:SYCL:0 -> sycl:0"
This reverts commit cf16e60340b62d16c3764d71b716fe03d35f87a9.
* Revert "[OpenCL] Adds SYCL to device backwards compatibility"
This reverts commit b8401b5164199b7a169be1c1d8dea5001195c390.
* Acts on the feedback from https://github.com/tensorflow/tensorflow/pull/9117#discussion_r115036905
* control_flow_ops_py_test.py expects device name to be lower cased
* Acts on the feedback from https://github.com/tensorflow/tensorflow/pull/9117#discussion_r115037222
* Removes debug print
* Removes not needed partial specialisation
* [OpenCL] Registers ScatterNdFunctor for SYCL device
* [OpenCL] Make it compile
* [OpenCL] Follow gpu_device changes
* [OpenCL] Adds cxx_builtin_include_directory for python lib
Fixes bazels missing undeclared inclusions that appeared after
merge with TensorFlow upstream
* [OpenCL] Fixes Constant Op
* [OpenCL] gXX-4.8 -> gXX
* [OpenCL] Removes -D_GLIBCXX_USE_CXX11_ABI=0 as it breaks default compiler setup for Ubuntu 16.04
* Revert "[OpenCL] kernel_estimator_test.py assertEqual-> assertAlmostEqual due to floating point representation on the device"
This reverts commit 06c50c0a485f40c30a436f02c3fa7794e370c49d.
* [OpenCL] CPU allocator is a singleton we should not delete it
|
|\ \
| | |
| | | |
MPI based communication path for tensor exchange operations
|
| | | |
|
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | | |
fix: typos
This PR is part of a campaign to fix a lot of typos on github!
You can see the progress on https://github.com/fixTypos/fix_typos/
https://github.com/client9/misspell
|
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | | |
* Bazel CI / TensorFlow: fix more errors
In this commit:
1. Keep just the first character of the python
version checker script's output. This is a simple
way to strip \n and \r and we only need the major
version anyway. The string is later used in
writing the .bazelrc file where these characters
would cause problems.
2. Export environment variables in
run_configure_for_cpu_build, use their default
values
3. Allow a user-defined value for BAZEL_SH
4. Fix python_configure.bzl:
4.a. Fix the symlink creation on Windows: create a
single junction pointing to the src_dir. This is a
trick to get around the problem that on Windows we
can't create symlinks. By creating this symlink
and declaring the same outputs as on Linux/Darwin,
we make it look like that the genrule successfully
produced these files. This only works in a
single-machine execution environment.
4.b. Get rid of the "select" in
@local_config_python//:numpy_include and
python_include. The results of `find` and `dir`
only make sense on Linux/Darwin and Windows
respectively, so there's no point in generating
the non-Windows genrule on Windows and the
Windows-specific one on non-Windows, because all
the paths would be wrong. So instead of that,
generate just the genrule corresponding to the
platform.
4.c. Write the genrule.cmd's commands to scripts
and execute those instead. This avoids the
difficulties with escaping backslashes and quotes.
Backslashes are necessary on Windows because paths
with forward slashes aren't generally accepted,
e.g. "cmd.exe /c foo/bar" doesn't work because
Windows believes foo is an executable and /bar is
its switch:
c:\tempdir>type bar.cmd
@echo this is bar.cmd, %%1=(%1)
c:\tempdir>cmd.exe /c bar/baz
this is bar.cmd, %1=(/baz)
5. simplify the "is_windows" function in
./configure
Fixes https://github.com/bazelbuild/bazel/issues/2892
* Fix bad cc_library.includes from prev. commit
* bazel_test_lib.sh: add envvars to CPU build config
* Fix junction creation in python_configure.bzl
Bazel deletes the output files from a previous
build before running an action, and if the
junction pointing to the source directory is
present then Bazel would delete the source files,
believing they are in the output directory.
Thanks to @meteorcloudy on
https://github.com/tensorflow/tensorflow/pull/9762
|
| | |
| | |
| | |
| | |
| | |
| | |
| | | |
When running ./configure for the first time, an import ~/.bazelrc statement
will be added to the generated .bazelrc if the user has one.
Fixes #9963
See also bazelbuild/bazel#3022
|
|/ /
| |
| |
| | |
distributed TensorFlow
|
| |
| |
| |
| | |
PiperOrigin-RevId: 155709893
|
| |
| |
| |
| |
| | |
not prompted to ask MKL download if the var is set already
remove unexpected exit when input is given by user
|
| | |
|
| |
| |
| |
| | |
Change: 155209832
|
|\| |
|
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| | |
* relu grad and maxpooling grad fixes for perf
* Graph layout pass and conversion pass changes
This commit makes following changes:
- Enables support for ReluGrad and BiasAddGrad
- Adds support for detecting depthwise/batchwise pooling
- Adds more unit tests for Graph rewrite pass
- Improvements to handling control-flow edges
- Bug fixes
* Defaulting to Eigen when LRN depth_radius!=2
* Fixed mkl_conv_grad_filter.cc for conv_ops_tests.py
* Style fix to mkl_matmul and remove unnecessary 'MKL' label on matmul kernel
* Style fixes based on clang-format to mkl_conv_* and mkl_matmul
* Bug fixes
* Adding OP_REQUIRES_OK check in Concat
* Making some style changes
* Enabled the configuration of MKL settings
* relu grad and maxpooling grad fixes for perf
* Graph layout pass and conversion pass changes
This commit makes following changes:
- Enables support for ReluGrad and BiasAddGrad
- Adds support for detecting depthwise/batchwise pooling
- Adds more unit tests for Graph rewrite pass
- Improvements to handling control-flow edges
- Bug fixes
* Defaulting to Eigen when LRN depth_radius!=2
* Fixed mkl_conv_grad_filter.cc for conv_ops_tests.py
* Style fix to mkl_matmul and remove unnecessary 'MKL' label on matmul kernel
* Style fixes based on clang-format to mkl_conv_* and mkl_matmul
* Bug fixes
* Adding OP_REQUIRES_OK check in Concat
* Making some style changes
* Enabled the configuration of MKL settings
* Fixing graph unit tests with Mkl op name change to _Mkl; Fixed missing _ in MklToTf op
* Fixed missing libdl.so.2 in BUILD file
* Fixes for unit test build failures.
* Changes in mkl_conv_grad_filter_ops.cc for Google code style
* Fixes to remove dead code
* removed the dead code and added a TODO for mkl implementation to handle this case in the future
* Fixed buildifier sanity check error
* Adding support for google's CI automation
* Updated link to new MKL version
* Fix for missing locate command in CI
* Adding updatedb to populate the database after installing mlocate
* Fixed buildifier issue
* setting tf_need_mkl=0 in libtf files
* Added third_party/mkl/* to .gitignore
* Added third_party/eigen3/mkl_include to .gitignore
* In configured, set MKL-enabling options only for Linux.
|
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| | |
This change:
1. updates common_env.sh to export PYTHON_LIB_PATH
along with PYTHON_BIN_PATH so the configure
scripts doesn't have to guess
2. writes these paths to bazelrc with quotes
around, to guard agains spaces in the path (e.g.
"C:/Program Files/Anaconda3/python")
Fixes https://github.com/bazelbuild/bazel/issues/2892
|
| |
| |
| |
| |
| | |
Fixes #9651.
Change: 155089799
|
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| | |
* Enable grappler to propagate shapes through queues.
Change: 154789133
* Add whitelist support in uid of RunConfig.
Change: 154794859
* Fix a bunch of bad links and missing docs in contrib.
Change: 154820641
* Don't try to refine the shapes for a node if its inference context wasn't
successfully built by the AddNode() method.
Change: 154838211
* Fix issue related to empty bazel.rc file.
Change: 154840138
* Remove overly precise CHECK when rendering debug output for a function.
An `_Arg` node can have more than three attrs, because the runtime may
(and does) add system-defined attrs (viz. "_output_shapes") that do
not change the meaning of the op.
Change: 154850526
* Port makefile build breakage
Change: 154855106
* [TF:XLA] Try to incorporate Tensorflow node structure for large HLO GraphDefs.
This change assumes that a TF subgraph/op does not cross the boundary of a HLO
computation and always put top-level TF subgraphs/ops under HLO computations.
Change: 154855884
* Added a unit test to check what happens when 2 shapes with known rank but
unknown dimensions are merged
Change: 154856675
* [XLA] Refactor constant folding operations into a dedicated module
Refactor constant folding operations into a dedicated module, and added a new
ReplaceInstruction() API to collapse { computation->ReplaceInstruction();
changed=true}.
Change: 154857025
* Java: Docs: Update instructions for Windows.
Inspired by
http://stackoverflow.com/questions/43741775/tensorflow-in-java-running-failed
Change: 154859066
* Add more documentation for features and labels.
Change: 154859649
* Added link to high-performance models
Change: 154860213
* Navigation and index for new performance section documents.
Change: 154862215
* Fix shape mismatch between loss and weights.
Change: 154862650
* Add examples to TensorShape documentation and ran autoformatter.
Change: 154862667
* Move linking of cudnn_plugin, cublas_plugin and cufft_plugin from
stream_executor to the ops that need them.
Change: 154863520
* Properly track the persistent memory usage of lookup tables.
Change: 154866686
* Reset the inputs to ShapeRefiner::RunShapeFn so that it behaves the same every time it's called.
To properly handle queues that have populated by several enqueue ops, merge the shapes of the inputs to all the enqueue ops before calling InferenceContext::set_output_handle_shape(). This ensures that we detect incorrect queue setups (where the 2 enqueue ops might generate tensors with incompatible shapes), and that we take all the known shape information instead of that of just one of the enqueue ops.
Change: 154866747
* Making sure an error message will be produced by session_manager when a non-tensor object is passed in.
Otherwise the 'name' property is missing.
Change: 154868022
* Don't needlessly synchronize the CUDA stream in CropAndResize.
Make the op Async so we don't block an executor thread while waiting for the result of the box bounds check to be copied back to the host.
Change: 154868460
* Add contribution guidelines and standards section to CONTRIBUTING.md
Several parts are largely based on the post by @yaroslavvb at: #7443#issuecomment-279182613
Fixes #7443
Change: 154876045
* Final draft
Change: 154876563
* Final draft
Change: 154876646
* Fix losses documentation.
Fix documentation of get_total_loss() to be correct.
And add a helpful comment about a common pitfall.
Change: 154876822
* [XLA] Second change for HLO interpreter.
Extends HloEvaluator to allow evaluation of HLO Computation or single HLO instruction
with non-constant operands, by traversing the instruction in post order and keeps track of
each instruction along the way as evaluated literals.
Change: 154877580
* [tf distributions] Move the remaining whitelisted distributions to core.
Change: 154878206
* Add shape to error message.
Change: 154880260
* Revert "Fix build issue when `/usr/bin/python` path is not available (#9547)"
This reverts commit 95f37ebf0bd46c328266f65bbd16d319c0efab3d.
|
| |
| |
| |
| | |
Change: 154840138
|
| | |
|
|/
|
|
|
|
| |
* make configure smarter in detecting cuda
* Remove variable
|
|
|
|
|
|
| |
root configure moved lib checks to python_configure.bzl
Change: 154412830
|
|
|
|
| |
Change: 153925676
|
|
|
|
| |
Change: 153825726
|
|
|
|
| |
Change: 153736477
|
|
|
|
| |
Change: 153709951
|
|
|
|
|
|
| |
1. Add %workspace% in .bazelrc file when using import statement
2. Write action_env into bazelrc file for required environment variables for OpenCL support
Change: 152290700
|
|
|
|
| |
Change: 152200430
|
|
|
|
| |
Change: 151705528
|
|
|
|
| |
Change: 151046259
|
|
|
|
|
|
| |
Use that option to restrict the Windows C and Java library release
builds to the 4 targets that are actually built.
Change: 149990258
|
|
|
|
| |
Change: 149800363
|
|
|
|
| |
Change: 147897309
|
|
|
|
| |
Change: 146918929
|
|
|
|
| |
Change: 145856327
|
|
|
|
|
|
| |
extracting and also at the beginning of ./configure. This will prevent bazel from erroneously parsing contrib/makefile/downloads during configuration if a makefile build has previously been performed.
Change: 145838744
|
|
|
|
| |
Change: 145809900
|
|
|
|
| |
Change: 145703555
|