| Commit message (Collapse) | Author | Age |
|
|
|
| |
Change: 142169284
|
|
|
|
| |
Change: 142074581
|
|
|
|
| |
Change: 142041070
|
| |
|
| |
|
|
|
|
|
|
|
|
| |
target.
Add missing zlib dependency to LLVM build file.
Various other small cleanups.
Change: 141557751
|
|
|
|
|
|
|
|
| |
Additionally:
- change single quotes to double quotes to make path rewriting easier
- guard windows lib reference with PLATFORM_WINDOWS
- fixed failing kmeans test
Change: 141515942
|
|
|
|
|
|
| |
configure time.
Change: 141474932
|
|
|
|
|
|
| |
built with --config=cuda.
Change: 141406402
|
|
|
|
|
|
|
| |
allows non --config=cuda builds to link the CUDA plugins if desired.
Add build macro if_cuda_is_configured() that tests whether CUDA was enabled at configure time, rather than whether the current BUILD is using a CUDA compiler.
Change: 141375030
|
|
|
|
|
|
| |
builds.
Change: 141352525
|
|
|
|
|
|
|
|
|
|
| |
1. Created open-source target libdevice_root that wraps all libdevice files.
2. platform/posix/cuda_libdevice_path depends on the libdevice_root target.
3. Added cuda_libdevice_path_test that verifies libdevice files exist in the
computed libdevice directory.
Change: 141237087
|
|
|
|
|
|
| |
--config=cuda.
Change: 140807677
|
|
|
|
| |
Change: 140616374
|
|
|
|
| |
Change: 140533246
|
|
|
|
| |
Change: 140396287
|
|
|
|
| |
Change: 140088388
|
|
|
|
| |
Change: 139938302
|
|
|
|
| |
Change: 139832288
|
|
|
|
| |
Change: 139516555
|
|
|
|
|
|
|
|
|
| |
This makes JPEG go 2x faster on x86_64 (k8), arm7, and arm8. On all
other CPU targets, e.g. x86, JPEG performance should be the same as it
was before.
Fixes #4807
Change: 139295768
|
|
|
|
| |
Change: 138980879
|
|
|
|
| |
Change: 138937852
|
|
|
|
|
|
| |
is then added to the list of dependencies in the main "all_opensource_files" filegroup
Change: 138698669
|
|
|
|
|
| |
google3, and enable it in github. This is because we haven't imported the backed in google3 just yet.
Change: 138689620
|
|
|
|
| |
Change: 138675832
|
|
|
|
| |
Change: 138143557
|
|
|
|
| |
Change: 137532946
|
|
|
|
|
|
|
|
|
| |
This should address some of the ongoing python 3.5-related build failures in:
nightly-matrix-cpu
nightly-matrix-linux-gpu
nightly-matrix-mac-gpu
nightly-python35-linux-cpu
Change: 137268906
|
| |
|
|
|
|
|
|
|
|
|
|
| |
This change allows Bazel to fetch and build SWIG rather than getting it
from the system. This change also improves the i/o performance of the
SWIG build, makes it hermetically sealed, and ensures tf_py_wrap_cc()
can function correctly across Bazel repositories.
CC: #4983
Change: 136783531
|
|
|
|
| |
Change: 136750267
|
|
|
|
| |
Change: 136615121
|
|
|
|
| |
Change: 136423498
|
|
|
|
| |
Change: 135698415
|
| |
|
|
|
|
| |
Change: 134721831
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
This change does the following:
- Always use {,new_}http_archive rather than git_repository
- Make liberal use of strip_prefix
- Clarify licenses() in BUILD files
- On POSIX include headers like a normal C/C++ program
This change accomplishes the following:
- Reduce download size >100MB: The biggest culprit is grpc which has
tens of thousands of commits in its GitHub repository.
- Reduce disk size >200MB: On disk, grpc takes up 250MB when cloned even
though the tarball of the git repo is 3.2MB. By never using git
externals, we save on network.
- Consume less cpu: Cloning git repositories is much slower than
downloading and extracting a tarball.
Change: 133895791
|
|
|
|
| |
Change: 133874452
|
|
|
|
| |
Change: 133842773
|
|
|
|
| |
Change: 133779175
|
|
|
|
| |
Change: 133650335
|
| |
|
|
|
|
| |
Change: 133096559
|
|
|
|
| |
Change: 132733397
|
|
|
|
| |
Change: 131437429
|
|
|
|
| |
Change: 131310818
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
batches of dense matrices. This calls Eigen::JacobiSVD<Matrix, Eigen::HouseholderQRPreconditioner> which is known to be rather slow. This change is primarily intended to get the TensorFlow interfaces and functionality in place. We intend to swap out the "backend" with a higher performance algorithm implementation in the future.
This CL also contains a small refactoring of the LinearAlgebraOp base class:
1. I moved the initial processing of inputs and outputs into separate helper functions so Compute() is not so long.
2. The derived classes are now allowed to return fewer output matrix shapes (n) than the number of op outputs (m) in which case empty (shape[0]) tensors are returned for the last m-n outputs.
Fixed a few Python linter errors that were blocking presubmit.
Change: 128990912
|
|
|
|
| |
Change: 128958134
|
|
|
|
| |
Change: 128401884
|