| Commit message (Collapse) | Author | Age |
|
|
|
| |
Change: 140616374
|
|
|
|
| |
Change: 140533246
|
|
|
|
| |
Change: 140396287
|
|
|
|
| |
Change: 140088388
|
|
|
|
| |
Change: 139938302
|
|
|
|
| |
Change: 139832288
|
|
|
|
| |
Change: 139516555
|
|
|
|
|
|
|
|
|
| |
This makes JPEG go 2x faster on x86_64 (k8), arm7, and arm8. On all
other CPU targets, e.g. x86, JPEG performance should be the same as it
was before.
Fixes #4807
Change: 139295768
|
|
|
|
| |
Change: 138980879
|
|
|
|
| |
Change: 138937852
|
|
|
|
|
|
| |
is then added to the list of dependencies in the main "all_opensource_files" filegroup
Change: 138698669
|
|
|
|
|
| |
google3, and enable it in github. This is because we haven't imported the backed in google3 just yet.
Change: 138689620
|
|
|
|
| |
Change: 138675832
|
|
|
|
| |
Change: 138143557
|
|
|
|
| |
Change: 137532946
|
|
|
|
|
|
|
|
|
| |
This should address some of the ongoing python 3.5-related build failures in:
nightly-matrix-cpu
nightly-matrix-linux-gpu
nightly-matrix-mac-gpu
nightly-python35-linux-cpu
Change: 137268906
|
| |
|
|
|
|
|
|
|
|
|
|
| |
This change allows Bazel to fetch and build SWIG rather than getting it
from the system. This change also improves the i/o performance of the
SWIG build, makes it hermetically sealed, and ensures tf_py_wrap_cc()
can function correctly across Bazel repositories.
CC: #4983
Change: 136783531
|
|
|
|
| |
Change: 136750267
|
|
|
|
| |
Change: 136615121
|
|
|
|
| |
Change: 136423498
|
|
|
|
| |
Change: 135698415
|
| |
|
|
|
|
| |
Change: 134721831
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
This change does the following:
- Always use {,new_}http_archive rather than git_repository
- Make liberal use of strip_prefix
- Clarify licenses() in BUILD files
- On POSIX include headers like a normal C/C++ program
This change accomplishes the following:
- Reduce download size >100MB: The biggest culprit is grpc which has
tens of thousands of commits in its GitHub repository.
- Reduce disk size >200MB: On disk, grpc takes up 250MB when cloned even
though the tarball of the git repo is 3.2MB. By never using git
externals, we save on network.
- Consume less cpu: Cloning git repositories is much slower than
downloading and extracting a tarball.
Change: 133895791
|
|
|
|
| |
Change: 133874452
|
|
|
|
| |
Change: 133842773
|
|
|
|
| |
Change: 133779175
|
|
|
|
| |
Change: 133650335
|
| |
|
|
|
|
| |
Change: 133096559
|
|
|
|
| |
Change: 132733397
|
|
|
|
| |
Change: 131437429
|
|
|
|
| |
Change: 131310818
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
batches of dense matrices. This calls Eigen::JacobiSVD<Matrix, Eigen::HouseholderQRPreconditioner> which is known to be rather slow. This change is primarily intended to get the TensorFlow interfaces and functionality in place. We intend to swap out the "backend" with a higher performance algorithm implementation in the future.
This CL also contains a small refactoring of the LinearAlgebraOp base class:
1. I moved the initial processing of inputs and outputs into separate helper functions so Compute() is not so long.
2. The derived classes are now allowed to return fewer output matrix shapes (n) than the number of op outputs (m) in which case empty (shape[0]) tensors are returned for the last m-n outputs.
Fixed a few Python linter errors that were blocking presubmit.
Change: 128990912
|
|
|
|
| |
Change: 128958134
|
|
|
|
| |
Change: 128401884
|
|
|
|
|
|
| |
improvements for fp16
Added SpecialFunctions to the list of eigen headers TensorFlow depends on
Change: 127264575
|
|
|
|
| |
Change: 127253427
|
|
|
|
|
| |
improvements for fp16
Change: 127233960
|
|
|
|
| |
Change: 126335170
|
|
|
|
|
|
| |
handle per-thread buffer allocation for the tileable executor without resorting to thread_local that is not fully supported on Android.
Change: 126009029
|
|
|
|
|
|
| |
will enable the implementation of the cumsum operation in TensorFlow
Change: 125697517
|
|
|
|
|
| |
performance of the toy mnist training by 1 order of magnitude
Change: 124374286
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
NEW
BM_fullReduction/10 4591 4595 153149 20.8M items/s
BM_fullReduction/64 5073 5075 100000 770.0M items/s
BM_fullReduction/512 9067 9070 75263 26.9G items/s
BM_fullReduction/4k 243984 244125 2868 64.0G items/s
BM_fullReduction/5k 359125 359273 1951 64.8G items/s
OLD
BM_fullReduction/10 9085 9087 74395 10.5M items/s
BM_fullReduction/64 9478 9478 72014 412.1M items/s
BM_fullReduction/512 14643 14646 46902 16.7G items/s
BM_fullReduction/4k 260338 260384 2678 60.0G items/s
BM_fullReduction/5k 385076 385178 1818 60.5G items/s
Change: 124290852
|
|
|
|
|
| |
gradients, some variants etc.).
Change: 124197406
|
|
|
|
| |
Change: 124183870
|
|
|
|
|
| |
gradients, some variants etc.).
Change: 123967787
|
|
|
|
|
| |
gradients, some variants etc.).
Change: 123967117
|
|
|
|
| |
Change: 123901292
|