| Commit message (Collapse) | Author | Age |
|
|
|
| |
PiperOrigin-RevId: 183848459
|
|
|
|
| |
PiperOrigin-RevId: 175228264
|
|
|
|
| |
Change: 123900938
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Also move a few helper functions into the BCast class.
Before:
Benchmark Time(ns) CPU(ns) Iterations
------------------------------------------------
BM_BCastSetup/0 114 114 6062466 different_shapes
BM_BCastSetup/1 20 20 34863786 same_shapes
After:
Benchmark Time(ns) CPU(ns) Iterations
------------------------------------------------
BM_BCastSetup/0 126 126 5562646 different_shapes
BM_BCastSetup/1 32 32 21627499 same_shapes
The performance hit likely stems from 1-2 branch mispredictions (~7ns per misprediction).
Change: 121437477
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
where both shapes are the same by using much more straightforward
code to achieve the same ultimate initialization of the various
instance variables with simpler code.
Added benchmark for this to bcast_test.cc. Speeds up the same_shape
case by 65% (67 ns to 23 ns for a two-dimensional shape) without
any real effect on the different shape case.
Run on machine with (40 X 2801 MHz CPUs); 2016/01/28-11:12:26
CPU: Intel Ivybridge with HyperThreading (20 cores) dL1:32KB dL2:256KB dL3:25MB
Benchmark Base (ns) New (ns) Improvement
------------------------------------------------------------------
BM_BCastSetup/0 122 122 +0.0%
BM_BCastSetup/1 67 23 +65.7%
Change: 113374076
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Changes:
* error message that refers to removed `DefaultSession` method.
* -Wnull-conversion warnings
* the "_start_time" attr for recvs when the flag "--brain_enable_scheduling_for_recvs" is set.
* typo in tutorial data download progress message.
* a typo ("however their installing"=>"however installing").
* typo, rename "TensorFlow Mechanics" to "How To" to be consistent with the website.
* a typo ("subtact"=>"subtract").
* protobuf examples in comments in tensorflow::Example.proto.
* formula formatting in MNIST beginner tutorial
* negative fraction-of-queue-full stats
* protobuf inclusion path so that Android demo will build under Blaze.
* small typo (moderatly > moderately)
* Session.run() to check that tensor arguments come from the session's graph.
* another six import
* seq2seq typo in bazel command
Base CL: 108349164
|
|
TensorFlow is an open source software library for numerical computation
using data flow graphs.
Base CL: 107276108
|