Skip to content

Fixes & Version bump#33

Merged
benoitsteiner merged 34 commits intobenoitsteiner:masterfrom
lukeiwanski:master
Jan 30, 2017
Merged

Fixes & Version bump#33
benoitsteiner merged 34 commits intobenoitsteiner:masterfrom
lukeiwanski:master

Conversation

@lukeiwanski
Copy link
Collaborator

luke iwanski added 24 commits December 30, 2016 10:44
This commit reduces number of failing tests when TensorFlow compiles
for OpenCL support.

 - Registration of Type Traits required for stride slise op
 - Registration of ConcatOffset, _ListToArray, _ArrayToList
   Pad, Reverse ( CPU ), ReverseV2 ( CPU ), Size, ExpandDims,
   Squeeze, StridedSlice, StridedSliceGrad, StridedSliceAssign,
   TileGrad, InvertPermutation, Transpose
 - Registration of Sycl kernels only for essential data types
 - Floor_div_real has been disabled for SYCL device
 - Device in control_flow_ops_py_test.py needed to be lower cased
Removed FloorDiv, Reverse and ReverseV2 registration for SYCL device
as the cpu implementation of the kernels won't be able to access
the input data.
 - Expm1, Greater, GreaterEqual, Maximum, Reciprocal, Select, Sigmoid, Sign Ops for SYCL
 - Disables ComplexPow for SYCL device
 - Registers Cast, LessEqual, Minimum, MatMul Ops
 - Disables BatchMatMul test for SYCL device
 - MatMul only for floats
 - Int32 Add Op registration
 - Equal, Max, Mean, Min, Prod Op registered for SYCL
 - Disabled Mean_XXX tests for SYCL device due to lack
   of support from Eigen
     - ComputeCpp no longer compiles external projects nor llvm
     - ComputeCpp checks file to determin if the device compilation pass is needed
 - ReduceFunctor specialisation for MeanReducer on the SYCLDevice
 - Re-enable Mean tests
 - Fixes documentation issue mentioned in benoitsteiner#32 (comment)
Conflicts:
	tensorflow/workspace.bzl
@lukeiwanski lukeiwanski force-pushed the master branch 3 times, most recently from 1a961a2 to 58e4d7d Compare January 27, 2017 12:21
@benoitsteiner benoitsteiner merged commit 26137d6 into benoitsteiner:master Jan 30, 2017
benoitsteiner pushed a commit that referenced this pull request Feb 10, 2017
benoitsteiner pushed a commit that referenced this pull request Feb 14, 2017
benoitsteiner added a commit that referenced this pull request Feb 25, 2017
* OpenCL improvements

Added Tile, Transpose and Range Ops double support for SYCL device.
Moved gpu_device_name() to test_util.py so now it can be used in force_gpu to pull either GPU or SYCL depending on what is available in the system.

* Improvements to the SYCL device support

 - Registration of Type Traits required for stride slice op
 - Registration of ConcatOffset, _ListToArray, _ArrayToList
   Pad, Reverse ( CPU ), ReverseV2 ( CPU ), Size, ExpandDims,
   Squeeze, StridedSlice, StridedSliceGrad, StridedSliceAssign,
   TileGrad, InvertPermutation, Transpose
 - Registration of Sycl kernels only for essential data types
 - Floor_div_real has been disabled for SYCL device
 - Device in control_flow_ops_py_test.py needed to be lower cased

* SYCL support improvements (#31)

* Improvements to the SYCL device support

This commit reduces number of failing tests when TensorFlow compiles
for OpenCL support.

 - Registration of Type Traits required for stride slice op
 - Registration of ConcatOffset, _ListToArray, _ArrayToList
   Pad, Reverse ( CPU ), ReverseV2 ( CPU ), Size, ExpandDims,
   Squeeze, StridedSlice, StridedSliceGrad, StridedSliceAssign,
   TileGrad, InvertPermutation, Transpose
 - Registration of Sycl kernels only for essential data types
 - Floor_div_real has been disabled for SYCL device
 - Device in control_flow_ops_py_test.py needed to be lower cased

* Fixes & Version bump (#33)

* Fix Unbuntu typo. (#38)

unbuntu -> ubuntu

* Add problem descriptions and solutions (#35)

* Add ComputeCpp lib folder to LD_LIBRARY_PATH

* Add ImportError problem + solution

If you get the error message "ImportError: libComputeCpp.so: cannot open shared
object file: No such file or directory", make sure you have added the
path to ComputeCpp's lib folder to your `LD_LIBRARY_PATH`.

* Add another ImportError problem + solution

If you get the error message "ImportError: cannot import name
'pywrap_tensorflow'" you may be standing in the TensorFlow directory.

* Improvements to the SYCL device support

* Registers FloorDiv, FloorMod and SoftMax Ops for SYCL device

* Workaround for 0 bytes allocation for SYCL device (#42)

* Sycl improvements (#44)

- Eigen version bump
 - Extends Cast and Cwise ops benchmark to cover Sycl device
 - Extends device_lib_test.py to cover Sycl device
 - Registers int32, string and ResourceHandler to run on host for
   Enter and RefEnter Sycl Ops
 - Enables RecudeMax op for Sycl since Eigen implementation is ready
 - Registers Less op for Sycl device

* Improved the formatting of the SYCL code

* Fixed compilation error.

* Made sure that using test sessions with force_gpu=True forces the
placement on a gpu device even if none is detected.
benoitsteiner pushed a commit that referenced this pull request Feb 25, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants