Fixes & Version bump#33
Merged
benoitsteiner merged 34 commits intobenoitsteiner:masterfrom Jan 30, 2017
Merged
Conversation
Collaborator
lukeiwanski
commented
Jan 22, 2017
- Fixes documentation issue mentioned in Correct path and update version for ComputeCpp #32 (comment)
- Bumps Eigen to the latest version
added 24 commits
December 30, 2016 10:44
This commit reduces number of failing tests when TensorFlow compiles for OpenCL support. - Registration of Type Traits required for stride slise op - Registration of ConcatOffset, _ListToArray, _ArrayToList Pad, Reverse ( CPU ), ReverseV2 ( CPU ), Size, ExpandDims, Squeeze, StridedSlice, StridedSliceGrad, StridedSliceAssign, TileGrad, InvertPermutation, Transpose - Registration of Sycl kernels only for essential data types - Floor_div_real has been disabled for SYCL device - Device in control_flow_ops_py_test.py needed to be lower cased
Removed FloorDiv, Reverse and ReverseV2 registration for SYCL device as the cpu implementation of the kernels won't be able to access the input data.
- Expm1, Greater, GreaterEqual, Maximum, Reciprocal, Select, Sigmoid, Sign Ops for SYCL - Disables ComplexPow for SYCL device
- Registers Cast, LessEqual, Minimum, MatMul Ops - Disables BatchMatMul test for SYCL device
- MatMul only for floats - Int32 Add Op registration
- Equal, Max, Mean, Min, Prod Op registered for SYCL - Disabled Mean_XXX tests for SYCL device due to lack of support from Eigen
- ComputeCpp no longer compiles external projects nor llvm
- ComputeCpp checks file to determin if the device compilation pass is needed
- ReduceFunctor specialisation for MeanReducer on the SYCLDevice - Re-enable Mean tests
This reverts commit e1f8235.
- Fixes documentation issue mentioned in benoitsteiner#32 (comment)
Conflicts: tensorflow/workspace.bzl
This was referenced Jan 22, 2017
1a961a2 to
58e4d7d
Compare
benoitsteiner
pushed a commit
that referenced
this pull request
Feb 10, 2017
benoitsteiner
pushed a commit
that referenced
this pull request
Feb 14, 2017
benoitsteiner
added a commit
that referenced
this pull request
Feb 25, 2017
* OpenCL improvements Added Tile, Transpose and Range Ops double support for SYCL device. Moved gpu_device_name() to test_util.py so now it can be used in force_gpu to pull either GPU or SYCL depending on what is available in the system. * Improvements to the SYCL device support - Registration of Type Traits required for stride slice op - Registration of ConcatOffset, _ListToArray, _ArrayToList Pad, Reverse ( CPU ), ReverseV2 ( CPU ), Size, ExpandDims, Squeeze, StridedSlice, StridedSliceGrad, StridedSliceAssign, TileGrad, InvertPermutation, Transpose - Registration of Sycl kernels only for essential data types - Floor_div_real has been disabled for SYCL device - Device in control_flow_ops_py_test.py needed to be lower cased * SYCL support improvements (#31) * Improvements to the SYCL device support This commit reduces number of failing tests when TensorFlow compiles for OpenCL support. - Registration of Type Traits required for stride slice op - Registration of ConcatOffset, _ListToArray, _ArrayToList Pad, Reverse ( CPU ), ReverseV2 ( CPU ), Size, ExpandDims, Squeeze, StridedSlice, StridedSliceGrad, StridedSliceAssign, TileGrad, InvertPermutation, Transpose - Registration of Sycl kernels only for essential data types - Floor_div_real has been disabled for SYCL device - Device in control_flow_ops_py_test.py needed to be lower cased * Fixes & Version bump (#33) * Fix Unbuntu typo. (#38) unbuntu -> ubuntu * Add problem descriptions and solutions (#35) * Add ComputeCpp lib folder to LD_LIBRARY_PATH * Add ImportError problem + solution If you get the error message "ImportError: libComputeCpp.so: cannot open shared object file: No such file or directory", make sure you have added the path to ComputeCpp's lib folder to your `LD_LIBRARY_PATH`. * Add another ImportError problem + solution If you get the error message "ImportError: cannot import name 'pywrap_tensorflow'" you may be standing in the TensorFlow directory. * Improvements to the SYCL device support * Registers FloorDiv, FloorMod and SoftMax Ops for SYCL device * Workaround for 0 bytes allocation for SYCL device (#42) * Sycl improvements (#44) - Eigen version bump - Extends Cast and Cwise ops benchmark to cover Sycl device - Extends device_lib_test.py to cover Sycl device - Registers int32, string and ResourceHandler to run on host for Enter and RefEnter Sycl Ops - Enables RecudeMax op for Sycl since Eigen implementation is ready - Registers Less op for Sycl device * Improved the formatting of the SYCL code * Fixed compilation error. * Made sure that using test sessions with force_gpu=True forces the placement on a gpu device even if none is detected.
benoitsteiner
pushed a commit
that referenced
this pull request
Feb 25, 2017
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.