Sparse wide deep learning by whatbeg · Pull Request #4 · qiuxin2012/BigDL

whatbeg · 2017-09-14T02:17:07Z

What changes were proposed in this pull request?

(Please fill in changes proposed in this patch)

How was this patch tested?

(Please explain how this patch was tested. E.g. unit tests, integration tests, manual tests)
(If it is possible, please attach a screenshot; otherwise, remove this)

Related links or issues (optional)

fixed https://github.com/intel-analytics/BigDL/issues/XXX

whatbeg · 2017-09-19T06:18:17Z

@qiuxin2012 Merging this into your widedeep branch produces no conflicts.


This feature enables mkl-dnn support, which can speed up deep learning models. We wrap the native C API in Java; the wrappers are in the BigDL-core project. In BigDL, we integrated convolution, batchnorm, maxpooling, avgpooling, relu, lrn, softmax, caddtable and concattable. Currently, it only supports creating a model that contains only dnn layers or containers.

Because the data layout is optimized in mkl-dnn, the mkl-dnn model uses `DnnTensor`, which contains the native buffer, as its default tensor. So there are some points to note:

1. The user should copy the data from the JVM heap at the first layer and copy it back to the JVM heap at the last layer.
2. The user should compile the model, specifying the phase (training/inference) and the input tensor size. The model will infer and allocate the other information.

* fix: linear performance issue and serialization of java object in MklDnnTensor
* memory leak refactor
* memory leak and bn performance issues
  1. Memory leak: The internal buffer of MklDnnTensor should not be re-assigned without being released, so we check it first. At the first iteration, or after the input size changes, we create a new MklDnnTensor as a buffer.
  2. Bn perf: The JIT BatchNormalization only supports avx2 or avx512, and it performs much better than the ref version. The input and gradOutput formats should be the same to get the best performance.
* test: add some test cases for BatchNorm. Float computation on the JVM does not give the same results as C/C++/native code, and batch norm makes the difference much greater, such as 10^-8 -> 10^-4 -> 10^-1
* fix: rebase with upstream master:
  1. Concat and ConcatTable should inherit from DynamicContainer.
  2. updateParameters has been deprecated.
  3. zeroGradParameters should be final. But from now on, the Linear should use it.
  4. Some other syntax or semantic errors.
* perf: single node and single model performance
* perf: single model
* feat: add fusion for mkl-dnn
* test: add test utils to compare dnn output
* test: add some tests compared with caffe
* add unit tests for dnn tensor
* add unit test for reorder memory
* test: fix the test regression errors
* checkin reorder manager
* add backward for sequential
* fix some bugs
* update core ref
* add unit tests
* refactor: move the static class DataType, AlgKind and so on to standalone class (#4)
* refactor: delete MklDnn.MemoryFormat
* refactor: move the static class DataType, AlgKind and so on to standalone class
* fix: core refactor errors
* refactor: spec errors (#5)
* Mkl dnn dev (#6)
* checkin reorder manager
* add container and refine reorder manager
* fix merge issue
* add join table forward
* refine interface (#7)
* add LRN and ReLU
* add pooling
* refactor: conv + linear + bn
* add JoinTable backward
* refactor: conv + linear + bn
* add cAddTable concattable
* fix: reorder failed on some of convs
* refactor: softmax
* refactor: fusion support
* refactor: resnet_50
* refactor: move tests to this branch
* refactor: delete useless files and enable the special old tests. refactor: delete unused methods in MklDnnOps. fix: scalastyle check
* fix: rebase with upstream
* fix: ignore the prototxt tests
* fix: do not change the core commit ref
* fix: move set num of threads for mkldnn to ResNet50Perf
* fix: serialization disabled for mkldnn module
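
The memory leak note in the commit message above (release the internal buffer before re-assigning it, and allocate a new one only at the first iteration or when the input size changes) can be sketched roughly as follows. This is a minimal illustration in Python, not BigDL code: `NativeBuffer`, `Layer` and `ensure_buffer` are hypothetical names, and the `live` counter only stands in for native memory that the garbage collector would not reclaim.

```python
class NativeBuffer:
    """Hypothetical stand-in for a natively allocated buffer (not a BigDL class)."""

    live = 0  # buffers allocated and not yet released

    def __init__(self, size):
        self.size = size
        self.released = False
        NativeBuffer.live += 1

    def release(self):
        # Releasing twice is harmless; the counter drops only once.
        if not self.released:
            self.released = True
            NativeBuffer.live -= 1


class Layer:
    """Hypothetical layer that owns a single internal buffer."""

    def __init__(self):
        self.buffer = None

    def ensure_buffer(self, input_size):
        # Same input size as before: reuse the existing buffer.
        if self.buffer is not None and self.buffer.size == input_size:
            return self.buffer
        # First iteration or changed input size: release the old buffer
        # (if any) before re-assigning, otherwise it would leak.
        if self.buffer is not None:
            self.buffer.release()
        self.buffer = NativeBuffer(input_size)
        return self.buffer


layer = Layer()
first = layer.ensure_buffer(128)    # first iteration: allocate
same = layer.ensure_buffer(128)     # unchanged size: reuse
resized = layer.ensure_buffer(256)  # size changed: release, then allocate
```

After these three calls only one buffer is still live. Without the release step, the 128-element buffer would stay allocated after the resize.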

add utils for widedeep

5a3b38b

whatbeg force-pushed the sparseWideDeep branch 2 times, most recently from e52c07a to 7229bde on September 14, 2017 02:24

add sparse wide deep learning

960c49d

whatbeg force-pushed the sparseWideDeep branch from a5bb95a to 960c49d on September 14, 2017 02:25

whatbeg added 3 commits September 14, 2017 15:01

Spec plan

6e33e6b

dd

0ba407d

widedeep notebook 09/15

5b9edd4

whatbeg force-pushed the sparseWideDeep branch from 8348b5c to 5b9edd4 on September 15, 2017 08:07

fix SparseTensorBLAS

b9bf7a1

qiuxin2012 pushed a commit that referenced this pull request Feb 26, 2018

bug fix: DLModel prediction (intel#2194)

43f023f

* bug fix: DLModel prediction (#4). Make sure DLModel.train=False when predicting in pipeline API
* 1. broadcast transformer in DLModel.transform; 2. remove useless unit test


whatbeg wants to merge 6 commits into qiuxin2012:widedeep from whatbeg:sparseWideDeep
