GSoC 2016 Log
Tracking project progress and contributions for scikit-learn.
-
- 22 Aug
#6586:
Refactor the code (thanks @MechCoder) and add tests.
- 21 Aug
#7218:
Make `enet_coordinate_descent_gram` support fused types.
-
- 19 Aug
#6586:
Refactor the code.
- 17 Aug - 18 Aug
#6913:
Make the documentation clearer.
#6586:
Fixing type issues.
- 14 Aug - 16 Aug
#7187:
PR merged: Rename CV params n_{folds,iter} to n_splits
- Close #7169, which aimed to address the inconsistency between different CV splitters.
-
- 12 Aug - 13 Aug
#7184:
PR merged: Add whats_new about fixing example.
#7187:
Open a new PR to rename CV params n_{folds,iter} to n_splits.
- 9 Aug - 11 Aug
#7170:
PR merged: Make silhouette_score support sparse X.
- Close #6317 and #6318, since silhouette_score now supports sparse X.
#6344:
PR merged: DOC Add LabelKFold in _BaseKFold's docstring.
- 8 Aug
- New blog post out! Workaround to use fused types class attributes.
-
- 6 Aug
#7153:
Make the neighbor tree module support Cython fused types.
- Find a bug in Cython and reproduce it here.
- 5 Aug
#7145:
Fix a trivial typo since I'm a little frustrated about CD issues.
- 3-4 Aug
- Trying to declare `data` as `void*` and cast it in every function.
- 1-2 Aug
- Still struggling with testing for the CD issues.
- 31 July
- Writing a blog post about how to reproduce the CI environment.
- Visiting my grandparents.
-
- 30 July:
- Struggling with a Cython compilation error.
- 28-29 July:
- Investigating the true cause of the CD precision error, following this comment (thanks, @MechCoder).
- Reading the sections of the O'Reilly Cython book that cover using C++ with Cython.
#6913:
Fix PEP8.
- 26-27 July:
- Try C++ with Cython.
#6913:
Fix PEP8.
- 24-25 July:
#6913:
Set up a Windows 32-bit VM and an Ubuntu 14.04 VM for scikit-learn development in order to track down the mysterious bug; still can't figure it out after carefully inspecting the input.
#6913:
Decide to continue the PR by checking that the machine is not 32-bit, to avoid the error.
- Reproduce the fused types error and ask about it on the Cython mailing list to confirm.
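The log does not say how the PR detected a 32-bit machine. As a minimal sketch, assuming the check is done from Python, one common way is to look at the pointer size of the running interpreter. The function names below are hypothetical and are not taken from the PR.

```python
import struct


def pointer_width_bits():
    """Return the pointer width of the running interpreter, in bits."""
    # struct.calcsize("P") is the size in bytes of a C `void *`.
    return struct.calcsize("P") * 8


def is_32bit_python():
    """Return True when the interpreter itself is a 32-bit build."""
    return pointer_width_bits() == 32


if __name__ == "__main__":
    print("pointer width:", pointer_width_bits(), "bits")
    print("32-bit build:", is_32bit_python())
```

Note that this reports the architecture of the Python build, not of the operating system: a 32-bit Python on a 64-bit OS still counts as 32-bit, which is the case that matters when compiled extensions are involved.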
-
- 23 July:
#6913:
Checking the input data for the saxpy function call at
line 131; it raises an error on 32-bit machines, and I still can't figure out why.
- New blog post: Interactive Cython with IPython, no compilation anymore!
- Change my blog theme to make it easier to read.
- 22 July:
#6913:
Find that the bug occurs when calling the saxpy function at
line 131, but still don't know the root cause.
#7059:
Try to use `void*` to declare data and then typecast it in each function;
this fails for now.
- Discover that Cython can be tested in IPython without a separate compilation step, which is super
useful for trying out new things. Will write a blog post about it tomorrow.
- 21 July:
#6913:
Using
this mechanism provided by AppVeyor to debug.
#7059:
Discuss how to make the neighbor trees module support fused types.
- Create a branch to illustrate my proposed solution since
fused types can't be used to declare class attributes.
- 20 July:
- New blog post -
How to set up 32bit scikit learn on Mac without additional installation
#6913:
- Trying to find ways to test scikit-learn when PYTHON_ARCH=32.
#7053:
- Reduce duplicated code using fused-types function pointers.
- 19 July:
- New blog post -
Using Function Pointer to Maximize Code Reusability in Cython
- Report the analysis of making SequentialDataset support Cython fused types; see the
analysis report
here.
- 17-18 July:
#6913:
Addressing reviewers' comments.
-
- 14 July:
#6913:
Writing blog post about CD with fused types. (Although CI is not happy)
- 12-13 July:
- Investigating whether SequentialDataset can be modified to support Cython fused types.
- 11 July:
#6913:
- Make CD algorithms in scikit-learn support fused types when data is in sparse format.
-
- 8 July:
#6913:
Add tests.
- 7 July:
#6913:
- Resolve code duplication problem in PR #6913 of scikit-learn.
- Try dsdot, sdsdot to fix precision problems when making CD algorithm support fused types.
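To illustrate why a double-precision accumulator such as the one in BLAS `dsdot`/`sdsdot` can help here, the following is a minimal pure-Python sketch, not the code used in the PR. It emulates float32 rounding with `struct` and compares a dot product accumulated in float32 against one accumulated in float64 and rounded once at the end. All function names and the test data are assumptions made for illustration.

```python
import struct


def to_float32(x):
    """Round a Python float (float64) to the nearest float32 value."""
    return struct.unpack("f", struct.pack("f", x))[0]


def dot_float32_accumulator(xs, ys):
    """Dot product with every product and partial sum rounded to float32."""
    acc = 0.0
    for x, y in zip(xs, ys):
        acc = to_float32(acc + to_float32(x * y))
    return acc


def dot_float64_accumulator(xs, ys):
    """Dot product accumulated in float64, rounded to float32 only once."""
    acc = 0.0
    for x, y in zip(xs, ys):
        acc += x * y
    return to_float32(acc)


def accumulation_errors(n=100_000):
    """Return (float32-accumulator error, float64-accumulator error)."""
    x = to_float32(0.1)
    xs = [x] * n
    ys = [1.0] * n
    exact = x * n  # exactly representable in float64 for this n
    err32 = abs(dot_float32_accumulator(xs, ys) - exact)
    err64 = abs(dot_float64_accumulator(xs, ys) - exact)
    return err32, err64


if __name__ == "__main__":
    err32, err64 = accumulation_errors()
    print("error with float32 accumulator:", err32)
    print("error with float64 accumulator:", err64)
```

In the float32 version the rounding error is committed at every step and builds up over a long vector. In the float64 version only the final rounding to float32 remains.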
- 6 July:
Create a repo to illustrate precision problems when making CD algorithm in scikit-learn support fused types.
- 3-4 July:
#6913:
Investigating why CD does not work when dealing with float32 data.
-
- 1-2 July:
#6913:
Make dtype correct and update the profiling results. (Bugs still exist :\ )
- 29-30 June:
Leave school and move house.
- 27-28 June:
#6913:
Profile results.
Debugging.
Adding source files.
- 26 June:
#6913:
Trying to make ElasticNet support fused types.
-
- New blog post -
scikit-learn KMeans Now Support Fused Types.
- 23-25 June:
#6913:
Learning coordinate descent.
- 22 June:
#6913:
Open this PR to make coordinate descent related algorithms support fused types.
- 21 June:
#6846:
Pull request merged! KMeans in scikit-learn now supports fused types.
- 19 June:
#6846:
Benchmark the sparse input case and show that this PR indeed reduces memory usage.
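The benchmark itself is not reproduced in this log. As a rough sketch of where the saving comes from, assuming it is due to float32 data no longer being copied into float64 buffers, the standard-library `array` module shows the per-element cost of each type. The helper name is hypothetical.

```python
from array import array


def buffer_nbytes(values, typecode):
    """Return the number of bytes needed to store `values` with `typecode`."""
    buf = array(typecode, values)
    return buf.itemsize * len(buf)


if __name__ == "__main__":
    values = [0.5] * 1000
    # "f" is a C float (float32), "d" is a C double (float64).
    print("float32 buffer:", buffer_nbytes(values, "f"), "bytes")
    print("float64 buffer:", buffer_nbytes(values, "d"), "bytes")
```

For sparse input only the array of stored values shrinks this way; the index arrays keep their integer type, so the overall saving is smaller than a factor of two.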
-
- 18 June:
#6905:
Open this PR to make inline and helper functions in CD support fused types.
#6846:
Fix several bugs and add some tests suggested by reviewer.
- 17 June:
Create a minimal example to reproduce the errors that occur when Cython inheritance is combined with fused types.
#6846:
Remove redundant code.
Add tests for int input data.
- 15 June:
#6889:
Open this PR to make SGD support Cython fused types.
- 14 June:
#6846:
Make KMeans support Cython fused types.
Refactor tests of KMeans.
-