
Feature pruning #106

Merged
cowchipkid merged 5 commits into master from FeaturePruning
Sep 12, 2017

Conversation

@cowchipkid
Contributor

The feature pruning implementation prunes low-value features for SVM, LTU subclasses, and sparse nets.


// save the final model.
System.out.println("Writing " + getName());
learner.save(); // Doesn't write .lex if lexicon is empty.
Member

@cowchipkid can you clarify the change here? I'm confused about what's happening.

Contributor Author

I moved the accuracy reporting outside the block to ensure doneTraining() gets called before accuracy is reported. doneTraining() applies the feature optimization, and we want the score after that. If accuracy is reported before doneTraining(), we get the score of the un-optimized models, which would be all for naught.
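The ordering described above can be sketched as follows. This is a minimal toy, not the actual LBJava API: `ToyLearner` and `accuracy` are illustrative stand-ins, and pruning is reduced to a flag flip in `doneTraining()`.

```java
import java.util.List;

public class TrainThenScore {
    /** Toy stand-in for the real learner; doneTraining() is where pruning would run. */
    static class ToyLearner {
        boolean optimized = false;
        void doneTraining() { optimized = true; } // feature optimization applied here
        boolean classify(int x) { return x > 0; }
    }

    /** Fraction of examples classified correctly. */
    static double accuracy(ToyLearner l, List<Integer> xs, List<Boolean> ys) {
        int correct = 0;
        for (int i = 0; i < xs.size(); i++)
            if (l.classify(xs.get(i)) == ys.get(i)) correct++;
        return (double) correct / xs.size();
    }

    public static void main(String[] args) {
        ToyLearner learner = new ToyLearner();
        // Finish training first so the optimization is applied...
        learner.doneTraining();
        // ...then report accuracy, so the score reflects the optimized model.
        double acc = accuracy(learner, List.of(1, -2, 3), List.of(true, false, true));
        System.out.println("optimized=" + learner.optimized + " acc=" + acc);
    }
}
```

Scoring before `doneTraining()` would measure a model whose weights are about to change, which is exactly the bug being fixed.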

Member

Ah I see. Makes sense.

@cowchipkid cowchipkid requested a review from mssammon August 25, 2017 16:29
continue;
}
double wt = ltu.getWeightVector().getRawWeights().get(fi);

Contributor Author

Yeah, this is wrong. We need to call hasWeight here, I think; this won't work for SparseAveragedPerceptron, where we need to sum the past average with the actual weight.

Contributor Author

Also, it doesn't even address absolute zero.
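A hedged sketch of the check being discussed in these two comments: for an averaged perceptron the effective weight is the raw weight plus the running average, and the comparison should use the absolute value so that negative and exactly-zero weights are handled. The method name `prunable` and its signature are illustrative, not the actual LBJava code.

```java
public class PruningCheck {
    /**
     * Decide whether a feature's weight is small enough to prune.
     * For the SparseAveragedPerceptron case (per the comment above),
     * the past average must be summed with the current raw weight.
     */
    static boolean prunable(double rawWeight, double averagedWeight, double threshold) {
        double effective = rawWeight + averagedWeight;
        // Absolute value: prunes near-zero weights of either sign,
        // and an exactly-zero weight as well.
        return Math.abs(effective) < threshold;
    }

    public static void main(String[] args) {
        System.out.println(prunable(1e-9, 0.0, 1e-6));  // effectively zero: prune
        System.out.println(prunable(-0.5, 0.4, 1e-6));  // magnitude 0.1: keep
        System.out.println(prunable(0.0, 0.0, 1e-6));   // absolute zero: prune
    }
}
```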

@cowchipkid
Contributor Author

The documentation for the feature pruning stuff is in package-info.java, in the package with all the optimization code. It was already in place; I enhanced it a bit and added some links into the life-cycle methods. I also fixed one bug that likely had no impact: it would not disable pruning when you set the threshold to zero.

for (int i = 0; i < indexes.length; ++i) {
Feature f = inverse.get(indexes[i]);
previousClassName =
previousClassName =
Member

drop the extra space?

/** Default for {@link #bias}. */
public static final double defaultBias = 1.0;
/** Any weight whose magnitude is less than this is considered irrelevant. This is for pruning. */
public static final double defaultFeaturePruningThreshold = 0.000001;
Member

Is this always used in an absolute sense? I.e., does it always have to be positive?
In other words, if it's zero, no pruning will be done.

Contributor Author

@danyaljj setting the threshold to zero is how you disable pruning, effectively.
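A sketch of why a zero threshold disables pruning, assuming the strict `|w| < threshold` comparison implied by the discussion above (the method name `shouldPrune` is illustrative; only `defaultFeaturePruningThreshold` comes from the diff): no absolute value is ever below zero, so nothing is ever pruned.

```java
public class ThresholdSemantics {
    // From the diff above: the default pruning threshold.
    static final double DEFAULT_THRESHOLD = 0.000001;

    /** Strict comparison: with threshold 0, this is false for every weight. */
    static boolean shouldPrune(double weight, double threshold) {
        return Math.abs(weight) < threshold;
    }

    public static void main(String[] args) {
        System.out.println(shouldPrune(1e-8, DEFAULT_THRESHOLD)); // tiny weight: pruned
        System.out.println(shouldPrune(0.0, 0.0));                // threshold 0: never prunes
        System.out.println(shouldPrune(0.5, 0.0));                // threshold 0: never prunes
    }
}
```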

@danyaljj
Member

danyaljj commented Sep 8, 2017

A minor comment; looks good to me!
Unless @mssammon plans to go through this, feel free to merge it.

@mssammon
Contributor

mssammon commented Sep 9, 2017

I am still in the thick of post-move activity; feel free to merge.

@cowchipkid cowchipkid self-assigned this Sep 12, 2017
@cowchipkid cowchipkid merged commit 32587db into master Sep 12, 2017
