Add smoothing spline baseline by aafkevandenberg · Pull Request #299 · lumicks/pylake

aafkevandenberg · 2022-04-25T09:33:24Z

I added functions to fit a baseline with a smoothing spline.
The smoothing factor is optimized using k-fold cross validation.
Therefore I added a function to plot the mse on the test data vs the smoothing factors.

Let me know what you think.

I think there is a little tricky thing here regarding the testing. Because the distance should be used as-is (with baseline) and the force should be baseline corrected, you end up with an implicit equation in the trap position: `(trap_position - trap2_ref) = tether_length + 2 * bead_radius + 2 * (wlc_force + baseline(trap_position)) / stiffness` Note how trap position appears on the left and right thanks to the baseline.

JoepVanlier

So, I've had a first look at this and come to the same conclusion as you, that the smoothing factor based optimization doesn't really work that well unless you also downsample. This suggests that we should provide a good default for this. Otherwise, looking pretty good already!

For tests, I think adding a test which tests unique_sorted and one that tests the baseline (both code paths, with fixed and non-fixed smoothing factor) should suffice.

Adding a package involves adding it to setup.py. For conda, it might be a bit trickier since I don't see this package on conda-forge. I'd have to look into it.

To muddy the waters a bit more, another class of algorithms that's probably worth exploring for this purpose is LO(W)ESS regression.

JoepVanlier · 2022-05-04T13:07:59Z

+    x = trap_position.data
+    u, c = np.unique(x, return_counts=True)
+    m = np.isin(x, [u[c < 2]])
+    ind = np.argsort(x[m])
+
+    return x[m][ind], force.data[m][ind]


Could consider simplifying this to:

def unique_sorted(trap_position, force): """Sort and remove duplicates trap_position data to prepare for fit smoothing spline. Parameters ---------- trap_position : array_like Trap mirror position force : array_like Force data """ sorted_position, idx, count = np.unique(trap_position, return_index=True, return_counts=True) return sorted_position[count < 2], force[idx][count < 2]

Saves you a search and a sort (note that the result from np.unique is already sorted).

Given that you return the raw numpy arrays rather than slices, I would also consider just taking raw numpy arrays as input and extracting the data where its called (rather than passing a slice).

JoepVanlier · 2022-05-04T14:03:08Z

+            Trap mirror position data
+        force : lumicks.pylake.Slice
+            Force data
+        smoothing_factor : float


I wonder whether it would make sense to merge smoothing_factor and smoothing_factors.

If the input is just a value or has only one element, you use it as fixed value and otherwise you perform the optimization. You can use np.atleast_1d to upconvert a value to a numpy array so that you can always use the len operation on it. The default can then just be a default list that generally works.

JoepVanlier · 2022-05-04T14:13:52Z

+            )
+            model = csaps(x_sorted, y_sorted, smooth=smoothing_factor)
+
+        return cls(model, trap_position, force)


One issue with calling the constructor like this is that you don't actually put the data that you used for fitting in now (you fitted to x_sorted and y_sorted but store the original raw data).

JoepVanlier · 2022-05-04T14:28:26Z

+    mse_test_vals = np.zeros(len(smoothing_factors))
+    x_sorted, y_sorted = unique_sorted(trap_position, force)


One thing I find a bit surprising is that you choose to pass trap position and force to this function, despite already having x_sorted and y_sorted. Is there a specific reason you prefer this?

JoepVanlier · 2022-05-04T16:10:44Z

+    force,
+    smoothing_factors,
+    n_repeats,
+    plot_smoothingfactor_mse,


Considering that the function has smoothing_factor in the name, I reckon plot_mse would be sufficient.

JoepVanlier and others added 5 commits April 15, 2022 17:10

piezo_tracking: piezo distance from mirror position

52b3da3

piezo_tracking: support baseline correction

fdc862d

piezo_tracking: add downsampling option for feature parity

381fba8

add functions for fitting baseline with smoothing spline

14a4374

aafkevandenberg requested a review from JoepVanlier April 25, 2022 09:33

update variable names

32c3e81

JoepVanlier requested changes May 4, 2022

View reviewed changes

JoepVanlier changed the base branch from main to piezo_tracking May 5, 2022 13:58

JoepVanlier changed the base branch from piezo_tracking to main May 5, 2022 13:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add smoothing spline baseline#299

Add smoothing spline baseline#299
aafkevandenberg wants to merge 6 commits into
mainfrom
add-smoothing-spline-baseline

aafkevandenberg commented Apr 25, 2022

Uh oh!

JoepVanlier left a comment •

edited

Loading

Uh oh!

JoepVanlier May 4, 2022

Uh oh!

JoepVanlier May 4, 2022

Uh oh!

JoepVanlier May 4, 2022

Uh oh!

JoepVanlier May 4, 2022

Uh oh!

JoepVanlier May 4, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		mse_test_vals = np.zeros(len(smoothing_factors))
		x_sorted, y_sorted = unique_sorted(trap_position, force)

Conversation

aafkevandenberg commented Apr 25, 2022

Uh oh!

JoepVanlier left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

JoepVanlier May 4, 2022

Choose a reason for hiding this comment

Uh oh!

JoepVanlier May 4, 2022

Choose a reason for hiding this comment

Uh oh!

JoepVanlier May 4, 2022

Choose a reason for hiding this comment

Uh oh!

JoepVanlier May 4, 2022

Choose a reason for hiding this comment

Uh oh!

JoepVanlier May 4, 2022

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

JoepVanlier left a comment •

edited

Loading