Add LightGCN support to FlexMF by mdekstrand · Pull Request #1033 · lenskit/lkpy

mdekstrand · 2026-03-05T23:57:26Z

This updates FlexMF to support LightGCN (for implicit feedback), activated by a new convolution_layers configuration setting. With zero layers, the model is classical matrix factorization.

Closes #1019.
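
For readers unfamiliar with how the number of convolution layers relates to plain matrix factorization, here is a minimal sketch of LightGCN-style propagation. It is not the FlexMF implementation: the function names (`propagate`, `score`), the plain-list representation, and the uniform layer averaging are assumptions made for illustration, and it assumes each (user, item) pair appears only once.

```python
import math


def propagate(interactions, user_emb, item_emb, convolution_layers):
    """Return layer-averaged (user, item) embeddings, LightGCN-style.

    interactions: unique (user_index, item_index) pairs (illustrative input).
    user_emb / item_emb: lists of equal-length float vectors (layer 0).
    """
    dim = len(user_emb[0])
    # Node degrees in the bipartite user-item graph.
    user_deg = [0] * len(user_emb)
    item_deg = [0] * len(item_emb)
    for u, i in interactions:
        user_deg[u] += 1
        item_deg[i] += 1

    user_layer, item_layer = user_emb, item_emb
    # Running sums over layers 0..K; copied so the inputs are not mutated.
    user_sum = [row[:] for row in user_emb]
    item_sum = [row[:] for row in item_emb]

    for _ in range(convolution_layers):
        user_next = [[0.0] * dim for _ in user_emb]
        item_next = [[0.0] * dim for _ in item_emb]
        for u, i in interactions:
            # Symmetric normalization: 1 / sqrt(deg(u) * deg(i)).
            w = 1.0 / math.sqrt(user_deg[u] * item_deg[i])
            for d in range(dim):
                user_next[u][d] += w * item_layer[i][d]
                item_next[i][d] += w * user_layer[u][d]
        user_layer, item_layer = user_next, item_next
        # Accumulate this layer into the running sums.
        for total, layer in ((user_sum, user_layer), (item_sum, item_layer)):
            for row_total, row in zip(total, layer):
                for d in range(dim):
                    row_total[d] += row[d]

    # Uniform average over the K + 1 layers (layer 0 included).
    scale = 1.0 / (convolution_layers + 1)
    users = [[v * scale for v in row] for row in user_sum]
    items = [[v * scale for v in row] for row in item_sum]
    return users, items


def score(users, items, u, i):
    """Dot-product score between final user and item embeddings."""
    return sum(a * b for a, b in zip(users[u], items[i]))
```

With `convolution_layers=0` the loop never runs and the averaged embeddings are just the layer-0 embeddings, so scoring is an ordinary matrix-factorization dot product.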

codecov · 2026-03-06T00:56:16Z

Codecov Report

❌ Patch coverage is 88.77551% with 11 lines in your changes missing coverage. Please review.
✅ Project coverage is 89.48%. Comparing base (82d88b3) to head (3bcfe0e).
⚠️ Report is 24 commits behind head on main.

Files with missing lines	Patch %	Lines
src/lenskit/flexmf/_training.py	70.83%	7 Missing ⚠️
src/lenskit/flexmf/_model.py	93.02%	3 Missing ⚠️
src/lenskit/training.py	80.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1033      +/-   ##
==========================================
+ Coverage   89.47%   89.48%   +0.01%     
==========================================
  Files         222      222              
  Lines       15372    15441      +69     
==========================================
+ Hits        13754    13818      +64     
- Misses       1618     1623       +5


samiravaez · 2026-03-06T23:37:04Z

User and item bias are getting default values in different places, and I'm not sure which one ends up being used. As far as I saw, the paper doesn't use bias terms for scoring, so I think it makes sense for both defaults to be false.

mdekstrand · 2026-03-07T21:55:57Z

@samiravaez

This is a good thing to think about… there is a tradeoff in the configurability and defaults:

- Should the defaults be optimized for "pretty good" or for fidelity to the original paper? (I lean towards "pretty good".)
- Should the defaults be consistent across configurations, or have complex dependencies on other settings? (I lean towards consistent, to make it easier to configure the model and to make it easier to document.)

The original BPR doesn't include a bias term, and I don't remember if the logistic matrix factorization paper did or not.

We should run some experiments to confirm, but I expect including item biases will usually be better, because the model doesn't have to model overall item popularity with the embeddings; the embeddings can focus on modeling user preference after popularity is accounted for. User biases only make sense with the logistic loss function.
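
One likely reason behind the remark about user biases is that a pairwise loss such as BPR only sees the difference between two scores for the same user, so a per-user bias cancels out, while a pointwise logistic loss sees each score directly. The sketch below illustrates this with made-up numbers; the helper names (`score`, `bpr_loss`, `logistic_loss`) are hypothetical and are not FlexMF functions.

```python
import math


def score(user_vec, item_vec, user_bias=0.0, item_bias=0.0):
    """Dot product plus optional bias terms."""
    return sum(a * b for a, b in zip(user_vec, item_vec)) + user_bias + item_bias


def bpr_loss(pos_score, neg_score):
    """Pairwise BPR loss: -log(sigmoid(pos - neg))."""
    return math.log1p(math.exp(-(pos_score - neg_score)))


def logistic_loss(s, label):
    """Pointwise logistic loss for a label in {0, 1}."""
    sign = 1.0 if label else -1.0
    return math.log1p(math.exp(-sign * s))


# Illustrative values only.
user = [0.5, 1.0]
pos_item, neg_item = [1.0, 0.5], [0.25, 0.0]
pos_bias, neg_bias = 0.5, -0.25


def losses(user_bias):
    pos = score(user, pos_item, user_bias, pos_bias)
    neg = score(user, neg_item, user_bias, neg_bias)
    return bpr_loss(pos, neg), logistic_loss(pos, 1)


bpr_without, logistic_without = losses(user_bias=0.0)
bpr_with, logistic_with = losses(user_bias=3.0)
```

Here `bpr_with` and `bpr_without` agree because the user bias is added to both the positive and the negative score, whereas the logistic loss changes with the user bias. The item biases do not cancel in either loss, which is consistent with their being useful for capturing popularity.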

I agree that we shouldn't have differing defaults. I pushed a change that removes defaulting entirely from the model class, and the explicit-feedback scorer now passes True explicitly.
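
As a rough illustration of the "no defaults on the model class" design, the sketch below uses required keyword-only arguments so that every caller has to state its bias choice. The class and function names (`ToyMFModel`, `build_explicit_model`) are hypothetical, and the assumption that the explicit-feedback caller passes True for both bias flags is mine, not something the change description spells out.

```python
class ToyMFModel:
    """Toy stand-in for a factorization model (hypothetical, not FlexMFModel)."""

    def __init__(self, n_users, n_items, *, user_bias: bool, item_bias: bool):
        # No defaults: the bias flags are keyword-only and required, so the
        # decision lives in exactly one place (the caller's configuration).
        self.n_users = n_users
        self.n_items = n_items
        self.user_bias = user_bias
        self.item_bias = item_bias


def build_explicit_model(n_users, n_items):
    """Hypothetical explicit-feedback caller that states its choice outright."""
    # Assumption: both flags are True for explicit feedback.
    return ToyMFModel(n_users, n_items, user_bias=True, item_bias=True)
```

Omitting either flag, as in `ToyMFModel(2, 3)`, raises a `TypeError`, which makes a forgotten or conflicting default visible immediately instead of silently picking one.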

(At some point, we should probably support convolution layers with explicit feedback too.)

samiravaez · 2026-03-08T00:49:14Z

I agree on the consistency issue; that makes sense.
But having a different default for each version of FlexMF also makes sense to me. I usually lean toward setting defaults to match the originally proposed method: when people use it, the name suggests that it matches the original paper. And since keeping the model minimal is the core idea of the paper, it makes sense to avoid training extra parameters.

mdekstrand added this to the 2026.1 milestone Mar 5, 2026

mdekstrand requested a review from samiravaez March 5, 2026 23:57

mdekstrand self-assigned this Mar 5, 2026

mdekstrand added the components LensKit recommendation components label Mar 5, 2026

mdekstrand force-pushed the feature/flexmf-lightgcn branch 2 times, most recently from ae0021b to 3be27cc, on March 6, 2026 18:50

samiravaez reviewed Mar 6, 2026

Comment thread src/lenskit/flexmf/_model.py

samiravaez reviewed Mar 6, 2026

Comment thread src/lenskit/flexmf/_model.py Outdated

mdekstrand marked this pull request as ready for review March 9, 2026 16:45

mdekstrand added 17 commits March 9, 2026 14:45

flexmf: initial attempt to add LightGCN (lenskit#1019)

fc956af

flexmf: update lightgcn pipeline TOML to use FlexMF

edf168f

flexmf: add convolution logging

bb9deb0

flexmf: fix convolution neighbor matrix access

ff116a5

flexmf: actually pass convolution layers to model

dc2ba5a

flexmf: update GCN to no longer use out

70b588a

flexmf: refactor convolution and detach weight matrices

e5a5c87

flexmf: make model compilation optional

75cfdc1

flexmf: fix item matrix indices

d7dd47e

flexmf: rename training data matrix -> interactions

7fc1077

flexmf: go back to update_convolution design

33a2bc1

add separate torch-geometric lightgcn

16e8b75

flexmf: make LightGCN embedding sizes consistent

5f1f16c

flexmf: fix average loss output

2ab1933

flexmf: use CSR for convolutions

d6da1da

training: add basic Torch profiling support

36bafe7

cli: better output for Ctrl+C

181c7fd

mdekstrand added 5 commits March 9, 2026 14:45

flexmf: fix LightGCN score, profile

c62626e

flexmf: fix user layer indexing for explicit regularization

7e0ea84

flexmf: remove defaults for bias on FlexMFModel

f49be16

flexmf: detach layers when leaving training mode

e9227d7

cli: add more nocover pragmas

4ad1bac

mdekstrand force-pushed the feature/flexmf-lightgcn branch from 377bd01 to 4ad1bac on March 9, 2026 18:45

flexmf: add LightGCN to release notes

3bcfe0e

mdekstrand merged commit 911c1e1 into lenskit:main Mar 11, 2026
34 checks passed

mdekstrand deleted the feature/flexmf-lightgcn branch March 11, 2026 16:32

mdekstrand added a commit to mdekstrand/lkpy that referenced this pull request Mar 17, 2026

Merge pull request lenskit#1033 from mdekstrand/feature/flexmf-lightgcn

6371482

Add LightGCN support to FlexMF

mdekstrand mentioned this pull request Mar 17, 2026

Backport LightGCN to the 2025 release series #1046

Merged

mdekstrand merged 23 commits into lenskit:main from mdekstrand:feature/flexmf-lightgcn
