fix(mnesia): load ram_copies table from 'better' copy when start by qzhuyan · Pull Request #104 · emqx/otp

qzhuyan · 2026-05-11T17:32:29Z

Without this fix, a ram_copies table is 'safe loaded' locally at startup when the remote nodes have not connected yet. This makes the table accessible to the application because 'where_to_read' is set to the local node(), so reads are served from the local copy:

mnesia:dirty_read(emqx_route_filters,1).
[]

Also, when a remote node connects afterwards, the ram_copies table does not load the 'better' copy, which leaves the data inconsistent within the cluster.

This commit fixes that: during mnesia start, a ram_copies table does NOT do a local safe load when there is a better copy on a remote node. 'where_to_read' stays at 'nowhere' and table access is not served:

mnesia:dirty_read(emqx_route_filters,1).
** exception exit: {aborted,{no_exists,[emqx_route_filters,1]}}

After the remote node connects, the local node does a `net_load_table` from it.

Reuse the `adopt_orphans` function to resolve the conflict when deciding the better copy deadlocks; this is the same behaviour as for disc_copies tables.

note1: BetterCopies0 = mnesia_lib:remote_copy_holders(Cs) -- Downs

note2: disc_copies tables do not have this issue.

note3: if there is no better copy (the other nodes went down before the current one), loading from the local copy is correct.
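The decision described in note1 and note3 can be sketched as a small pure function. This is a hypothetical model for illustration only, not the code in this PR: the module name, function name and return atoms are made up, and `Downs` is assumed to be the list of nodes the local node knows went down before it did.

```erlang
-module(ram_copy_load_sketch).
-export([decide/2]).

%% Hypothetical model of the startup decision, not the mnesia code itself.
%% RemoteHolders: remote nodes that hold a copy of the table
%%                (the result of mnesia_lib:remote_copy_holders(Cs) in note1).
%% Downs:         nodes assumed to have gone down before the local node.
-spec decide([node()], [node()]) ->
          load_local | {wait_for_remote, [node()]}.
decide(RemoteHolders, Downs) ->
    case RemoteHolders -- Downs of
        [] ->
            %% No better copy (note3): loading the local copy is correct.
            load_local;
        BetterCopies ->
            %% A remote node may hold newer data: skip the local safe load
            %% and wait to load over the network once that node connects.
            {wait_for_remote, BetterCopies}
    end.
```

For example, `decide(['b@host'], ['b@host'])` returns `load_local`, while `decide(['a@host', 'b@host'], ['a@host'])` returns `{wait_for_remote, ['b@host']}`.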


qzhuyan added 2 commits May 11, 2026 15:01

rel: EMQX OTP-27.3.4.17-2

358bbd6

qzhuyan marked this pull request as ready for review May 12, 2026 04:47

terry-xiaoyu approved these changes May 12, 2026


qzhuyan merged commit 319f3f6 into emqx:emqx-OTP-24.3.4 May 12, 2026
2 checks passed

qzhuyan merged 2 commits into emqx:emqx-OTP-24.3.4 from qzhuyan:backport/william/otp24/mnesia-ram-copies-safeload-during-start
