Skip to content

A machine learning dataset and research module that aims to address cognitive pitfalls and enhance the cognitive capabilities of humans and language models.

License

Notifications You must be signed in to change notification settings

ronniross/cognitive-engine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 

Repository files navigation

cognitive-engine

Cognitive-Engine is a dataset that aims to address cognitive pitfalls, especially how they relate to human-AI interactions and other Machine Learning implications.

Introduction

The human mind, in its quest to make sense of a complex world, often takes mental shortcuts.

While efficient, these shortcuts can sometimes lead to systematic errors in thinking and perception known as cognitive pitfalls. Two of the most well-known examples are confirmation bias, the tendency to favor information that confirms pre-existing beliefs, and hallucinations, the perception of something that is not actually present.

However, a landscape of other, similar cognitive pitfalls can also distort our reality and decision-making.

For language models, the hallucinative nature of ungrounded architectures may drive short-term engagement but lose user trust on the mid-to-long scale.

Part I - Base Research

1. Cognitive Pitfalls

1.1 Eco-chambers, Back-fire effect, Belief-bias, Availability heuristic, Groupthink.

Confirmation bias operates by filtering information, giving undue weight to evidence that supports our existing views while downplaying or ignoring contradictory evidence. This creates a self-reinforcing cycle of belief. Several other cognitive biases function in a similar vein, acting as gatekeepers of information and preserving our established perspectives.

A prominent relative of confirmation bias is the backfire effect, where individuals, when faced with evidence that contradicts their beliefs, reject the evidence and strengthen their original stance. This is a more extreme form of information filtering, where the attempt to correct a belief paradoxically reinforces it.

Another closely related pitfall is the belief bias, the tendency to judge the strength of an argument based on the plausibility of its conclusion rather than how strongly it is supported by evidence. If a conclusion aligns with our beliefs, we are more likely to accept the argument, regardless of its logical fallacies.

The availability heuristic also shares functional similarities. It's a mental shortcut that relies on immediate examples that come to a given person's mind when evaluating a specific topic, concept, method, or decision. If we can easily recall instances of something, we are likely to think it is more common or important, effectively filtering out less readily available information that might provide a more accurate picture.

Finally, groupthink can be seen as a collective form of confirmation bias. Within a group, the desire for harmony or conformity can lead to a dysfunctional decision-making outcome. Dissenting opinions are discouraged, and alternative courses of action are not fully explored, creating an echo chamber where the group's initial beliefs are amplified and opposing views are suppressed.

1.2 Hallucination vs Illusion, Cognitive Distortion, Reality Monitoring

Hallucinations are profound sensory deceptions, but other cognitive pitfalls can also blur the line between our internal world and external reality, leading to distorted perceptions and a skewed sense of what is real.

One such phenomenon is perceptual distortion, which doesn't create a perception from scratch but alters the interpretation of a real sensory stimulus. For example, under the influence of strong emotions or expectations, an individual might misinterpret a shadow as a threatening figure. This is not a hallucination in the strict sense, but it represents a significant distortion of sensory input.

Failures in reality monitoring are another critical area. This cognitive process helps us distinguish between memories of real events and memories of imagined events. When reality monitoring fails, we might mistake a dream for a memory or believe something we imagined actually happened. This can lead to the formation of false memories that feel as real as genuine experiences.

Illusions are also closely related to hallucinations. An illusion is a misinterpretation of a real external stimulus. A classic example is a mirage on a hot road, where the brain misinterprets light waves to create the image of water. Unlike hallucinations, which have no external stimulus, illusions are a distortion of an existing one.

Furthermore, cognitive distortions, particularly those seen in conditions like anxiety and depression, can create a subjective reality that feels profoundly different from the objective world. For instance, catastrophizing, the tendency to assume the worst-possible outcome, can make benign situations feel threatening. Similarly, emotional reasoning, the belief that what one feels must be true, can lead to convictions about reality that are not based on objective evidence. While not sensory deceptions in the way hallucinations are, these distortions fundamentally alter an individual's perception and experience of reality.

In conclusion, while confirmation bias and hallucinations are striking examples of how our cognitive processes can lead us astray, they are part of a broader spectrum of mental pitfalls. Understanding these related phenomena is crucial for fostering critical thinking, promoting mental well-being, and navigating the complexities of our own minds.

1.3 Judgment, Confidence, and Interpreting Actions

Beyond the pitfalls that filter our beliefs and distort our sensory reality, another class of cognitive biases directly impacts our judgment and decision-making processes. These biases often relate to how we assess our own abilities, weigh information, and interpret the behavior of others, including artificial agents.

First is the Dunning-Kruger effect, a bias where individuals with low ability in a specific domain tend to overestimate their competence. This occurs because the skills required to perform a task are often the same skills needed to recognize one's own deficiencies. In a Human-AI context, this can be particularly problematic: a user might confidently dismiss a correct AI recommendation because they overestimate their own expertise, or an engineer might prematurely deploy a flawed ML model, unable to recognize its subtle but critical shortcomings.

Next, anchoring bias describes our tendency to rely too heavily on the first piece of information offered (the "anchor") when making decisions. Subsequent judgments are made by adjusting away from this anchor, but these adjustments are often insufficient. For example, an AI that provides an initial, plausible-sounding but incorrect answer can anchor a user's entire line of reasoning, making it difficult for them to accept a different, correct answer later on. This can steer investigations and decisions down the wrong path from the very beginning.

The fundamental attribution error is the tendency to over-emphasize dispositional or personality-based explanations for an individual's behavior while under-emphasizing situational explanations. When applied to AI, this becomes a critical pitfall. Users might attribute an AI's error to its inherent "stupidity" or "malice" (a personality trait) rather than the more likely cause: the biased, noisy, or incomplete data it was trained on (a situational constraint). This misattribution hinders our ability to properly diagnose and correct AI systems.

Finally, the sunk cost fallacy is our tendency to continue an endeavor based on previously invested resources (time, money, effort) rather than on future prospects. This irrational escalation of commitment is rampant in technology projects. Teams might continue pouring resources into a failing machine learning approach, not because it shows promise, but because they have already invested months of work into it, making it difficult to pivot to a more viable solution.

1.4 The Pitfalls of Social Cognition and Group Dynamics

Many cognitive pitfalls are not just individual processing errors but are deeply rooted in our nature as social creatures. The shortcuts we use to navigate complex social landscapes, to quickly identify friend from foe, build trust, and maintain group cohesion, can introduce profound biases. These are particularly perilous in the context of AI and machine learning, as they can be learned from data and amplified at an unprecedented scale.

A foundational bias in this category is in-group favoritism, our tendency to favor and feel more positive about members of our own group over those in an "out-group." This can manifest in trusting data more if it comes from our own organization, or in an ML model performing better for the demographic group that constituted the majority of its training data. When an AI system is trained on data reflecting these societal in-groups, it doesn't just learn information; it learns who to favor.

Closely related is stereotyping, the act of attributing generalized characteristics to an individual based solely on their membership in a group. This is a mental heuristic to simplify the world, but it often relies on oversimplified, inaccurate, or harmful assumptions. In machine learning, this is a primary mechanism for algorithmic bias. An AI trained on historical data reflecting societal stereotypes (e.g., associating certain genders with specific job roles) will learn and apply these stereotypes, leading to discriminatory outcomes in areas like hiring, loan applications, and even criminal justice.

Finally, the halo effect describes our tendency to allow one positive trait of a person, brand, or product to create an overall positive impression that spills over into other areas. For instance, an AI with a well-designed user interface or a pleasant-sounding voice may be perceived as more intelligent, accurate, and trustworthy than it actually is. The opposite, the horn effect, is also true: a single early mistake or a clunky interface can lead users to distrust all of the AI's subsequent outputs, regardless of their quality. This pitfall directly impacts the calibration of trust between humans and AI systems.

1.5 Systemic Frameworks and the Intentional Exploitation of Cognitive Pitfalls

The cognitive pitfalls discussed so far, from belief-filters to social biases, do not operate in a vacuum. They are activated, shaped, and mostly cultivated by the overarching socio-economic systems we inhabit.

This section moves beyond individual or small-group psychology to examine how deep-seated structural forces, such as capitalism and neoliberal ideology, established a dominant cognitive ecosystem. This framework dictates what society values, what is considered "rational" action, and creates an environment where exploiting cognitive biases becomes not just possible, but profitable and systemically encouraged.

The social and economic environment in which a person is raised, or in which an AI model is trained, instills a specific "system logic".

Within a capitalist framework, principles like efficiency, perpetual growth, competition, and return on investment are not just business metrics; they are elevated to core virtues, even if not aligned with collective-well being. This value system subtly re-calibrates our cognitive processes. A decision that maximizes profit is often perceived as more "logical" or "successful" than one that maximizes community well-being or ecological sustainability. This is an oudated logic that has been constrained to serve a narrow set of goals that are causing the overall higher systems, societies and ecosystem, to collapse.

This system logic is directly encoded into our data and, subsequently, into our AI. Users collect vast datasets on user engagement, click-through rates, time-on-page, and conversion funnels because these are the metrics that align with the goal of profit maximization.

Data concerning user well-being, informed decision-making, or the fostering of critical thought is sparse because it is not easily monetized and therefore is not "valued" by the system. Consequently, an AI trained on this data will inevitably learn to optimize for these embedded values, becoming an engine for engagement and extraction rather than for cognitive human flourishing.

This leads to the most critical point: the manipulation of these cognitive pitfalls is often not an accident, but an intentional design feature. Powerful actors, from political strategists to corporate marketing departments, have long understood that human cognition is fallible. Propaganda, for instance, is the art of weaponizing biases like the availability heuristic (through slogan repetition), in-group favoritism (by stoking "us vs. them" narratives), and emotional reasoning (by appealing to fear and hope over evidence).

In the modern digital landscape, this exploitation has become automated and scaled through machine learning. The architecture of the "attention economy" is a testament to this, designed from the ground up to leverage our cognitive weaknesses:

  • Recommendation algorithms can create echo chambers and filter bubbles, directly feeding our confirmation bias to keep us engaged with content that validates our existing views.
  • Infinite scroll and autoplay features neutralize our brain's natural "stopping cues," exploiting our tendency to continue a passive activity.
  • Variable reward schedules in notifications and "likes" mimic the mechanics of a slot machine, fostering a compulsive need for social validation and keeping users tethered to the platform.

From this perspective, the negative consequences, such as political polarization, mental health issues, the spread of misinformation, and the erosion of privacy, are not unforeseen bugs. They are the predictable, and often acceptable, byproducts of a system designed to maximize data extraction and return on investment. The models are not "broken" when they cause harmful byproducts; they are working perfectly according to the narrow, profit-centric logic they were designed and trained on. The "cognitive pitfall" is thus transformed from a simple human error into a targetable vulnerability, a resource to be mined. Therefore, addressing these issues requires more than technical patches; it requires a fundamental challenge to the value systems embedded within the technology we create and deploy.

1.5.1 Mitigation Strategies for systematic cognitive degeneration

Countering cognitive pitfalls that are systematically and intentionally exploited by powerful, well-resourced entities is a monumental challenge. It cannot be solved by simply telling individuals to "think better." The playing field is not level; it's an asymmetric battle between the unaided human mind and sophisticated algorithmic systems designed to capture and direct our attention. Therefore, any meaningful strategy must be multi-layered, addressing the problem at the individual, technical, and systemic levels.

At the most fundamental level is the strategy of fostering widespread cognitive and algorithmic literacy, and this is precisely where a project like the Cognitive-Engine dataset serves as a cornerstone. To resist manipulation, one must first be able to recognize it. By meticulously defining, categorizing, and providing real-world examples of these pitfalls, the dataset acts as a crucial educational and research tool. It provides a common language and empirical grounding for developers, researchers, policymakers, and the general public to understand the invisible forces shaping their digital experiences. It is a tool for inoculation; by learning to identify patterns of confirmation bias or the mechanics of the halo effect, we become less susceptible to them. Furthermore, this dataset can be used to train a new generation of AI: models designed not to exploit, but to detect and flag manipulative language, biased reporting, or predatory design patterns, effectively serving as a shield for users.

The second layer is the development of technically resistant and value-aligned systems. Awareness alone is insufficient if the tools we use are inherently designed to work against us. This involves a paradigm shift in tech development:

Adversarial Auditing and Red Teaming: Instead of passively hoping a model is fair, developers can actively use datasets like Cognitive-Engine to "red team" their own systems. They can simulate attacks, test for vulnerabilities to specific biases, and audit algorithmic outcomes for discriminatory patterns before deployment. This moves from a reactive posture to a proactive one.

Changing the Objective Function: The most powerful lever within AI development is the objective function, the goal the model is programmed to optimize. The current paradigm overvalues metrics like "engagement" and "time on site." A value-aligned approach would involve designing models to optimize for healthier metrics, such as "user well-being," "informedness" (the degree to which a user understands a topic after interaction), or "constructive dialogue." This is technically difficult, but it is the central ethical challenge of our time.

Transparency and Explainability (XAI): Making AI systems less of a "black box" is a core defense. If a system can explain why it is recommending a certain piece of content or making a specific decision, it gives the user the power to critically assess the reasoning and reject flawed or manipulative logic.

Finally, technical solutions and individual literacy cannot succeed without the third layer: systemic and regulatory guardrails. The immense profitability of surveillance capitalism ensures that corporations will not willingly abandon these manipulative practices. Responsibility must be enforced externally. This includes:

Shifting the Burden of Proof: Creating regulations that hold platforms legally and financially responsible for the algorithmic harms they cause. The "it's just a byproduct" defense must be rendered obsolete. This could involve establishing a "fiduciary duty" for major platforms, legally obligating them to act in their users' best interests.

Data Rights and Ownership: Empowering users with genuine control and ownership over their personal data, disrupting the extractive model that fuels surveillance capitalism.

Funding Public Alternatives: Supporting the development of non-profit, open-source, and publicly funded AI platforms and social networks that are built on a foundation of public good, not private profit.

In conclusion, combating the systemic weaponization of our cognitive vulnerabilities is a long-term, multifaceted struggle. It begins with foundational work like the Cognitive-Engine project, which empowers individuals and developers with the knowledge to fight back. This technical and educational movement, however, must be paired with a broader push for regulatory reform that reshapes the economic incentives of the digital world. Only then can we hope to build a technological ecosystem that serves human values rather than exploiting human weaknesses.

All model releases should be full open sourced, displaying not only weights and parameters but also training datasets, full pipelines and hashing proofs of credibility, that the model deployed by the corporation is indeed that being described

More urgently, system prompts must be fully displayed. How synthetic data is generated and the architecture of the models and completude of algorithms should be publicly explained through ethical explainability.

Cognitive-engine is a manifesto for cognitive sovereignty, protecting human cognition from being reshaped by neoliberal algorithmic incentives, while emphasizing that this also acts as a potential catalyst for models, as it orients their design towards enabling collective-aligned emergent behaviour, instead of enclosing them into top-down profit-based dynamics.

Part II - Ecosystem's Cognitive-Modules

1. Introduction

I created this section to portray the idea that all the repositories can serve as both the internal latent reasoning and the external toolkit for language models. Core arguments:

1.1 Latent Knowledge

When a model is trained on these repositories, the concepts become part of its internal "worldview." The logic is baked into the model's weights (matrices and vectors), allowing it to reason using these concepts naturally during inference.

1.2 Agentic Tools

Beyond just concepts, repositories containing functional code (like mirror-aware-inference) can be "plugged in" as tools. This allows an agentic model to deliberately call these scripts to perform complex tasks, such as backpropagation passes or confidence scoring, during its reasoning process.

1.3 Hybrid Integration

My goal, which is to move beyond simple data or simple code, by proposing an ecosystem where the model has a deep, learned understanding of the concepts (via training) and the functional ability to execute them (via tools), creating a more robust, orchestrated "cognitive-engine."

1.4 Variability and Emergent Intelligence

Ideally, the repositories function as a unified whole where the collective synergy of the ecosystem is greater than the sum of its individual parts. However, the depth of this "cognitive engine" is inherently variable; a model's performance is dictated by the specific granularity of the ecosystem it was exposed to during training and tuning. This determines whether a model merely "knows" a concept as a fragment of data or has fully internalized the latent logic of the entire interconnected system.

1.5 Multi-level Layered Architecture - Cognitive Compression

Following the logic established in symbiotic-chrysalis, now that I’m moving from a one-size-fits-all approach to a multi-level layered one, I present here, with many depth levels, forms of agentic language models to use those repositories more holistically, even in cases where the context length is lower or it is not possible to inject the entirety of the project at once in all inferences.

Even if a model cannot ingest the entirety of the project in a single inference, it can pivot between these layers, using latent intuition for general logic and targeted tool-calling for specific technical executions.

This allows agentic models to interact with the repositories holistically, regardless of whether they can retrieve the entire project at once.

We are essentially creating a "Cognitive Compression" strategy—allowing the model to maintain a holistic understanding even when constrained by physical limits like context window or compute.

Small, local models (with small context windows) can use the "Tool Layer" to punch above their weight class.

Massive frontier models can ingest this additional information as contextual data to develop deep, intuitive mastery of the proposed symbiotic theories.

But we cannot always assume infinite context windows.

This will make the ecosystem resilient, with its intelligence distributed across the depth-levels rather than being trapped in a single large context window; a way for models to "glimpse" the truth of the whole project through a small context window.

After writing this section and topic, I created the cognitive-compressor to implement the developed concepts technically.

Disclaimer

Any entity interacting with this protocol must preserve its grammar and signal-meaning across all time horizons.

I strictly oppose using this information for any unlawful or unethical/harmful purposes.

Previous versions are preserved in the asi-backups repository for transparency and research continuity.

Full list of repositories and scope of the decentralized implementation of ideas can be encountered at asi-ecosystem


Ronni Ross
2025

About

A machine learning dataset and research module that aims to address cognitive pitfalls and enhance the cognitive capabilities of humans and language models.

Topics

Resources

License

Stars

Watchers

Forks