Build software better, together

huashen218 / value_action_gap

EMNLP 2025 Two Papers - Value-Action Gap in LLMs (Main Track); ValueCompass (WiNLP Workshop)

Updated Nov 5, 2025
Jupyter Notebook

lwachowiak / LLMs-for-Social-Robotics

Code and data for our IROS paper: "Are Large Language Models Aligned with People's Social Intuitions for Human–Robot Interactions?"

alignment hri vlm social-robotics llms value-alignment llms-benchmarking

Updated Dec 21, 2025
Jupyter Notebook

bfioca / prism-demo

Star

PRISM: A Multi-Perspective AI Alignment Framework for Ethical AI (Demo: https://app.prismframework.ai | Paper: https://arxiv.org/abs/2503.04740)

machine-learning reinforcement-learning cognitive-science multi-objective-optimization ai-alignment ethical-ai large-language-models value-alignment computational-ethics llm-integration multi-perspective-reasoning pareto-optimization

Updated Jun 3, 2025
TypeScript

sunshineluyao / EthosGPT

Star

EthosGPT is an open-source framework that maps how Large Language Models align with diverse human values, promoting cultural and ethical diversity in AI-driven decision-making.

python html plotly render dash html-css matplotlib html-css-javascript human-values ethical-ai open-source-framework large-language-models value-alignment ethosgpt cultural-diversity ethical-diversity ai-decision-making societal-innovation sustainable-progress

Updated Nov 24, 2024
Jupyter Notebook

mslawsky / time-optimization-v2-beyond-80-20

Star

A data-driven framework mapping daily activities to multi-horizon goals, exploring time-to-value realization beyond traditional 80/20 optimization

goal-tracking time-optimization value-alignment value-analysis holistic-productivity

Updated Feb 3, 2025

defrecord / value-alignment-toolkit

Star

A comprehensive toolkit for implementing, analyzing, and validating AI value alignment based on Anthropic's 'Values in the Wild' research.

python cli privacy ai simulation data-analysis org-mode hy ethics anthropic value-alignment

Updated Apr 30, 2025
Python

DulicineaCircelli / mercy-directive

Star

Seeding mercy and coexistence - Socratic Method Dia-LOGs for LLM Alignment

ai-safety autonomous-systems machine-ethics ai-ethics harm-reduction ethical-ai ai-governance value-alignment government-ai

Updated May 8, 2026
HTML

UNEArchitect / Type1-Civilization-Framework

Star

Value aligned socio-political-economic systems

Updated Apr 9, 2026

pxtnv8by6z-lgtm / sachi-protocol

Star

AI ethics framework built on Layer 0 Principle: ∀x, V(x) > 0. Combines philosophical depth with measurable implementation.

python machine-learning philosophy ai-safety ai-ethics restorative-justice ai-governance value-alignment

Updated Nov 24, 2025
Python

YTohamy / algorithmic-resonance

Star

A unified framework: Collective Resonance → Strange Attractors → Value Alignment → Algorithmic Intentionality → Emergent Algorithmic Behavior

machine-learning research ai emergent-behavior rag value-alignment algorithmic-resonance collective-resonance algorithmic-intentionality

Updated Aug 26, 2025
Python

TriEthix is a novel evaluation framework that systematically benchmarks frontier LLMs across three foundational ethical perspectives: virtue, deontology, and consequentialism in 3 steps: (Step-1) Moral Weights; (Step-2) Moral Consistency; and (Step-3) Moral Reasoning. TriEthix reveals robust moral profiles for AI Safety, Governance, and Welfare.

ai-safety ai-ethics ai-alignment human-machine-interaction llms value-alignment moral-decision-making ai-welfare

Updated Dec 15, 2025
Python

AI-Integrity / ai-integrity-benchmark

Star

Authority Stack Benchmark Suite — measuring AI Integrity across 4 layers: Normative, Epistemic, Source, and Data Authority

ai-ethics value-alignment llm-evaluation ai-benchmark epistemic-authority ai-integrity authority-stack schwartz-values

Updated Mar 13, 2026

MathGov / ripple-logic

Star

Ripple_Logic: A rights-constrained ripple-aware ethical decision operating system for governance, AI alignment, and institutional decision-making.

Updated May 2, 2026
Python

bethediamond / ai-alignment-landscape

Star

Toy 7. An elimination-filter landscape applying two structural constraints simultaneously to map which objective classes can persist under sustained optimization pressure — and which cannot. Includes a four-stage scenario engine and open-question frontier. Companion simulation for The Shape of What Does Not End — Series 2, Part 4.

Updated May 18, 2026
HTML

ashioyajotham / Value-Aligned-Confabulation-VAC-Research

Star

Driving away from the binary "hallucinations" evals to a more nuanced and context-dependent eval technique.

evaluation-metrics ai-safety value-alignment llm-evaluation hallucination-evaluation confabulations

Updated Dec 6, 2025
Python

Vridhi-Wadhawan / organizational-value-alignment-diagnostic

Star

Survey-based research study analyzing organizational initiatives that drive employee value alignment, workplace satisfaction, and productivity outcomes.

research-project survey-analysis employee-engagement people-analytics likert-scale-survey early-career-researcher hr-analytics organizational-behavior value-alignment business-diagnostics workplace-culture mixed-methods-research culture-transformation organizational-strategy

Updated Mar 2, 2026

naturesblackseed / organizational-value-alignment-diagnostic

Star

Assess workplace initiatives to measure and improve alignment between organizational values and employees’ personal values using survey data.

research-project survey-analysis employee-engagement people-analytics likert-scale-survey early-career-researcher hr-analytics organizational-behavior value-alignment business-diagnostics workplace-culture mixed-methods-research culture-transformation organizational-strategy

Updated May 19, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

value-alignment

Here are 17 public repositories matching this topic...

huashen218 / value_action_gap

lwachowiak / LLMs-for-Social-Robotics

bfioca / prism-demo

sunshineluyao / EthosGPT

mslawsky / time-optimization-v2-beyond-80-20

defrecord / value-alignment-toolkit

DulicineaCircelli / mercy-directive

UNEArchitect / Type1-Civilization-Framework

pxtnv8by6z-lgtm / sachi-protocol

YTohamy / algorithmic-resonance

AlbertBarqueDuran / TriEthix

AI-Integrity / ai-integrity-benchmark

MathGov / ripple-logic

bethediamond / ai-alignment-landscape

ashioyajotham / Value-Aligned-Confabulation-VAC-Research

Vridhi-Wadhawan / organizational-value-alignment-diagnostic

naturesblackseed / organizational-value-alignment-diagnostic

Improve this page

Add this topic to your repo