EmojiFold: Emoji Semantic Manifold Discovery

Discover the mathematical structure underlying emoji semantics through systematic analysis of oppositional pairs.

🚀 Quick Start

# Initialize project
cd /Users/rob/repos/emojifold
uv venv
source .venv/bin/activate
uv pip install -e .

# Database location: ~/.emojifold/emojifold.db
# Run small test
emojifold test --pairs 50

# Run full overnight analysis
emojifold batch --model all --output results/

# Calculate model centroids
python calculate_centroids.py --db ~/.emojifold/emojifold.db

🎯 Project Goals

Mass computation of semantic distances between emoji pairs
Discovery of strongest oppositional dimensions in emoji space
Cross-model validation of semantic manifold structure
Efficient storage and analysis of large-scale results

🔬 Key Research Questions

Mathematical vs Visual: How do math-oriented models compare to general embeddings?
Universal Oppositions: Which emoji pairs are strong across all models?
Dimensional Structure: What are the fundamental axes of emoji semantics?

📊 Expected Results

With ~3,000 emoji testing ~9 million pairs overnight, we should discover:

Top 100 strongest oppositional pairs
Model-specific vs universal patterns
Natural clustering of semantic dimensions
Validation of our ⚫⚪ and ➕➖ findings

🛠️ Technical Stack

UV for dependency management
Ollama + HuggingFace for embeddings
Optimized for M2 Ultra (64GB) + optional 4090
SQLite for efficient result storage
Async/parallel processing for maximum throughput

Ready to map the emoji semantic universe! 🌌

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
emojifold		emojifold
.env.example		.env.example
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
calculate_centroids.py		calculate_centroids.py
check_completeness.py		check_completeness.py
check_db.py		check_db.py
check_metrics.py		check_metrics.py
config.yaml		config.yaml
ecosystem.config.json		ecosystem.config.json
ecosystem_v3.config.json		ecosystem_v3.config.json
generate_emoji_db.py		generate_emoji_db.py
pyproject.toml		pyproject.toml
schema_v2.sql		schema_v2.sql
schema_v3_codepoint.sql		schema_v3_codepoint.sql
sequential_beast.py		sequential_beast.py
test_centroids.py		test_centroids.py
test_single_centroid.py		test_single_centroid.py
unicode_emojis.json		unicode_emojis.json
unicode_emojis.sql		unicode_emojis.sql

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EmojiFold: Emoji Semantic Manifold Discovery

🚀 Quick Start

🎯 Project Goals

🔬 Key Research Questions

📊 Expected Results

🛠️ Technical Stack

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

EmojiFold: Emoji Semantic Manifold Discovery

🚀 Quick Start

🎯 Project Goals

🔬 Key Research Questions

📊 Expected Results

🛠️ Technical Stack

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages