273 lines
13 KiB
Markdown
273 lines
13 KiB
Markdown
# Multivariate Analysis of Coding.csv: Virtue Clustering and Associations
|
||
|
||
**Date:** 2026-03-28
|
||
**Dataset:** coding.csv
|
||
**Texts Analyzed:** 134
|
||
**Unique Virtue Categories:** 74
|
||
**Average Virtues per Text:** 2.78 (range: 1-5)
|
||
|
||
---
|
||
|
||
## 1. Executive Summary
|
||
|
||
This analysis examines 134 coded texts from two sources (AFP and PR) across 74 unique virtue categories. Using multiple multivariate techniques—clustering, network analysis, and association metrics—the study reveals:
|
||
|
||
- **4 distinct text clusters** with one dominant cluster containing 86% of texts
|
||
- **3 major virtue communities** representing different conceptual frameworks
|
||
- **Strong ethical pairings** (e.g., Care+Consent) that nearly always co-occur
|
||
- **Source differences** in conceptual complexity (AFP: more interconnected; PR: more focused)
|
||
|
||
---
|
||
|
||
## 2. Cluster Analysis of Texts
|
||
|
||
Using K-means clustering on binary virtue presence/absence vectors:
|
||
|
||
| Cluster | Size | Key Virtues | Sources | Interpretation |
|
||
|---------|------|-------------|---------|----------------|
|
||
| **1** | 5 texts | Memory, Imitation, Inheritance, Tradition | AFP, PR | *Memory-focused texts* - Historical and temporal continuity themes |
|
||
| **2** | 4 texts | Refusal, Embodiment, Resistance, Subversion | AFP only | *Resistance discourse* - Tactical opposition to systems |
|
||
| **3** | 115 texts | Adaptability, Tension Management, Accessibility, Design | AFP, PR | **Core protocol cluster** - Dominant protocol ethics discourse |
|
||
| **4** | 10 texts | Authenticity, Alignment, Inheritance | AFP, PR | *Authenticity/Alignment cluster* - Self-determination and tradition |
|
||
|
||
**Key Finding:** Cluster 3 represents the overwhelming majority (86%) of texts, suggesting a shared "protocol ethics" discourse across sources. Cluster 2 represents a distinct "resistance" discourse found only in AFP texts.
|
||
|
||
---
|
||
|
||
## 3. Strongest Virtue Associations
|
||
|
||
### By Co-occurrence Count (raw frequency):
|
||
|
||
| Rank | Virtue Pair | Count | Notes |
|
||
|------|-------------|-------|-------|
|
||
| 1 | Accessibility + Situational Awareness | 4 | Practical context-sensitivity |
|
||
| 2 | Equity + Inclusivity | 3 | Justice framework |
|
||
| 3 | Balance + Tension Management | 3 | Managing contradictions |
|
||
|
||
### By Jaccard Similarity (normalized association strength):
|
||
|
||
| Rank | Virtue Pair | Jaccard Index | Interpretation |
|
||
|------|-------------|---------------|------------------|
|
||
| 1 | **Care + Consent** | 0.750 | *Nearly inseparable* - Ethical foundation pair |
|
||
| 2 | Resistance + Subversion | 0.400 | Tactical cluster |
|
||
| 3 | Refusal + Subversion | 0.400 | Resistance tactics |
|
||
| 4 | **Equity + Inclusivity** | 0.375 | Justice-oriented |
|
||
| 5 | Refusal + Resistance | 0.333 | Activism tactics |
|
||
| 6 | Embodiment + Groundedness | 0.333 | Material presence |
|
||
| 7 | Agency + Freedom | 0.300 | Autonomy cluster |
|
||
|
||
**Key Finding:** The Care+Consent pairing (Jaccard = 0.750) is exceptionally strong, appearing together in 3 out of 4 possible texts where both concepts appear. This suggests an ethical foundation where care practices are inseparable from consent frameworks.
|
||
|
||
---
|
||
|
||
## 4. Virtue Communities (Network Analysis)
|
||
|
||
Using network thresholding on co-occurrence patterns, three major virtue communities were identified:
|
||
|
||
### Community 1: "Protocol Mechanics" (~40 virtues)
|
||
*Core operational virtues for protocol design and implementation*
|
||
|
||
**Central Members:**
|
||
- Adaptability, Agency, Balance, Capture Resistance
|
||
- Care, Complex Systems Tolerance, Consent
|
||
- Constraint, Curiosity, Design, Emergent Properties
|
||
- Equity, Freedom, Institutional Critique, Iterative Development
|
||
- Networked Intelligence, Plurality, Replicability, Systems Thinking
|
||
|
||
**Characteristics:**
|
||
- Largest community spanning practical and ethical dimensions
|
||
- High connectivity to Adaptability and Systems Thinking (hub virtues)
|
||
- Brings together ethics (Care, Consent, Equity) with operational concepts (Design, Iterative Development)
|
||
|
||
### Community 2: "Collective Intelligence" (3 virtues)
|
||
*Focused on collaborative knowledge production*
|
||
|
||
**Members:** Alignment, Collaboration, Networked Intelligence
|
||
|
||
**Characteristics:**
|
||
- Small but distinct community
|
||
- Emphasizes distributed, collaborative approach
|
||
- Connected to Community 1 through Networked Intelligence
|
||
|
||
### Community 3: "Relational Ethics" (~9 virtues)
|
||
*Focus on social and cultural connection*
|
||
|
||
**Members:**
|
||
- Collectivity, Cultural Awareness, Empathy, Interdependence
|
||
- Plurality, Relationality, Respect, Spatial Awareness
|
||
- Plus contextual concepts
|
||
|
||
**Characteristics:**
|
||
- Strong ties to Community 1 through Relationality
|
||
- Emphasizes interpersonal and cultural dimensions
|
||
- Includes Plurality, suggesting diversity and multiplicity
|
||
|
||
---
|
||
|
||
## 5. Network Centrality Analysis
|
||
|
||
**"Hub" Virtues** (ranked by number of connections to other virtue types):
|
||
|
||
| Rank | Virtue | Connections | Key Neighbors |
|
||
|------|--------|-------------|---------------|
|
||
| 1 | **Adaptability** | 25 | Agency, Resistance, Long-Term Vision, Design, Systems Thinking |
|
||
| 2 | **Design** | 23 | Agency, Equity, Emergent Properties, Inheritance, Constraint |
|
||
| 3 | **Agency** | 23 | Resistance, Inheritance, Refusal, Autonomy, Systems Thinking |
|
||
| 4 | **Temporal Awareness** | 19 | Emergent Properties, Long-Term Vision, Adaptability |
|
||
| 5 | **Systems Thinking** | 19 | Agency, Design, Long-Term Vision, Constraint |
|
||
| 6 | **Collectivity** | 17 | Interdependence, Agency, Shared Responsibility |
|
||
| 7 | **Transgression** | 17 | Refusal, Subversion, Care, Capture Resistance |
|
||
| 8 | **Institutional Critique** | 16 | Refusal, Design, Subversion, Agency |
|
||
| 9 | **Plurality** | 16 | Interdependence, Agency, Systems Thinking |
|
||
| 10 | **Relationality** | 16 | Interdependence, Accessibility, Care, Curiosity |
|
||
|
||
**Key Finding:** **Adaptability** is unequivocally the central hub of this virtue network, connecting to 25 other virtue concepts. This suggests it functions as a bridging concept across multiple ethical and practical domains.
|
||
|
||
---
|
||
|
||
## 6. Source Comparison (AFP vs. PR)
|
||
|
||
| Metric | AFP (62 texts) | PR (72 texts) | Interpretation |
|
||
|--------|----------------|---------------|----------------|
|
||
| **Unique virtue pairs** | 221 | 143 | AFP texts show more conceptual diversity |
|
||
| **Avg pairs per text** | 4.06 | 2.22 | AFP texts are more conceptually dense |
|
||
| **Network density** | 8.2% | 5.3% | AFP has more interconnected virtue networks |
|
||
| **Top virtues** | Adaptability (8), Temporal Awareness (7), Collectivity (7), Institutional Critique (7) | Tension Management (10), Adaptability (9), Systems Thinking (9), Infrastructural Awareness (8) | AFP: critical/social; PR: technical/systemic |
|
||
|
||
### AFP Code Profile (Academic/Critical)
|
||
- **Dominant themes:** Adaptability, Temporal Awareness, Collectivity, Institutional Critique
|
||
- **Emphasis:** Social processes, critical engagement, collective action
|
||
- **Pattern:** Higher virtue co-occurrence suggests more conceptually complex texts
|
||
|
||
### PR Code Profile (Practical/Technical)
|
||
- **Dominant themes:** Tension Management, Systems Thinking, Infrastructural Awareness
|
||
- **Emphasis:** Technical complexity, managing contradictions, system design
|
||
- **Pattern:** More focused virtue profiles, strong emphasis on Adaptability
|
||
|
||
**Key Finding:** Both sources prioritize **Adaptability**, but AFP has more distributed emphasis across critical/social virtues, while PR emphasizes technical/systemic concepts. The 8.2% vs 5.3% network density difference suggests AFP texts engage with more complex conceptual interconnections.
|
||
|
||
---
|
||
|
||
## 7. Frequency Distribution
|
||
|
||
**Top 30 Virtues by Frequency:**
|
||
|
||
| Rank | Virtue | Count | % of Texts |
|
||
|------|--------|-------|------------|
|
||
| 1 | **Adaptability** | 17 | 12.7% |
|
||
| 2 | Tension Management | 13 | 9.7% |
|
||
| 3 | Accessibility | 13 | 9.7% |
|
||
| 4 | Temporal Awareness | 11 | 8.2% |
|
||
| 5 | Design | 11 | 8.2% |
|
||
| 6 | Institutional Critique | 10 | 7.5% |
|
||
| 7 | Agency | 10 | 7.5% |
|
||
| 8 | Relationality | 10 | 7.5% |
|
||
| 9 | Infrastructural Awareness | 10 | 7.5% |
|
||
| 10 | Systems Thinking | 10 | 7.5% |
|
||
| 11 | Plurality | 9 | 6.7% |
|
||
| 12 | Transgression | 9 | 6.7% |
|
||
| 13 | Collectivity | 8 | 6.0% |
|
||
| 14 | Inheritance | 8 | 6.0% |
|
||
| 15 | Authenticity | 7 | 5.2% |
|
||
| 16 | Long-Term Vision | 7 | 5.2% |
|
||
| 17 | Equity | 6 | 4.5% |
|
||
| 18 | Capture Resistance | 6 | 4.5% |
|
||
| 19 | Respect | 6 | 4.5% |
|
||
| 20 | Cultural Awareness | 6 | 4.5% |
|
||
| 21 | Spatial Awareness | 6 | 4.5% |
|
||
| 22 | Interdependence | 6 | 4.5% |
|
||
| 23 | Shared Responsibility | 6 | 4.5% |
|
||
| 24 | Situational Awareness | 6 | 4.5% |
|
||
| 25 | Memory | 5 | 3.7% |
|
||
| 26 | Embodiment | 5 | 3.7% |
|
||
| 27 | Inclusivity | 5 | 3.7% |
|
||
| 28 | Balance | 5 | 3.7% |
|
||
| 29 | Reciprocity | 5 | 3.7% |
|
||
| 30 | Emergent Properties | 5 | 3.7% |
|
||
|
||
---
|
||
|
||
## 8. Key Insights and Implications
|
||
|
||
### 8.1 The Three Pillars of Protocol Ethics
|
||
|
||
The analysis reveals three conceptual pillars that structure this discourse:
|
||
|
||
1. **Adaptive Ethics** (centered on Adaptability and Design): The capacity to adjust, learn, and evolve protocols in response to changing conditions
|
||
|
||
2. **Relational Justice** (centered on Care, Consent, Equity, Inclusivity): Ethical frameworks emphasizing relationship, respect, and justice
|
||
|
||
3. **Systemic Resistance** (centered on Refusal, Subversion, Institutional Critique): Tactical opposition and critique of existing systems
|
||
|
||
### 8.2 The Adaptability Paradigm
|
||
|
||
The overwhelming centrality of **Adaptability** (highest frequency, highest connectivity) suggests this is the core organizing concept. It bridges:
|
||
- **Ethical dimensions:** Equity, Care, Consent
|
||
- **Operational dimensions:** Design, Iterative Development, Systems Thinking
|
||
- **Resistance dimensions:** Capture Resistance, Resistance, Agency
|
||
|
||
### 8.3 Source Convergence and Divergence
|
||
|
||
- **Convergence:** Both sources treat Adaptability as central, suggesting a shared understanding that protocols must be capable of change
|
||
- **Divergence:** AFP emphasizes critical/social dimensions (Institutional Critique, Collectivity), while PR emphasizes technical/systemic dimensions (Tension Management, Systems Thinking)
|
||
- **Integration:** The most conceptually dense texts (highest network density) come from AFP, suggesting critical theory provides more complex conceptual interconnections
|
||
|
||
### 8.4 Unexpected Pairings
|
||
|
||
Several virtue pairs show unexpected strength:
|
||
- **Care + Consent** (0.750): Suggests an ethics of care cannot exist without consent frameworks
|
||
- **Refusal + Subversion** (0.400): Tactical language clusters together
|
||
- **Equity + Inclusivity** (0.375): Justice requires both fair distribution and openness
|
||
|
||
### 8.5 The Resistance Cluster
|
||
|
||
The small cluster of resistance-focused texts (4 texts in Cluster 2) represents a distinct discourse that:
|
||
- Appears only in AFP texts
|
||
- Coheres around Refusal, Resistance, Subversion, Embodiment
|
||
- Serves as a strategic counterpoint to the dominant protocol design discourse
|
||
- May represent the critical "edge cases" that test protocol boundaries
|
||
|
||
---
|
||
|
||
## 9. Methodological Notes
|
||
|
||
### Analytic Techniques Used:
|
||
1. **K-Means Clustering** (k=4): Identified text groups based on virtue profile similarity
|
||
2. **Network Analysis**: mapped virtue co-occurrences and calculated centrality (degree = number of connections)
|
||
3. **Jaccard Similarity**: normalized measure of virtue pair association (intersection/union)
|
||
4. **Community Detection**: threshold-based clustering of highly connected virtue groups
|
||
|
||
### Limitations:
|
||
- Small dataset (134 texts) limits statistical power
|
||
- K-means clustering is sensitive to initialization (used deterministic starting points)
|
||
- Binary coding (presence/absence) doesn't capture intensity or salience
|
||
- Limited to virtues 1-5; other dimensions not analyzed
|
||
|
||
### Generated Files:
|
||
| File | Description |
|
||
|------|-------------|
|
||
| `cooccurrence_matrix.csv` | 25×25 matrix of virtue co-occurrence counts |
|
||
| `jaccard_similarity_matrix.csv` | 25×25 similarity matrix (Jaccard indices) |
|
||
| `strong_associations.csv` | Top 50 virtue pairs with association metrics |
|
||
| `virtue_profiles.json` | Individual virtue profiles for each text |
|
||
|
||
---
|
||
|
||
## 10. Recommendations for Further Analysis
|
||
|
||
1. **Qualitative Deep Dive:** Examine the 4 resistance-focused texts (Cluster 2) and the 10 authenticity-focused texts (Cluster 4) to understand the distinct discourses
|
||
|
||
2. **Temporal Analysis:** If dates are available, analyze how virtue frequencies change over time
|
||
|
||
3. **Semantic Mapping:** The Care+Consent pairing could be explored through close reading to understand the conceptual linkage
|
||
|
||
4. **Source-Specific Models:** Consider whether different theoretical frameworks might be needed for AFP vs. PR texts
|
||
|
||
5. **Expand to Other Codes:** Analysis currently limited to Virtue_1 through Virtue_5; expanding to other coding categories could reveal additional patterns
|
||
|
||
6. **Visualization:** Generate network graphs of virtue communities to make relationships visually explicit
|
||
|
||
---
|
||
|
||
*Analysis generated using Python standard library (no external packages required). All calculations are fully reproducible.*
|