Latest AI News

Dynamic Context Evolution for Scalable Synthetic Data Generation

arXiv:2604.07147v1 Announce Type: cross Abstract: Large language models produce repetitive output when prompted independently across many batches, a phenomenon we term cross-batch mode collapse: the progressive loss of output diversity when a language model is prompted repeatedly without access to its prior generations. Practitioners have long mitigated this with ad hoc deduplication and seed rotation, but no principled framework exists. We introduce Dynamic Context Evolution (DCE), comprising three mechanisms: (1) verbalized tail sampling (the model labels each idea with a guess about how obvious it is, and obvious ideas are discarded), which filters high-probability candidates via model self-assessment; (2) semantic memory, which maintains a persistent embedding index to reject near-duplicates across batches; and (3) adaptive prompt evolution, which reconstructs the generation prompt each batch using memory state and rotating diversity strategies. In experiments across three domains (sustainable packaging concepts, educational exam questions, and creative writing prompts) and two model families (gpt-5-mini and claude-haiku-4-5), a component ablation across 2-3 random seeds per method shows that DCE achieves 0.0 +/- 0.0% collapse versus 5.6 +/- 2.0% for naive prompting, while producing 17-18 HDBSCAN clusters per seed versus naive's volatile 2-17, indicating reliably richer conceptual structure. These results are validated with an independent embedding model (all-MiniLM-L6-v2) and hold across sensitivity sweeps of the VTS threshold tau and dedup threshold delta. Deduplication and prompt evolution are individually insufficient but jointly effective, at approximately $0.50 per 1,000 candidates using only standard API calls, with no fine-tuning or custom architectures required.

Latest News

Benchmarking LLM Tool-Use in the Wild

Dynamic Context Evolution for Scalable Synthetic Data Generation

Latest News

Benchmarking LLM Tool-Use in the Wild

Dynamic Context Evolution for Scalable Synthetic Data Generation

Hybrid CNN-Transformer Architecture for Arabic Speech Emotion Recognition

Learning to Negotiate: Multi-Agent Deliberation for Collective Value Alignment in LLMs

Evaluating In-Context Translation with Synchronous Context-Free Grammar Transduction

Demystifying OPD: Length Inflation and Stabilization Strategies for Large Language Models

Guardian-as-an-Advisor: Advancing Next-Generation Guardian Models for Trustworthy LLMs

HyperMem: Hypergraph Memory for Long-Term Conversations

Weakly Supervised Distillation of Hallucination Signals into Transformer Representations

Consistency-Guided Decoding with Proof-Driven Disambiguation for Three-Way Logical Question Answering

Extracting Breast Cancer Phenotypes from Clinical Notes: Comparing LLMs with Classical Ontology Methods

SensorPersona: An LLM-Empowered System for Continual Persona Extraction from Longitudinal Mobile Sensor Streams

A Comparative Study of Demonstration Selection for Practical Large Language Models-based Next POI Prediction

Environmental, Social and Governance Sentiment Analysis on Slovene News: A Novel Dataset and Models

FMI@SU ToxHabits: Evaluating LLMs Performance on Toxic Habit Extraction in Spanish Clinical Texts

Say Something Else: Rethinking Contextual Privacy as Information Sufficiency

Do We Need Distinct Representations for Every Speech Token? Unveiling and Exploiting Redundancy in Large Speech Language Models

A Parameter-Efficient Transfer Learning Approach through Multitask Prompt Distillation and Decomposition for Clinical NLP

MedConclusion: A Benchmark for Biomedical Conclusion Generation from Structured Abstracts

Luwen Technical Report