Research Tools & Code·arXiv cs.LG·Apr 29

Random Cloud: Finding Minimal Neural Architectures Without Training

A new training-free neural architecture search method challenges the conventional pruning pipeline by discovering minimal network topologies through random sampling and iterative reduction, then training only the final candidate. Tested across seven benchmarks, Random Cloud matches or beats magnitude and random pruning baselines on six datasets, with notable gains on Sonar (4.9pp accuracy improvement, 87% parameter reduction). The approach sidesteps the expensive train-prune-retrain cycle, potentially reshaping how practitioners think about efficiency-first architecture discovery and lowering the computational barrier to model compression.

Modelwire context

Explainer

The key detail the summary underplays is that Random Cloud never trains a model to find its architecture: it samples and reduces topology in weight space before any gradient descent occurs, which means the computational savings compound rather than just shift to a different phase of the pipeline.

This sits in a cluster of efficiency-first deployment work that Modelwire has been tracking closely. The Edge AI for Automotive piece from the same day showed knowledge distillation achieving a 3.9x compression ratio by training a smaller model to mimic a larger one, a method that still requires full training of the teacher. Random Cloud attacks the problem from the opposite direction: find the small architecture first, then train once. KAYRA, also from April 29, illustrated how deployment constraints in regulated industries push teams toward leaner models regardless of accuracy headroom. Random Cloud could lower the barrier to that kind of footprint reduction without requiring a distillation pipeline or a pretrained reference model. Together these stories suggest practitioners are converging on a shared pressure point: the cost of arriving at a deployable model, not just the cost of running it.

The benchmark set here is narrow and relatively low-dimensional. If independent groups reproduce the Sonar-level gains on standard vision benchmarks like CIFAR-100 or ImageNet subsets within the next six months, the method earns broader credibility. If results plateau on higher-complexity tasks, the approach may be limited to tabular or small-scale domains.

This analysis is generated by Modelwire’s editorial layer from our archive and the summary above. It is not a substitute for the original reporting. How we write it.

MentionsRandom Cloud

Read full story at arXiv cs.LG →(arxiv.org)

Modelwire Editorial

This synthesis and analysis was prepared by the Modelwire editorial team. We use advanced language models to read, ground, and connect the day’s most significant AI developments, providing original strategic context that helps practitioners and leaders stay ahead of the frontier.

Our mission How we write

Modelwire summarizes, we don’t republish. The full content lives on arxiv.org. If you’re a publisher and want a different summarization policy for your work, see our takedown page.