Top ML Papers of the Week (April 22 — April 28)

Thongchan Thananate
2 min readApr 30, 2024
  1. Phi-3: Phi-3 is a compact yet powerful language model tailored for mobile devices. Despite its smaller size, it boasts performance levels comparable to larger models such as GPT-3.5, thanks to its training on extensive datasets.
  2. OpenELM: OpenELM is an open-source language model renowned for its efficiency. Utilizing a layer-wise scaling approach, it optimizes parameter allocation, aiming for transparency and reproducibility in AI research. Its focus lies in enhancing accuracy while minimizing the reliance on pre-training tokens.
  3. AutoCrawler: Addressing web automation challenges, AutoCrawler merges large language models with crawlers. Employing a two-stage process, it capitalizes on HTML’s hierarchical structure for improved action generation and error learning.
  4. Self-Evolution of LLMs: This paper explores self-evolution strategies in large language models (LLMs), enabling models to autonomously refine experiences, potentially advancing towards superintelligence. It delineates a conceptual framework for self-evolution in LLMs.
  5. AI-powered Gene Editors: AI is transforming genome editing by enhancing precision, efficiency, and cost-effectiveness. Models like DeepCRISPR and CRISTA predict optimal guide RNAs for CRISPR-Cas systems, facilitating various genome editing techniques.
  6. Make Your LLM Fully Utilize the Context: Introducing FILM-7B, this paper addresses the “lost-in-the-middle” challenge encountered by LLMs in leveraging long contexts effectively. It proposes a novel training methodology emphasizing information retrieval from any position within a lengthy context.

--

--

Thongchan Thananate

People might laugh at it or call it foolish logic, but that’s enough for me. That’s what romanticism is about!