Compression OCR A Level Computer Science

In a First, Scientists Fully Read a Charred Herculaneum Scroll—Without Ever Opening It

Miraculously, however, a library of ancient scrolls at Herculaneum survived—in a carbonized form so fragile that scholars ...

NYT slams Microsoft for building copyright-infringing supercomputer for OpenAI

In a heavily redacted court filing Thursday, The New York Times proposed to amend its copyright complaint against OpenAI and ...

Tech Times

Baidu OCR Breaks Long-Document Memory Wall: New Architecture Beats DeepSeek

Open-source OCR from Baidu eliminates the GPU memory wall that limits long-document parsing. Unlimited OCR uses a constant KV ...

IEEE

Adaptive Hybrid Framework for Multiscale Void Inspection of Chip Resistor Solder Joints

Abstract: During reflow soldering, voids inevitably emerge inside the solder joints of chip resistors, which will influence the reliability of the electronic device. In this article, an adaptive ...

GitHub

A frontier collection and survey of vision-language model papers and models (GitHub repository)

WeGen: A Unified Model for Interactive Multimodal Generation as We Chat (2025): Paper, Code
VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning (2025): Paper, Website, Code
AHA: A ...

Nature

VI-OCR: “Visually Impaired” optical character recognition pipeline for text accessibility assessment

Table 1 Overview of simulation paradigms for low-vision research, illustrating the shift from perceptual and behavioral modeling to scalable, persona-based simulation enabled by large models. These ...

MIT Technology Review

DeepSeek may have found a new way to improve AI’s ability to remember

Instead of using text tokens, the Chinese AI company is packing information into images. An AI model released by the Chinese AI company DeepSeek uses new techniques that could significantly improve AI ...

Nature

Efficient GPT-4V level multimodal large language model for deployment on edge devices

Multimodal large language models have revolutionized AI research and industry, paving the way toward the next milestone. However, their large sizes and high computational costs restrict deployment to ...

GitHub

shure-dev/Awesome-LLM-Papers-Comprehensive-Topics

Agent, Minecraft Steve-Eye: Equipping LLM-based Embodied Agents wit ...
