Street AI Memory

Street AI Memory is a cross-provider memory layer for LLM applications that reduces prompt bloat as conversations grow. It sits between an app and model providers such as OpenAI, Anthropic, Gemini, DeepSeek, Together, or Groq, stores conversation signals into stacks, decays stale data, and retrieves only relevant context for each turn. The project reports 55–80% input-token reductions in a 16-turn benchmark, with average savings around 68%. It is useful for developers building chatbots, agents, RAG apps, and long-running assistants that need continuity without repeatedly sending the full transcript. The fresh Show HN launch and official GitHub README verify an installable Python package, provider adapters, local embedding model setup, and alpha-stage API notes.

Reader rating

No ratings yet

Visit website

Related tools

View all

codemap

No ratings yet

codemap is an MIT-licensed project brain for AI coding tools that gives LLMs instant architectural context from your codebase without burning tokens. It generates a fast tree/context view, dependency flow, dependency blast-radius analysis, and a layered handoff format for cross-agent continuation, then exposes everything through a JSON context bundle and an MCP server compatible with Claude Code and Codex. A built-in Codex plugin and community skill registry make it easy to install and share. Developers use codemap to onboard agents to large repos in seconds, keep session continuity across handoffs, and scope the impact of a change before running it.

Ollama

No ratings yet

Ollama is a local AI platform for running, managing, and sharing open models on your own machine or private infrastructure. It makes it easy to pull models, serve them through an API, and integrate local inference into developer workflows without relying on a fully managed cloud stack. Teams use Ollama for privacy-sensitive assistants, internal tools, offline experimentation, and rapid testing of open-weight models across laptops, workstations, and servers. It is especially useful for developers, operators, and AI builders who want quick setup with less operational overhead. What makes Ollama distinctive is how approachable it is: it packages model runtime, distribution, and deployment into a streamlined experience that helps people get productive with local AI in minutes instead of spending days on configuration.

FileForge Finder

No ratings yet

FileForge Finder is an AI-powered local file search utility that optimizes search results for developer workflows. It uses natural language processing to understand query intent and prioritize relevant files, code snippets, and documentation. The tool integrates with popular IDEs and terminals to provide instant, context-aware file retrieval, reducing time spent navigating complex project structures. It supports multiple file formats and offers advanced filtering by content type, modification date, and relevance.

From the blog

View all