Written by asad
Last updated 2 months ago
Once you’ve imported your chat history, you can use it to add relevant context to your new AI conversations.

See It In Action

Here’s what Chat History Memory does for your conversations:
  • Without MemoryPlugin: The AI has no context about your past conversations and gives generic responses.
  • With MemoryPlugin: The AI sees relevant context from your chat history and provides personalized, informed responses based on your previous discussions.

How It Works Behind the Scenes

When chat history context is requested, MemoryPlugin performs intelligent retrieval and summarization:
  1. Query Understanding - Your query is analyzed by an AI to generate multiple search variations and extract temporal filters (like “last month”)
  2. Hybrid Search - Runs semantic (meaning-based) and keyword searches in parallel across your chat history
  3. Reranking - Results are reranked by relevance to your specific query
  4. Context Expansion - For each match, surrounding messages are fetched for complete context
  5. Intelligent Summarization - Expanded context is summarized to fit your token budget (e.g., 50,000 tokens of raw conversation might be summarized into 2,000 tokens)
This isn’t just dumping raw chunks—it’s intelligently finding, expanding, and condensing the most relevant parts of your conversations.
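To make the temporal filter in step 1 concrete, here is a minimal sketch of turning a phrase like "last month" into a date range. MemoryPlugin uses an AI model for this step; the rule-based function below is only an illustration, and `extract_temporal_filter` and `last_month_range` are hypothetical names, not part of MemoryPlugin.

```python
from datetime import date, timedelta


def last_month_range(today: date) -> tuple[date, date]:
    """Return the first and last day of the calendar month before `today`."""
    first_of_this_month = today.replace(day=1)
    last_of_previous = first_of_this_month - timedelta(days=1)
    return last_of_previous.replace(day=1), last_of_previous


def extract_temporal_filter(query: str, today: date):
    """Hypothetical helper: map one known phrase to a date range, else None."""
    if "last month" in query.lower():
        return last_month_range(today)
    return None


# A query asked on 15 March 2024 is limited to February 2024.
date_range = extract_temporal_filter("What did I ask about React last month?", date(2024, 3, 15))
```

A filter like this would narrow the search to messages inside the range before any ranking happens.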
Each inject call typically takes 3-4 seconds to complete all these steps.
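Steps 2 to 4 can be pictured with a small sketch. The source does not say how MemoryPlugin merges semantic and keyword results, so the code below assumes reciprocal rank fusion, a common way to combine ranked lists. The function names and the `window` parameter are illustrative assumptions, and summarization is left out because it needs a language model.

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Merge several ranked lists of message ids into one ranking (assumed technique)."""
    scores = {}
    for ranking in rankings:
        for rank, msg_id in enumerate(ranking, start=1):
            scores[msg_id] = scores.get(msg_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first; ties are broken by id so the order is deterministic.
    return sorted(scores, key=lambda m: (-scores[m], m))


def expand_context(hit_index, messages, window=1):
    """Return the matched message plus `window` neighbours on each side."""
    start = max(0, hit_index - window)
    end = min(len(messages), hit_index + window + 1)
    return messages[start:end]


semantic_hits = [3, 1, 2]  # message ids ranked by meaning-based search
keyword_hits = [1, 4, 3]   # message ids ranked by keyword search
fused = reciprocal_rank_fusion([semantic_hits, keyword_hits])  # [1, 3, 4, 2]

conversation = ["m0", "m1", "m2", "m3", "m4"]
expanded = expand_context(1, conversation)  # ["m0", "m1", "m2"]
```

Messages that rank well in both searches rise to the top, and each match is then widened to include its neighbours before summarization.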

Browser Extension

The Browser Extension works across all major AI platforms.
Supported Platforms: ChatGPT • Claude • Gemini • Google AI Studio • Grok • Poe • DeepSeek • Qwen • And more…
How it works:
  • Performs one round of context retrieval per message
  • Can be configured for automatic injection (adds context to every message) or manual (only when you click the MemoryPlugin button)
  • Uses default token limits configured in extension settings
Chat History Memory is completely separate from Regular Memory in the browser extension. They use different systems and buttons.

Remote MCP Server (More Powerful)

The Remote MCP Server offers the most powerful and flexible way to use Chat History Memory.
Supported Platforms: Claude Desktop • Claude Web • Claude Mobile • Mistral AI • Cursor • Continue • Other MCP-compatible clients
Why it’s more powerful:
  • Multiple rounds of retrieval - Can fetch context multiple times in a single conversation
  • Parallel queries - Send an array of queries that all run simultaneously (e.g., 12 different searches about a topic)
  • User control - You tell the AI exactly when and how to fetch context through natural language
  • Token control - Specify how many tokens of context to fetch (e.g., “fetch 2000 tokens”)
  • Fetch large amounts - Can retrieve 20,000-30,000 tokens of context when needed
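As a rough illustration of why parallel queries help, the sketch below runs several retrieval calls concurrently with `asyncio.gather`. `fake_inject` is a hypothetical stand-in that only simulates latency. It does not contact MemoryPlugin, and its parameters are assumptions.

```python
import asyncio


async def fake_inject(query: str, max_tokens: int) -> str:
    """Hypothetical stand-in for one retrieval call; it only simulates latency."""
    await asyncio.sleep(0.01)
    return f"[context for {query!r}, up to {max_tokens} tokens]"


async def inject_many(queries: list[str], max_tokens: int) -> list[str]:
    """Run all queries concurrently; results come back in input order."""
    return await asyncio.gather(*(fake_inject(q, max_tokens) for q in queries))


results = asyncio.run(inject_many(["technical details", "design decisions"], 2000))
```

Because the calls overlap, the total wait is close to the slowest single query rather than the sum of all of them.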
How to use it: You control the MCP server through natural language instructions to the AI:
"At the start of every conversation, use the memoryplugin_inject tool
to learn about my skills and learning style"

"Use the memoryplugin inject tool to fetch context about my hobbies"

"Make 5 parallel inject queries to fetch context about this project
from different angles - one for technical details, one for design
decisions, one for user feedback, one for implementation challenges,
and one for future plans"
The AI will make the tool calls you request, MemoryPlugin returns the context, and the AI weaves it naturally into its response.
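Under the hood, an MCP client turns such an instruction into a `tools/call` request. The sketch below builds one as a Python dictionary. The JSON-RPC envelope follows the MCP specification and the tool name comes from the prompt above, but the argument names `queries` and `max_tokens` are assumptions; the real schema is whatever the server advertises in its tool listing.

```python
import json


def build_inject_call(queries, max_tokens, request_id=1):
    """Build an MCP `tools/call` request; the argument names are hypothetical."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": "memoryplugin_inject",
            # Assumed argument names, shown only to illustrate the shape.
            "arguments": {"queries": queries, "max_tokens": max_tokens},
        },
    }


request = build_inject_call(["my hobbies"], 2000)
payload = json.dumps(request, indent=2)  # what the client would send to the server
```

You never write this by hand. The AI assembles the call from your natural language instruction.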
Best platforms: Claude (Desktop, Web, or Mobile) and Mistral AI with the Remote MCP Server provide the best experience. These models excel at agentic tool use and naturally incorporating retrieved context into responses.

Controlling What Gets Added

Use exclusions to remove specific chats from being used as context:
  • Sensitive conversations you want to keep private
  • Off-topic chats that aren’t useful for context
  • Outdated discussions no longer relevant
When you exclude a chat, its data is deleted and it won’t be re-imported in future uploads. A placeholder remains to prevent automatic re-processing.
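The placeholder behaviour can be sketched as a small in-memory store. This is an illustration of the idea only, not MemoryPlugin’s actual storage; `ChatStore` and its methods are hypothetical.

```python
class ChatStore:
    """Toy store showing how an exclusion placeholder blocks re-import."""

    def __init__(self):
        self.chats = {}        # chat_id -> list of messages
        self.excluded = set()  # placeholders: ids that must never be re-imported

    def import_chat(self, chat_id, messages):
        """Store a chat unless it was excluded; return True if it was stored."""
        if chat_id in self.excluded:
            return False
        self.chats[chat_id] = messages
        return True

    def exclude(self, chat_id):
        """Delete the chat data and leave a placeholder behind."""
        self.chats.pop(chat_id, None)
        self.excluded.add(chat_id)


store = ChatStore()
store.import_chat("chat-1", ["hello"])
store.exclude("chat-1")
reimported = store.import_chat("chat-1", ["hello"])  # False: the placeholder blocks it
```

Keeping only the id means the conversation content is gone while a later upload of the same export still skips that chat.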

Searching and Analyzing Your History

Beyond adding context to conversations, you can actively search and analyze your chat history:

Search Your History

Perform powerful semantic searches across all your conversations. Find specific discussions by topic, keyword, or timeframe.
Go to Dashboard → Search

Ask Questions

Get AI-powered answers synthesized from your entire chat history. Ask natural language questions and receive comprehensive responses with citations.
Go to Dashboard → Ask

Tips for Best Results

When using the MCP server, tell the AI exactly what context you want. Instead of just activating MemoryPlugin, say “fetch context about my React projects” or “get information about my cooking preferences from past conversations.”
For comprehensive coverage, request multiple parallel queries: “Make 10 inject queries to fetch context about this topic from different angles.” The AI can search for technical details, design decisions, user feedback, and more simultaneously.
Match the amount of context to the task. Sometimes you want extensive context (20+ queries, high token count); sometimes you want none. Chat history is useful for all kinds of tasks, including creative writing or documentation, where past context helps maintain consistency.
You can tell the AI when to fetch context: at the start of every conversation, only when you mention it, for specific types of questions, etc. It’s entirely up to you.

Limitations

Chat History currently doesn’t understand how facts have changed over time. If you struggled with understanding photosynthesis in March but mastered it by June, the AI might not know which is current. This is something we’re working to improve.
Results depend on how well your AI model uses the retrieved context. Some models are better at incorporating external context than others.

Troubleshooting

Solutions if context isn’t being added:
  • Ensure Browser Extension or Remote MCP Server is installed and signed in
  • For browser extension: Check that Chat History is enabled in settings
  • For MCP: Make sure you’re explicitly telling the AI to use the inject tool
  • Verify you have uploaded and processed chat history
Solutions if the added context is irrelevant or excessive:
  • Be more specific in your prompts or instructions to the AI
  • Exclude irrelevant chats from the Chats tab
  • For MCP: Tell the AI to use fewer tokens or be more specific in its queries
If issues persist, contact support with:
  • Platform you’re using (ChatGPT, Claude, etc.)
  • Integration method (Browser Extension or Remote MCP Server)
  • Description of the issue

Next Steps