How to build a memory-enabled AI agent that continuously learns through episodic experiences and semantic patterns for long-term autonomy

by root November 17, 2025

In this tutorial, we explore how to build agentic systems that think beyond a single interaction by using memory as a core capability. We show how to design episodic memory to store experiences and semantic memory to capture long-term patterns, allowing an agent to evolve its behavior over multiple sessions. As we plan, act, revise, and reflect, we see how the agent gradually adapts to user preferences and becomes more autonomous. By the end, we understand how memory-driven reasoning helps us create agents that are more contextual, consistent, and intelligent in each interaction. Check out the full code here.

import numpy as np
from collections import defaultdict
import json
from datetime import datetime
import pickle


class EpisodicMemory:
   """Bounded log of individual (state, action, outcome) experiences."""

   def __init__(self, capacity=100):
       self.capacity = capacity
       self.episodes = []

   def store(self, state, action, outcome, timestamp=None):
       if timestamp is None:
           timestamp = datetime.now().isoformat()
       episode = {
           'state': state,
           'action': action,
           'outcome': outcome,
           'timestamp': timestamp,
           'embedding': self._embed(state, action, outcome)
       }
       self.episodes.append(episode)
       # Evict the oldest episode once capacity is exceeded.
       if len(self.episodes) > self.capacity:
           self.episodes.pop(0)

   def _embed(self, state, action, outcome):
       # Toy stand-in for an embedding: a hash bucket, not a semantic vector.
       text = f"{state} {action} {outcome}".lower()
       return hash(text) % 10000

   def retrieve_similar(self, query_state, k=3):
       if not self.episodes:
           return []
       query_emb = self._embed(query_state, "", "")
       scores = [(abs(ep['embedding'] - query_emb), ep) for ep in self.episodes]
       scores.sort(key=lambda x: x[0])
       return [ep for _, ep in scores[:k]]

   def get_recent(self, n=5):
       return self.episodes[-n:]


class SemanticMemory:
   """Aggregated knowledge: preference scores and per-context action outcomes."""

   def __init__(self):
       self.preferences = defaultdict(float)
       self.patterns = defaultdict(list)
       self.success_rates = defaultdict(lambda: {'success': 0, 'total': 0})

   def update_preference(self, key, value, weight=1.0):
       # Exponential moving average: keep 90% of the old score, add 10% of the new signal.
       self.preferences[key] = 0.9 * self.preferences[key] + 0.1 * weight * value

   def record_pattern(self, context, action, success):
       pattern_key = f"{context}_{action}"
       self.patterns[context].append((action, success))
       self.success_rates[pattern_key]['total'] += 1
       if success:
           self.success_rates[pattern_key]['success'] += 1

   def get_best_action(self, context):
       if context not in self.patterns:
           return None
       # Tally successes and attempts per action seen in this context.
       action_scores = defaultdict(lambda: {'success': 0, 'total': 0})
       for action, success in self.patterns[context]:
           action_scores[action]['total'] += 1
           if success:
               action_scores[action]['success'] += 1
       best_action = max(action_scores.items(), key=lambda x: x[1]['success'] / max(x[1]['total'], 1))
       return best_action[0] if best_action[1]['total'] > 0 else None

   def get_preference(self, key):
       return self.preferences.get(key, 0.0)

We define the core memory structures that the agent relies on. We build episodic memory to capture specific experiences and semantic memory to generalize patterns over time. With these foundations in place, the agent can start learning from its interactions, loosely analogous to how people learn from experience.
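To make the preference update concrete, here is a small standalone sketch of the same exponential-moving-average rule that update_preference applies (new = 0.9 * old + 0.1 * weight * value). The helper name ema_update and its decay parameter are illustrative only and are not part of the tutorial's classes.

```python
from collections import defaultdict


def ema_update(preferences, key, value, weight=1.0, decay=0.9):
    """Blend the old score with the new observation (illustrative helper)."""
    preferences[key] = decay * preferences[key] + (1 - decay) * weight * value
    return preferences[key]


prefs = defaultdict(float)  # unseen keys start at 0.0

# Repeating the same positive signal moves the score toward 1.0 gradually.
history = [ema_update(prefs, "genre_sci-fi", 1.0) for _ in range(3)]
print([round(x, 3) for x in history])  # [0.1, 0.19, 0.271]
```

After n identical updates the score is 1 - 0.9^n, so a single stated preference only counts for 0.1, and genres mentioned in more turns end up ranked higher.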

class MemoryAgent:
   def __init__(self):
       self.episodic_memory = EpisodicMemory(capacity=50)
       self.semantic_memory = SemanticMemory()
       self.current_plan = []
       self.session_count = 0

   def perceive(self, user_input):
       # Keyword-based intent detection on the lowercased input.
       user_input = user_input.lower()
       if any(word in user_input for word in ['recommend', 'suggest', 'what should']):
           intent = 'recommendation'
       elif any(word in user_input for word in ['remember', 'prefer', 'like', 'favorite']):
           intent = 'preference_update'
       elif any(word in user_input for word in ['do', 'complete', 'finish', 'task']):
           intent = 'task_execution'
       else:
           intent = 'conversation'
       return {'intent': intent, 'raw': user_input}

   def plan(self, state):
       intent = state['intent']
       user_input = state['raw']
       # Retrieved for context; this simple planner does not use them yet.
       similar_episodes = self.episodic_memory.retrieve_similar(user_input, k=3)
       plan = []
       if intent == 'recommendation':
           genre_prefs = {k: v for k, v in self.semantic_memory.preferences.items() if 'genre_' in k}
           if genre_prefs:
               # Recommend the genre with the highest learned preference score.
               best_genre = max(genre_prefs.items(), key=lambda x: x[1])[0]
               plan.append(('recommend', best_genre.replace('genre_', '')))
           else:
               plan.append(('recommend', 'general'))
       elif intent == 'preference_update':
           genres = ['sci-fi', 'fantasy', 'mystery', 'romance', 'thriller']
           detected_genre = next((g for g in genres if g in user_input), None)
           if detected_genre:
               plan.append(('update_preference', detected_genre))
       elif intent == 'task_execution':
           best_action = self.semantic_memory.get_best_action('task')
           if best_action:
               plan.append(('execute', best_action))
           else:
               plan.append(('execute', 'default'))
       self.current_plan = plan
       return plan

We build the agent's perception and planning system. It processes user input, detects intent, and draws on previously formed memories to create a plan. This is where the agent's reasoning and its choice of next action begin to take shape.
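The keyword checks in the intent-detection step use substring matching, so a short keyword such as 'do' also matches inside longer words like 'window'. If that becomes a problem, one option is whole-word matching. The sketch below is a variation under that assumption; detect_intent and INTENT_KEYWORDS are hypothetical names and not part of the tutorial's code.

```python
import re

# Hypothetical keyword table loosely modeled on the tutorial's intents.
INTENT_KEYWORDS = {
    "recommendation": ["recommend", "suggest", "what should"],
    "preference_update": ["remember", "prefer", "like", "favorite"],
    "task_execution": ["do", "complete", "finish", "task"],
}


def detect_intent(user_input):
    """Return the first intent whose keyword appears as a whole word or phrase."""
    text = user_input.lower()
    for intent, keywords in INTENT_KEYWORDS.items():
        for kw in keywords:
            # \b anchors keep 'do' from matching inside 'window'.
            if re.search(rf"\b{re.escape(kw)}\b", text):
                return intent
    return "conversation"


print(detect_intent("Open the window"))      # conversation
print(detect_intent("Please do the task"))   # task_execution
print(detect_intent("What should I read?"))  # recommendation
```

Whole-word matching is stricter in both directions: 'recommendation' no longer matches the keyword 'recommend', so the keyword lists would need those extra forms if you adopt this approach.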

   def act(self, action):
       action_type, param = action
       if action_type == 'recommend':
           if param == 'general':
               return "Let me learn your preferences first! What genres do you enjoy?"
           return f"Based on your preferences, I recommend exploring {param}!"
       elif action_type == 'update_preference':
           self.semantic_memory.update_preference(f'genre_{param}', 1.0, weight=1.0)
           return f"Got it! I'll remember you enjoy {param}."
       elif action_type == 'execute':
           return f"Executing task with strategy: {param}"
       return "Action completed"

   def revise_plan(self, feedback):
       # On negative feedback, fall back to the second-ranked genre if one exists.
       if 'no' in feedback.lower() or 'wrong' in feedback.lower():
           if self.current_plan:
               action_type, param = self.current_plan[0]
               if action_type == 'recommend':
                   genre_prefs = sorted(
                       [(k, v) for k, v in self.semantic_memory.preferences.items() if 'genre_' in k],
                       key=lambda x: x[1],
                       reverse=True
                   )
                   if len(genre_prefs) > 1:
                       new_genre = genre_prefs[1][0].replace('genre_', '')
                       self.current_plan = [('recommend', new_genre)]
                       return True
       return False

   def reflect(self, state, action, outcome, success):
       # Store the raw experience and update the aggregated success statistics.
       self.episodic_memory.store(state['raw'], str(action), outcome)
       self.semantic_memory.record_pattern(state['intent'], str(action), success)

We define how the agent performs actions, revises its plan when feedback goes against expectations, and reflects on each experience by storing it. Because the agent records every turn, its behavior can keep improving. This loop is what makes the system adaptive and self-correcting.
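The revision step can be read as "on negative feedback, fall back to the next-best preference". Below is a minimal standalone sketch of that idea using a plain dict. The function next_best_genre and the sample scores are made up for illustration and are not part of the tutorial's code.

```python
def next_best_genre(preferences, rejected):
    """Return the highest-scoring genre other than the rejected one, or None."""
    ranked = sorted(
        ((k.replace("genre_", ""), v) for k, v in preferences.items()
         if k.startswith("genre_")),
        key=lambda kv: kv[1],
        reverse=True,
    )
    for genre, _score in ranked:
        if genre != rejected:
            return genre
    return None


# Made-up scores for illustration only.
prefs = {"genre_sci-fi": 0.19, "genre_fantasy": 0.10, "genre_mystery": 0.05}
print(next_best_genre(prefs, rejected="sci-fi"))                   # fantasy
print(next_best_genre({"genre_sci-fi": 0.19}, rejected="sci-fi"))  # None
```

Unlike the tutorial's version, which always picks the second-ranked entry, this variant skips whichever genre was actually rejected, so it still behaves sensibly if the rejected genre was not the top-ranked one.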

   def run_session(self, user_inputs):
       self.session_count += 1
       print(f"\n{'='*60}")
       print(f"SESSION {self.session_count}")
       print(f"{'='*60}\n")
       results = []
       for i, user_input in enumerate(user_inputs, 1):
           print(f"Turn {i}")
           print(f"User: {user_input}")
           state = self.perceive(user_input)
           plan = self.plan(state)
           if not plan:
               print("Agent: I'm not sure what to do with that.\n")
               continue
           response = self.act(plan[0])
           print(f"Agent: {response}\n")
           # Heuristic success signal: recommendations and preference updates count as successes.
           success = 'recommend' in plan[0][0] or 'update' in plan[0][0]
           self.reflect(state, plan[0], response, success)
           results.append({
               'turn': i,
               'input': user_input,
               'intent': state['intent'],
               'action': plan[0],
               'response': response
           })
       return results

We simulate real interactions in which the agent processes several user inputs within a single session. We watch the cycle of perception, planning, action, and reflection repeat on every turn. As you run more sessions, the agent's responses become more personalized.
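The sessions in this tutorial all run inside one Python process, so the memories disappear when the program exits. If you want them to survive restarts, one option is to serialize the memory contents to disk. The sketch below assumes the state can be reduced to plain lists and dicts. The names save_memory and load_memory and the file layout are hypothetical and not part of the tutorial's code.

```python
import json
import os
import tempfile


def save_memory(path, episodes, preferences):
    """Write episodes (list of dicts) and preferences (dict) as JSON."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump({"episodes": episodes, "preferences": preferences}, f)


def load_memory(path):
    """Return (episodes, preferences); empty containers if the file is missing."""
    if not os.path.exists(path):
        return [], {}
    with open(path, encoding="utf-8") as f:
        data = json.load(f)
    return data["episodes"], data["preferences"]


# Round-trip demo in a temporary directory.
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "agent_memory.json")
    save_memory(
        path,
        episodes=[{"state": "i really like sci-fi books", "outcome": "noted"}],
        preferences={"genre_sci-fi": 0.1},
    )
    episodes, preferences = load_memory(path)

print(len(episodes), preferences)
```

JSON keeps the file human-readable. One caveat if you apply this to the tutorial's episodes: Python randomizes string hashes per process by default, so hash-based embeddings saved in one run are not comparable with those computed in the next and would need to be recomputed on load.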

def evaluate_memory_usage(agent):
   print("\n" + "="*60)
   print("MEMORY ANALYSIS")
   print("="*60 + "\n")
   print("Episodic Memory:")
   print(f"  Total episodes stored: {len(agent.episodic_memory.episodes)}")
   if agent.episodic_memory.episodes:
       print(f"  Oldest episode: {agent.episodic_memory.episodes[0]['timestamp']}")
       print(f"  Latest episode: {agent.episodic_memory.episodes[-1]['timestamp']}")
   print("\nSemantic Memory:")
   print(f"  Learned preferences: {len(agent.semantic_memory.preferences)}")
   # Show the five strongest preferences, highest score first.
   for pref, value in sorted(agent.semantic_memory.preferences.items(), key=lambda x: x[1], reverse=True)[:5]:
       print(f"    {pref}: {value:.3f}")
   print(f"\n  Action patterns learned: {len(agent.semantic_memory.patterns)}")
   print("\n  Success rates by context-action:")
   for key, stats in list(agent.semantic_memory.success_rates.items())[:5]:
       if stats['total'] > 0:
           rate = stats['success'] / stats['total']
           print(f"    {key}: {rate:.2%} ({stats['success']}/{stats['total']})")


def compare_sessions(results_history):
   print("\n" + "="*60)
   print("CROSS-SESSION ANALYSIS")
   print("="*60 + "\n")
   for i, results in enumerate(results_history, 1):
       # A response counts as personalized if it mentions the user's preferences.
       recommendation_quality = sum(1 for r in results if 'preferences' in r['response'].lower())
       print(f"Session {i}:")
       print(f"  Turns: {len(results)}")
       print(f"  Personalized responses: {recommendation_quality}")

We analyze how effectively the agent uses its memory. We inspect the stored episodes, learned preferences, and success patterns to assess how the agent evolves.
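The success-rate report boils down to a ratio of two counters per context-action key. A compact standalone version of that calculation, with made-up counts and a hypothetical success_rates_table helper, might look like this:

```python
def success_rates_table(stats):
    """Map each key to success/total, skipping keys with no attempts."""
    return {
        key: s["success"] / s["total"]
        for key, s in stats.items()
        if s["total"] > 0
    }


# Made-up counters for illustration only.
stats = {
    "recommendation_a": {"success": 3, "total": 4},
    "task_execution_b": {"success": 0, "total": 2},
    "unused": {"success": 0, "total": 0},
}
table = success_rates_table(stats)
for key, rate in sorted(table.items(), key=lambda kv: -kv[1]):
    print(f"{key}: {rate:.2%} ({stats[key]['success']}/{stats[key]['total']})")
```

Filtering out keys with zero attempts avoids a division by zero, which is the same guard the tutorial's report applies before printing each rate.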

def run_demo():
   agent = MemoryAgent()
   print("\n📚 SCENARIO: Agent learns user preferences across multiple sessions")
   session1_inputs = [
       "Hi, I'm looking for something to read",
       "I really like sci-fi books",
       "Can you recommend something?",
   ]
   results1 = agent.run_session(session1_inputs)
   session2_inputs = [
       "I'm bored, what should I read?",
       "Actually, I also enjoy fantasy novels",
       "Give me a recommendation",
   ]
   results2 = agent.run_session(session2_inputs)
   session3_inputs = [
       "What do you suggest for tonight?",
       "I'm in the mood for mystery too",
       "Recommend something based on what you know about me",
   ]
   results3 = agent.run_session(session3_inputs)
   evaluate_memory_usage(agent)
   compare_sessions([results1, results2, results3])
   print("\n" + "="*60)
   print("EPISODIC MEMORY RETRIEVAL TEST")
   print("="*60 + "\n")
   query = "recommend sci-fi"
   similar = agent.episodic_memory.retrieve_similar(query, k=3)
   print(f"Query: '{query}'")
   print(f"Retrieved {len(similar)} similar episodes:\n")
   for ep in similar:
       print(f"  State: {ep['state']}")
       print(f"  Action: {ep['action']}")
       print(f"  Outcome: {ep['outcome'][:50]}...")
       print()


if __name__ == "__main__":
   print("="*60)
   print("MEMORY & LONG-TERM AUTONOMY IN AGENTIC SYSTEMS")
   print("="*60)
   run_demo()
   print("\n✅ Tutorial complete! Key takeaways:")
   print("  • Episodic memory stores specific experiences")
   print("  • Semantic memory generalizes patterns")
   print("  • Agents improve recommendations over sessions")
   print("  • Memory retrieval guides future decisions")

We put everything together by running multiple sessions and testing memory retrieval. We see the agent improve across interactions and refine its recommendations based on accumulated knowledge. This end-to-end demonstration shows how long-term autonomy can grow out of the memory systems we build.
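One caveat about the retrieval test: the tutorial's _embed maps text to a hash value modulo 10000, so the distance between two embeddings says little about how similar the texts actually are. If you want retrieval that reflects shared wording, a simple alternative is token overlap (Jaccard similarity). The sketch below is a swapped-in technique, not the tutorial's method, and jaccard and retrieve_similar_by_overlap are hypothetical names.

```python
def jaccard(a, b):
    """Jaccard similarity between the word sets of two strings."""
    sa, sb = set(a.lower().split()), set(b.lower().split())
    if not sa or not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)


def retrieve_similar_by_overlap(episodes, query, k=3):
    """Return up to k episodes whose 'state' text overlaps most with the query."""
    ranked = sorted(episodes, key=lambda ep: jaccard(ep["state"], query), reverse=True)
    return ranked[:k]


episodes = [
    {"state": "i really like sci-fi books"},
    {"state": "can you recommend something?"},
    {"state": "i'm in the mood for mystery too"},
]
top = retrieve_similar_by_overlap(episodes, "recommend sci-fi", k=2)
print([ep["state"] for ep in top])
# ['can you recommend something?', 'i really like sci-fi books']
```

Both top results share exactly one word with the query. The shorter episode ranks first because Jaccard similarity divides by the size of the combined word set. A production system would more likely use learned sentence embeddings, but the ranking logic would have the same shape.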

In conclusion, we find that combining episodic and semantic memory lets us build agents that learn continuously and make better decisions over time. We watch the agent adjust its recommendations, adapt its plans, and draw on past experience to improve its responses with each session. Through these mechanisms, we see how long-term autonomy can emerge from simple but effective memory structures.

Asif Razzaq is the CEO of Marktechpost Media Inc. As a visionary entrepreneur and engineer, Asif is committed to harnessing the potential of artificial intelligence for social good. His most recent endeavor is the launch of Marktechpost, an artificial intelligence media platform that stands out for its in-depth coverage of machine learning and deep learning news that is both technically sound and easily understood by a wide audience. The platform has over 2 million monthly views, reflecting its popularity among readers.
