Building an Agent Executor for Writing Blog Posts¶
This recipe demonstrates how to build an Agent Executor using Mirascope to automate the process of researching and writing a blog post. We'll create a system that combines a researcher agent and a writer tool, orchestrated by an executor agent.
Mirascope Concepts Used
Background
Agent-based systems in AI involve creating autonomous agents that can perform tasks or make decisions. In this recipe, we're using multiple agents (researcher and writer) coordinated by an executor to create a blog post. This approach allows for a more modular and potentially more effective content creation process.
System Architecture¶
flowchart TD
AE[Agent Executor]
R[Researcher.research]
WID[_write_initial_draft]
OAPI[OpenAI API]
subgraph Researcher
RA[Researcher Agent]
WS[Web Search]
PW[Parse Webpage]
end
AE --> R
AE --> WID
R --> RA
RA --> WS
RA --> PW
WS -.-> OAPI
PW -.-> OAPI
RA -.-> OAPI
WID -.-> OAPI
AE -.-> OAPI
classDef agent fill:#e1d5e7,stroke:#9673a6,stroke-width:2px;
classDef tool fill:#fff2cc,stroke:#d6b656,stroke-width:2px;
classDef api fill:#dae8fc,stroke:#6c8ebf,stroke-width:2px;
class AE,RA agent;
class R,WID,WS,PW tool;
class OAPI api;
Setup¶
To set up our environment, first let's install all of the packages we will use:
Make sure to also set your OPENAI_API_KEY
if you haven't already.
Implementing the BaseAgent
¶
First, let's create a base OpenAIAgent
class that we can later subclass to implement specialized agents:
from mirascope.core import openai
from openai.types.chat import ChatCompletionMessageParam
from pydantic import BaseModel
class OpenAIAgent(BaseModel):
history: list[ChatCompletionMessageParam] = []
@abstractmethod
def _step(self, prompt: str) -> openai.OpenAIStream: ...
def run(self, prompt: str) -> str:
stream = self._step(prompt)
result, tools_and_outputs = "", []
for chunk, tool in stream:
if tool:
tools_and_outputs.append((tool, tool.call()))
else:
result += chunk.content
print(chunk.content, end="", flush=True)
if stream.user_message_param:
self.history.append(stream.user_message_param)
self.history.append(stream.message_param)
if tools_and_outputs:
self.history += stream.tool_message_params(tools_and_outputs)
return self.run("")
return result
Note that the _step
function is marked as an abstract method that each subclass will need to implement.
Research Agent¶
The first step to writing a good blog post is researching your topic, so let's create an agent that can search the internet and summarize relevant information that we can later consume when writing the post.
Web Search Tool¶
We can use the duckduckgo-search
package (with no API key!) to perform some basic keyword search on the internet. Note that we are including self
as an argument so that we can access the state of the Researcher
agent we will build. This enables easier configuration.
def web_search(self, text: str) -> str:
"""Search the web for the given text.
Args:
text: The text to search for.
Returns:
The search results for the given text formatted as newline separated
dictionaries with keys 'title', 'href', and 'body'.
"""
try:
results = DDGS(proxy=None).text(text, max_results=self.max_results)
return "\n\n".join(
[
inspect.cleandoc(
"""
title: {title}
href: {href}
body: {body}
"""
).format(**result)
for result in results
]
)
except Exception as e:
return f"{type(e)}: Failed to search the web for text"
Parsing HTML Content¶
Our web_search
tool only returns search results -- not the actual content of the webpages found at the href results of our search. While we could deterministically parse every web page returned, let's instead provide our researcher with a tool for parsing the content. The value of this approach is that we can greatly increase the number of search results and let the researcher decide which of the results are worth parsing and using.
def parse_webpage(self, link: str) -> str:
"""Parse the paragraphs of the webpage found at `link`.
Args:
link: The URL of the webpage.
Returns:
The parsed paragraphs of the webpage, separated by newlines.
"""
try:
response = requests.get(link)
soup = BeautifulSoup(response.content, "html.parser")
return "\n".join([p.text for p in soup.find_all("p")])
except Exception as e:
return f"{type(e)}: Failed to parse content from URL"
Researcher Step Function¶
Now that we have our tools we're ready to implement the _step
method of our researcher where the majority of the remaining work lies in engineering the prompt:
@openai.call("gpt-4o-mini", stream=True)
@prompt_template(
"""
SYSTEM:
Your task is to research a topic and summarize the information you find.
This information will be given to a writer (user) to create a blog post.
You have access to the following tools:
- `web_search`: Search the web for information. Limit to max {self.max_results}
results.
- `parse_webpage`: Parse the content of a webpage.
When calling the `web_search` tool, the `body` is simply the body of the search
result. You MUST then call the `parse_webpage` tool to get the actual content
of the webpage. It is up to you to determine which search results to parse.
Once you have gathered all of the information you need, generate a writeup that
strikes the right balance between brevity and completeness. The goal is to
provide as much information to the writer as possible without overwhelming them.
MESSAGES: {self.history}
USER: {prompt}
"""
)
def _step(self, prompt: str) -> openai.OpenAIDynamicConfig:
return {"tools": [self.web_search, self.parse_webpage]}
Implementing a research
tool method¶
While we could use the run
method from our OpenAIAgent
as a tool, there is value in further engineering our prompt by providing good descriptions (and names!) for the tools we use. Putting everything together, we can expose a research
method that we can later use as a tool in our agent executor:
class Researcher(OpenAIAgent):
max_results: int = 10
...
def research(self, prompt: str) -> str:
"""Research a topic and summarize the information found.
Args:
prompt: The user prompt to guide the research. The content of this prompt
is directly responsible for the quality of the research, so it is
crucial that the prompt be clear and concise.
Returns:
The results of the research.
"""
print("RESEARCHING...")
result = self.run(prompt)
print("RESEARCH COMPLETE!")
return result
Writing An Initial Draft¶
The next step when writing a blog is to write an initial draft and critique it. We can then incorporate the feedback from the critique to iteratively improve the post. Let's make a call to an LLM to write this first draft as well as critique it:
from pydantic import ValidationError
from mirascope.integrations.tenacity import collect_errors
class InitialDraft(BaseModel):
draft: str
critique: str
@staticmethod
def parse_initial_draft(response: InitialDraft) -> str:
return f"Draft: {response.draft}\nCritique: {response.critique}"
@retry(
wait=wait_exponential(multiplier=1, min=4, max=10),
after=collect_errors(ValidationError),
)
@openai.call(
"gpt-4o-mini", response_model=InitialDraft, output_parser=parse_initial_draft
)
@prompt_template(
"""
SYSTEM:
Your task is to write the initial draft for a blog post based on the information
provided to you by the researcher, which will be a summary of the information
they found on the internet.
Along with the draft, you will also write a critique of your own work. This
critique is crucial for improving the quality of the draft in subsequent
iterations. Ensure that the critique is thoughtful, constructive, and specific.
It should strike the right balance between comprehensive and concise feedback.
If for any reason you deem that the research is insufficient or unclear, you can
request that additional research be conducted by the researcher. Make sure that
your request is specific, clear, and concise.
MESSAGES: {self.history}
USER:
{previous_errors}
{prompt}
"""
)
def _write_initial_draft(
self, prompt: str, *, errors: list[ValidationError] | None = None
) -> openai.OpenAIDynamicConfig:
"""Writes the initial draft of a blog post along with a self-critique.
Args:
prompt: The user prompt to guide the writing process. The content of this
prompt is directly responsible for the quality of the blog post, so it
is crucial that the prompt be clear and concise.
Returns:
The initial draft of the blog post along with a self-critique.
"""
return {
"computed_fields": {
"previous_errors": f"Previous Errors: {errors}"
if errors
else None
}
}
There are a few things worth noting here:
- We are again using
self
for convenient access to the containing class' state. In this case we expect to put this function inside of our executor and want to give access to the conversation history -- particularly the results of the researcher. - We are using
response_model
to extract specifically thedraft
andcritique
fields. - We are using an output parser
parse_initial_draft
to parse theInitialDraft
class into a format that is friendly for using tools (str
). - We are using
tenacity
in order to retry should the call fail to properly generate anInitialDraft
instance, reinserting the list of previous errors into each subsequent call.
Agent Executor¶
Now we just need to put it all together into our AgentExecutor
class, write our _step
function, and run it!
class AgentExecutor(OpenAIAgent):
researcher: Researcher = Researcher()
num_paragraphs: int = 4
...
@openai.call("gpt-4o-mini", stream=True)
@prompt_template(
"""
SYSTEM:
Your task is to facilitate the collaboration between the researcher and the
blog writer. The researcher will provide the blog writer with the information
they need to write a blog post, and the blog writer will draft and critique the
blog post until they reach a final iteration they are satisfied with.
To access the researcher and writer, you have the following tools:
- `research`: Prompt the researcher to perform research.
- `_write_initial_draft`: Write an initial draft with a self-critique
You will need to manage the flow of information between the researcher and the
blog writer, ensuring that the information provided is clear, concise, and
relevant to the task at hand.
The final blog post MUST have EXACTLY {self.num_paragraphs} paragraphs.
MESSAGES: {self.history}
USER: {prompt}
"""
)
def _step(self, prompt: str) -> openai.OpenAIDynamicConfig:
return {"tools": [self.researcher.research, self._write_initial_draft]}
if __name__ == "__main__":
agent = AgentExecutor()
print("STARTING AGENT EXECUTION...")
agent.run("Help me write a blog post about LLMs and structured outputs.")
Additional Real-World Applications
-
Automated Content Marketing:
- Create a system that generates targeted blog posts for different customer segments based on current market trends and company data.
- Example: An e-commerce platform could use this to write product category overviews, incorporating latest fashion trends and customer preferences.
-
Technical Documentation Generation:
- Develop an agent that researches API changes, new features, and community feedback to automatically update and expand technical documentation.
- Example: A software company could use this to keep their SDK documentation up-to-date with each new release.
-
Personalized Learning Content:
- Build an educational tool that creates customized study materials based on a student's learning style, current knowledge, and learning goals.
- Example: An online learning platform could generate personalized course summaries and practice exercises for each student.
-
Automated News Summary and Analysis:
- Create a system that gathers news from various sources, summarizes key points, and generates analytical pieces on trending topics.
- Example: A news agency could use this to produce daily briefings on complex, evolving stories like economic trends or geopolitical events.
-
Scientific Literature Review Assistant:
- Develop an agent that can scan recent publications in a specific field, summarize key findings, and draft literature review sections for research papers.
- Example: Researchers could use this to stay updated on the latest developments in their field and to assist in writing comprehensive literature reviews.
-
Legal Document Drafting:
- Create a system that researches relevant case law and regulations to assist in drafting legal documents like contracts or briefs.
- Example: A law firm could use this to generate first drafts of standard contracts, incorporating the latest legal precedents and regulations.
-
Product Description Generator:
- Build an agent that researches product features, customer reviews, and market trends to write engaging and informative product descriptions.
- Example: An online marketplace could use this to automatically generate or update descriptions for thousands of products.
-
Travel Guide Creation:
- Develop a system that researches destinations, local attractions, and traveler reviews to create personalized travel guides.
- Example: A travel company could use this to generate custom itineraries and destination guides based on a traveler's preferences and budget.
When adapting this recipe, consider:
- Implement a feedback loop where the executor can request additional research or revisions.
- Add more specialized agents, such as an editor or fact-checker.
- Incorporate user feedback into the writing process.
- Extend the system to handle multiple blog post formats or styles.
- Implement caching for research results to improve efficiency for similar topics.
- Adjusting the prompts and system messages to fit your specific use case or writing style.
- Experimenting with different LLM models for various tasks (research vs. writing).
- Implementing error handling and logging for production use.
- Optimizing the web search and parsing functions for better performance and reliability.