Towards Responsible Generative AI:
A Reference Architecture for Designing Foundation Model based Agents

Qinghua Lu, Liming Zhu, Xiwei Xu, Zhenchang Xing, Stefan Harrer, Jon Whittle
Data61, CSIRO, Australia
Email: {firstname}.{lastname}@data61.csiro.au

Abstract

Foundation models, such as large language models (LLMs), have been widely recognised as transformative AI technologies due to their capabilities to understand and generate content, including plans with reasoning capabilities. Foundation model based agents derive their autonomy from the capabilities of foundation models, which enable them to autonomously break down a given goal into a set of manageable tasks and orchestrate task execution to meet the goal. Despite the huge efforts put into building foundation model based agents, the architecture design of the agents has not yet been systematically explored. Also, while there are significant benefits of using agents for planning and execution, there are serious considerations regarding responsible AI related software quality attributes, such as security and accountability. Therefore, this paper presents a pattern-oriented reference architecture that serves as guidance when designing foundation model based agents. We evaluate the completeness and utility of the proposed reference architecture by mapping it to the architecture of two real-world agents.

Key terms - Foundation model, large language model, LLM, agent, architecture, pattern, responsible AI, AI safety.

1 Introduction

Foundation models (FMs) [1], such as large language models (LLMs), have been widely recognised as transformative generative artificial intelligence (GenAI) technologies due to their remarkable capabilities to understand and generate content. FMs are pretrained on massive amounts of data and can be adapted to perform a wide variety of tasks and significantly improve productivity. Significant efforts have been placed on utilising FMs’ human-like reasoning capabilities for a diverse range of downstream tasks, such as question answering and information summary. However, it is crucial to acknowledge that FMs exhibit inherent limitations, particularly when facing complex tasks. Users are required to provide prompts at each individual step, which can be inefficient and prone to errors. There have been recently a rapidly growing interest in the development of FM-based autonomous agents [2, 3], such as Auto-GPT¹¹1https://github.com/Significant-Gravitas/Auto-GPT and BabyAGI²²2https://github.com/yoheinakajima/babyagi. With autonomous agents, users only need to provide a high-level goal, rather than providing explicit step-by-step instructions. These agents derive their autonomy from the capabilities of FMs, enabling them to autonomously break down the given goal into a set of manageable tasks and orchestrate task execution to fulfill the goal. While huge efforts have been put on building FM-based agents, the architecture design of the agents has not yet been systematically explored. Many reusable solutions have been proposed to address the diverse challenges for designing FM-based agents, which motivates the design of a reference architecture for FM-based agents.

On the other hand, there are significant challenges in responsible AI when designing FM-based agents [4, 5]. First, autonomy is a core feature enabled by FM-based agents. These agents can infer human intentions and goals, either explicitly or implicitly, from multimodal context information, generate plans, use external tools/systems, cooperate with other agents, and may even create new tools and agents. In such cases, the accountability for actions taken by the agents may be shared among the agent owner, the FM provider, and various providers of external tools/agents. Second, enabling accountability necessitates underlying supporting mechanisms for traceability. Third, the goals or instructions set by humans for the agents, as well as the agents’ outputs and behaviour, including interaction with external tools and other agents, must be trustworthy and responsible.

Therefore, we have performed a systematic literature review (SLR) on FM-based agents. Based on the review results, a collection of architectural components and patterns have been identified to address different challenges of agent design. This paper presents a pattern-oriented reference architecture (RA) that serves as architecture design guidance for designing FM-based agents. We evaluate the completeness and utility of the proposed reference architecture by mapping it to the architecture of two real-world agents: MetaGPT [6] and HuggingGPT [7]. The contribution of this paper includes:

•

A reference architecture for FM-based agents that can be used as a template to guide the architecture design.
•

A collection of architectural patterns that can be utilised in the architecture design of FM-based agents to ensure trustworthiness and address responsible AI related software quantities.

2 Methodology

This section presents the methodology employed in this study. An empirically-grounded design methodology has been adopted [8]. Our initial step involving determining the type of our reference architecture. We decided to design an industry-crosscutting, classical, facilitation reference architecture. Here, “industry-crosscutting” means that the reference architecture can span multiple industries, ”classical” indicates that its development is based on existing FM-based agents, and “facilitation” signifies that its aim is to guide the future design of FM-based agents, whether within a single organisation or across multiple organisations. Our design strategy is a combination of “research-driven” and “practice-driven”, as the design of this reference architecture is founded mainly on the findings of an SLR which includes both academic research papers and industry papers. Our project experiences in designing the FM-based scientific discovery agent and responsible AI copilot have also been beneficial for this study, particularly in multimodal context engineering and responsible AI. The third step is the empirical acquisition of data. We performed an SLR to identify the relevant studies. The key search terms include foundation model and agent. The supportive terms are large language model, LLM, FM. We finally identified 57 studies after conducting the paper search, snowballing, and quality assessment. Based on the acquired data, we constructed a reference architecture for FM-based agents by integrating the architectural patterns and components identified. We annotated different patterns that can lead to the instantiation of various concrete architectures for FM-based agents. In the final step, the evaluation of our proposed reference architecture was carried out by reviewing two real-world agents. We mapped the architectural components of the two agents onto the proposed reference architecture, to evaluate that the reference architecture can be transformed into meaningful concrete architectures.

Refer to caption — Figure 1: Architecture of an agent-based ecosystem.

3 Reference Architecture

Fig. 1 provides an architectural overview of an agent-based ecosystem. Users define high-level goals for the agents to achieve. The agents can be categorised into two types [9, 10, 11]: agent-as-a-coordinator and agent-as-a-worker. Agents in the coordinator role primarily formulate high-level strategies and orchestrate the execution of tasks by delegating task execution responsibilities to other agents, external tools, or non-agent systems. On the other hand, agents in the worker role need to generate strategies and execute specific tasks in line with those strategies. To complete these tasks, agents in the worker role may need to cooperate or compete with other agents, or call external tools or non-agent AI/non-AI systems. Fig. 2 illustrates the reference architecture of an FM-based agent.

3.1 Interaction engineering

Interaction engineering comprises two components: context engineering and prompt/response engineering. Context engineering is designed to collect and structure the context in which the agent operates to understand the user’s goals [12], while prompt/response engineering is responsible for generating prompts/responses, enabling the FM-based agents to successfully achieve the human’s goals. There are two patterns that can be applied for comprehending and shaping the goals: passive goal creator and proactive goal creator. Passive goal creator analyses the user’s articulated goals, as described through text prompts submitted by the user via the dialogue interface [13, 14]. Conversely, proactive goal creator goes beyond the explicit user text prompt and anticipates the user’s goals by understanding the user interface (UI) of relevant tools and human interaction [15]. This is facilitated through the analysis of multimodal context information, including screen recording [16], mouse clicks³³3https://github.com/ddupont808/GPT-4V-Act⁴⁴4 https://www.loom.com/share/9458bcbf79784162aa62ffb8dd66201b, typing, eye tracking, gestures [15], document annotations and notes. In the multi-agent environment, proactive goal creator can also anticipate other agents’ decisions and formulate plans proactively [17]. Both passive goal creator and proactive goal creator need to consider the personas created by the users or the other agents through the persona creator [18, 19]. The persona can include the roles, communication style, expertise capabilities, limitation boundaries, etc. Once the goal is confirmed by the user, prompt/response generator automatically generates the prompts/responses with constraints and specifications, defining the desired input or output content and format in alignment with the the ultimate goal. A prompt or response template is often used in the prompt/response generator as a factory that creates prompt or response instances from the template [20, 21]. The template provides a structured way to standardise the queries and responses, which can improve the response accuracy and interoperability with external tools or agents.

3.2 Memory

The agent’s memory stores current context information and historical data and knowledge that have been accumulated over time to inform planning and actions. The memory is structured using short-term memory and long-term memory. Short-term memory refers to the information within the context window of the FM, which is in-context and can be accessed by the FM during inference. The information stored in short-term memory includes configurations, recent events, and working context [22, 23, 20, 24]. The configurations encompass the agent’s persona, capabilities, constraints, function description, etc. The events cover user interactions (e.g., prompts, clicks), system messages, and other events that have occurred recently. The working context refers to the status of the task or set of tasks that the agent is currently working on. The length of short-term memory is limited by the context window length of the FM, which is the length of text the FM can take as input when generating a response. For example, the latest version of GPT-4 has a maximum limit of 128k tokens, equivalent to approximately 300 pages of text⁵⁵5https://openai.com/blog/new-models-and-developer-products-announced-at-devday. It is challenging to provide prompts containing data larger than the token limit. To enhance the storage capacity of memory, long-term memory refers to the information maintained outside the context window of the FM. This long-term memory stores two types of information [22, 23]: the entire history of events processed, and the knowledge and past experiences (e.g., observations, thoughts, behaviour). For the FM to process this information, it must be explicitly selected from long-term memory through memory retrieval and moved into short-term memory.

3.3 Planning

To achieve the user’s goal, the agent needs to work out strategies and make a plan accordingly [25]. There are two design patterns for plan generation: single-path plan generator and multi-path plan generator. The single-path plan generator orchestrates the generation of intermediate steps leading to the achievement of the user’s goal. Each step is designed to have only one subsequent step, such as Chain-of-Thought [26]. Self-consistency asks the FM several times and selects the most consistent answer as the final answer [27]. On the other hand, multi-path plan generator allows for the creation of multiple choices at each step. Each intermediate step may lead to multiple subsequent steps. Self-consistent Chain-of-Thought [28] and Tree of Thought [29] exemplify this design pattern. Both the single-path plan generator and the multi-path plan generator can involve two design patterns of model querying: one-shot model querying and incremental model querying. In one-shot model querying, such as Chain-of-Thought [26] and self-consistent Chain-of-Thought [28], the FM is accessed in a single instance to efficiently generate all the necessary reasoning steps for the plan. The incremental model querying, such as Tree of Thought [29], involves accessing the FM at each step of the plan generation process, which enables a more iterative and interactive planning process that can adapt to evolving circumstances or changing user requirements. Plan reflection allows the agent to incorporate feedback to refine the plan [30] through three design patterns: self-reflection, cross-reflection, and human reflection. Self-reflection enables the agent to generate feedback on the plan and provide refinement guidance from themselves [31, 32, 33, 34]. Cross-reflection uses different agents or FMs to provide feedback and refinement on the plan [35, 34, 36]. The agent can collect feedback from humans to refine the plan, which can effectively make the plan aligned with the humans preference [37, 38].

3.4 Execution engine

Once the plan is determined, the role of the execution engine is to put the plan into action. The task executor is responsible for performing the tasks outlined in the plan. A task monitor is necessary to monitor the task’s execution status and manage the tasks queued for execution [39, 40]. Apart from directly utilizing internal knowledge of FMs or extracting external knowledge through retrieval augmented generation (RAG) to guide the action execution, the task executor can cooperate with other agents or leverage external tools [41, 42], which expands the capabilities of the agent. The tool/agent selector [43, 36] can perform a search in the tool/agent registry/marketplace [41, 44, 42] or on the web to find the relevant tools and agents to complete the tasks. Humans can trade or hire specialised agents in the marketplace. The owners of these agents can receive incentives as they build agents’ memory based on their own knowledge or skills. An FM-based ranker can be applied to analyse the performance of the tools/agents and identify the best ones [10]. The tool/agent generator can automatically create tools and agents based on natural language requirements [45, 46].

There are three design patterns to support multi-agent cooperation: voting-based cooperation, role-based cooperation, and debate-based cooperation. In the voting-based cooperation, agents can freely provide their opinions and reach consensus through voting [47]. The cooperation can be also reached by assigning different roles to agents through role-based cooperation [19, 48, 49, 48, 19]. One special type of role-based cooperation is the competitive auction mode. This involves two roles: the auctioneer who proposes a task and orchestrates the auction process, and the bidders who actively participate in the bidding [50, 51]. Debate-based cooperation integrates concepts from game theory, where one agent can receive feedback from other agents [52, 53] and adjust the thoughts and behaviours until consensus is reached [54, 55].

3.5 Responsible AI (RAI) plugins

To ensure responsible AI, a set of patterns can be adopted as plugins [56]. A continuous risk assessor [57, 58] continuously monitors and assesses AI risk metrics to prevent the misuse of the agent and to ensure the trustworthiness of the agent. A black box recorder [4] records the runtime data, which can be then shared with relevant stakeholders to enable transparency and accountability. The recorded data includes the input, output, and intermediate data for each component within the architecture, such as the input and output of the FMs, external tools, or other RAI plugins (e.g. guardrails). All these data need to be kept as evidence with the timestamp and location data, e.g., using a blockchain-based immutable log.

A specific type of monitoring is called guardrails, which is a layer in between FMs or fine-tuned FMs and other components or systems. They are designed to control the inputs and outputs of FMs to meet specific requirements, such as user requirements, ethical standards, laws. Guardrails can be built on an RAI knowledge base, or RAI narrow models or RAI FMs. The RAI FMs can be fine-tuned or can call upon an RAI knowledge base to support RAI controls. There can be different types of guardrails. Input guardrails are applied to the inputs received from users, such as refusing or modifying user prompts. For example some users’ prompts may contain personally identifiable information (PII) and need to be removed by employing data de-identification and anonymisation [59] before being sent to the FMs. Output guardrails focus on the output generated by the FM, such as modifying the output of the FM or preventing certain outputs from being returned to the user. RAG guardrails are used to ensure the retrieved data is appropriate, either by refusing or modifying the retrieved data. Execution guardrails are applied to input/output of the narrow AI models or external tools that the FM may invoke for action execution. Whitelists and blacklists can be established to identify actions that are permitted and prohibited respectively [60]. During the workflow execution, intermediate guardrails can be used to guarantee that each intermediate step meets the necessary criteria. This involves checking if an action should be carried out or determining if the FM should be invoked, or deciding if a predefined response should be used instead, etc.

To provide comprehensive monitoring and control, multimodal guardrails can be designed to prevent inappropriate multimodal inputs sent to the FM, whether those inputs are from the users or other software components or external tools or models. Prevent inappropriate multimodal outputs generated by the FM itself, whether those outputs are sent to the user or other software components or external tools or models. For example, if UIs are generated dynamically on demand, multimodal guardrails can effectively flag any UI elements that fail to meet specific requirements such as standards.

The explainer’s role is to articulate the agent’s roles, capabilities, limitations, the rationale behind its intermediate or final outputs [61], and ethical or legal implications [62]. However, verifying the explanations can be challenging. The external systems, including tools, agents, FMs, can be associated with an AIBOM that records their supply chain details, including AI risk metrics or verifiable responsible AI credentials [63, 64]. The procurement information can be maintained in an AIBOM registry. The agent can refuse to run the external tools, agents or FMs if there is questionable provenance. As there will be more rapid releases about fine-tuned FMs, there is a trend that multiple FM variations co-exist and are serviced at the same time. The co-versioning registry can be applied to co-version the AI components, such as FMs and fine-tuned FMs [5].

3.6 AI models

When designing the architecture of FM-based applications, including agents, one of the most critical decisions is choosing whether to use an external FM, or a fine-tuned FM [65, 42], or build a sovereign FM in-house from scratch [4, 5]. Using an external FM can be cost-effective and can potentially result in higher accuracy and generalisability to a wider range of tasks. To better perform domain-specific tasks, the parameters of an external FM can be fine-tuned using domain-specific data. Training a sovereign FM in-house from scratch provides complete control over the model pipeline to ensure security and help keep competitive advantage in some domains. However, this requires significant investments in terms of cost and resources. N-version programming [65, 66, 56] can improve the reliability of the agent by using different versions of FMs as basis for its reasoning.

4 Evaluation

We evaluate the completeness and utility of the proposed reference architecture by mapping it to two real-world agents: MetaGPT[6] and HuggingGPT [7]. MetaGPT enables multi-agent collaborations to streamline the software engineering workflow. HuggingGPT leverages FMs to connect different AI models based on their function descriptions found in the Hugging Face machine learning community to address various tasks. The interactions between humans and MetaGPT are sent through the dialogue interface. The persona creator is realised by creating specialised roles based on the Role class. The passive goal creator is implemented by incorporating the goal into the specialised role. The prompt generator formulates a structured prompt template for each action conforming to role-specific standards. Both short-term memory and long-term memory are integrated into the design. Each agent is created using role-specific prompts to establish a role context. An individual memory cache is maintained by each agent to index subscribed messages by their content, sender, and recipient, while a shared memory pool is designed for the environment. Single-path plan generation is applied through increment model querying. Complex tasks are broken down into smaller, manageable sub-tasks for individual agents to address. Key information is extracted from the environment, stored in memory, and employed to inform reasoning and subsequent actions. A feedback mechanism continuously refines the code using its own memory on historical execution and debugging. Diverse roles are assigned to various agents and establish effective role-based cooperation. Task executor uses role-specific interests to gather relevant information and executes its action once it has received all the necessary prerequisites. Different roles are allocated by tool/agent generator to various agents with unique skills, each contributing specialised outputs for particular tasks. Guardrails define the operational boundaries or constraints that the role must follow during the execution of actions. The explainer displays the task-related thoughts and the resulting artifacts on the interface. The external FM, GPT4-32k, serves as the underlying FM.

Table 1: Mapping components in the architecture of MetaGPT and HuggingGPT.

Component	MetaGPT	HuggingGPT
Persona creator	Specialised roles can be created from the fundamental Role class.	N/A
Dialogue interface	Receiving inputs sent by humans and responds to it.	Generating responses to address user requests.
Passive goal creator	The goal included in the specialised role represents the main objective that the role aims to achieve.	User requests include complex intents that can be interpreted as their intended goals.
Prompt generator	Each action is provided with a prompt template that conforms to the standards for the role.	A task template is provided to guide the FM to analyse user requests and parse tasks accordingly through field filling.
Short-term memory	Each role is created using specialised role prompts to establish a role context.	Model descriptions are integrated into prompts, allowing the FM to select the appropriate models for task execution.
Long-term memory	An individual memory cache is maintained by each role, while a shared memory pool is designed for the environment.	N/A
Single-path plan generation	Complex tasks are broken into smaller, manageable sub-tasks for individual agents to complete.	User requests are decomposed into a set of structured tasks with dependencies and execution orders.
Increment model querying	Significant information is extracted from the environment, stored in memory, and used to guide reasoning and subsequent actions.	Generating results by recursively querying the FM.
Self-reflection	A feedback mechanism debugs and executes code at runtime.	N/A
Role-based cooperation	Establishing collaboration among multiple roles.	N/A
Task executor	Each agent extracts relevant information and executes its action once it has received all the necessary prerequisites.	Running each selected model and delivering the results.
Tool/agent generator	Allocating different roles to various agents with unique skills.	N/A
Tool/agent detector	N/A	Identifying relevant models based on their description.
Guardrails	The constraints specify limitations or rules the role must follow during the execution of actions.	N/A
Black box recorder	N/A	The workflow logs and chat history are maintained.
Explainer	The interface displays task-specific thoughts and output artifacts.	A summary of logs is generated for the user.
External FM	GPT4-32k.	GPT-3.5-turbo, text-davinci-003 and GPT-4.

HuggingGPT actively generates responses that address user requests through the dialogue interface. The passive goal creator is implemented by interpreting user’s intents as clearly defined goals. The prompt generator creates a task template to direct the FM in processing user requests and structuring tasks via field filling. HuggingGPT only supports short-term memory. Model descriptions are incorporated into prompts, enabling the FM to choose the appropriate models for solving tasks. Single-path plan generation coupled with increment model querying is employed for planning. User requests are analysed and decomposed into an ordered sequence of tasks, each with its own prerequisites. The steps are determined by recursively querying the FM. Tool/agent detector identifies the relevant AI models using the model descriptions. Task executor runs each selected model and produces the results. The black box recorder maintains the workflow logs, while the explainer synthesises the logs into a comprehensive summary for the user. HuggingGPT leverages external FMs, including GPT-3.5-turbo, text-davinci-003 and GPT-4, which serve as primary models in its experiments.

Our reference architecture is complete and usable, as two agent architectures can be mapped on the proposed reference architecture. The application of different pattern-oriented components leads to the variability of organisation-specific architectures. We observed that the fundamental components in an agent’s architecture include prompt engineering, memory, planning, execution engine, and RAI plugins. There are different design options for the components within those fundamental components. For example, MetaGPT employs both short-term and long-term memory, while HuggingGPT only adopts short-term memory. MetaGPT involves the design with multiple agents. Thus, the interaction with other agent components is needed in the execution engine to support cooperation. Apart from the role-based cooperation pattern employed by MetaGPT, there are two other options: voting-based cooperation and debate-based cooperation. Both MetaGPT and HuggingGPT have incorporated responsible AI patterns, including guardrails, black box recorder, explainer. For context management, neither employs a proactive goal creator that uses multimodal context information.

5 Related Work

The launch of OpenAI’s ChatGPT [67] in November 2022 set a significant milestone for GenAI, gaining over 100 million users within two months of its release. This triggered an arms race among big tech companies to develop FM-based GenAI products. Google responded with its own GenAI product, Bard⁶⁶6https://bard.google.com/chat. By February 2023, Microsoft had already integrated GPT-4 into its search engine, Bing. One notable type of FM-based systems is agents. Recently, many researchers have utilised FM as the foundation to build AI agents[11, 6]. Some studies present the architecture of their agent [22, 68]. However, these architectures often focuses only on certain components. For example, Packer et al. [22] mainly concentrate on the memory design of the agent. There’s a lack of a holistic view in architecture design, making it difficult for practitioners to design their own agents. Two recent survey papers have performed a comprehensive survey on FM-based agents and presented frameworks for designing them [2, 3]. However, both papers do not present the methodology, which might result in important studies being missed. Also, according to Bass’s definition [69], software architecture comprises software elements, relations among them, and properties of both. These frameworks only list the high-level components supporting their functionality. There is a lack of system-level thinking, with no explicit identification of software components, relationships among them, and their properties. A reference architecture is needed for practitioners to use as a template and adapt it to design their own version of agents.

6 Conclusion

In this paper, we present a pattern-oriented reference architecture which functions as an architecture design template and guides the design of FM-based agents. We evaluate the correctness and utility of our proposed reference architecture by mapping it to the architecture of two existing real-world agents. We plan to develop decision models for the selection of patterns to further assist the design of FM-based agents.

References

[1] R. Bommasani, D. A. Hudson, E. Adeli, R. Altman, S. Arora, S. von Arx, M. S. Bernstein, J. Bohg, A. Bosselut, E. Brunskill et al., “On the opportunities and risks of foundation models,” arXiv:2108.07258, 2021.
[2] Z. Xi, W. Chen, X. Guo, W. He, Y. Ding, B. Hong, M. Zhang, J. Wang, S. Jin, E. Zhou et al., “The rise and potential of large language model based agents: A survey,” arXiv:2309.07864, 2023.
[3] L. Wang, C. Ma, X. Feng, Z. Zhang, H. Yang, J. Zhang, Z. Chen, J. Tang, X. Chen, Y. Lin et al., “A survey on large language model based autonomous agents,” arXiv:2308.11432, 2023.
[4] Q. Lu, L. Zhu, X. Xu, Z. Xing, and J. Whittle, “A taxonomy of foundation model based systems through the lens of software architecture,” CAIN’24, 2024.
[5] ——, “Towards responsible ai in the era of chatgpt: A reference architecture for designing foundation model-based ai systems,” arXiv:2304.11090, 2023.
[6] S. Hong, X. Zheng, J. Chen, Y. Cheng, C. Zhang, Z. Wang, S. K. S. Yau, Z. Lin, L. Zhou, C. Ran et al., “Metagpt: Meta programming for multi-agent collaborative framework,” arXiv:2308.00352, 2023.
[7] Y. Shen, K. Song, X. Tan, D. Li, W. Lu, and Y. Zhuang, “Hugginggpt: Solving ai tasks with chatgpt and its friends in huggingface,” arXiv:2303.17580, 2023.
[8] M. Galster and P. Avgeriou, “Empirically-grounded reference architectures: a proposal,” in QoSA’11, 2011, pp. 153–158.
[9] Z. Liu, W. Yao, J. Zhang, L. Xue, S. Heinecke, R. Murthy, Y. Feng, Z. Chen, J. C. Niebles, D. Arpit et al., “Bolaa: Benchmarking and orchestrating llm-augmented autonomous agents,” arXiv:2308.05960, 2023.
[10] Z. Liu, Y. Zhang, P. Li, Y. Liu, and D. Yang, “Dynamic llm-agent network: An llm-agent collaboration framework with agent team optimization,” arXiv:2310.02170, 2023.
[11] Z. He, H. Wu, X. Zhang, X. Yao, S. Zheng, H. Zheng, and B. Yu, “Chateda: A large language model powered autonomous agent for eda,” in MLCAD’23, 2023, pp. 1–6.
[12] Y. Xia, M. Shenoy, N. Jazdi, and M. Weyrich, “Towards autonomous system: flexible modular production system enhanced with large language model agents,” arXiv:2304.14721, 2023.
[13] Y. Liu, S. Chen, H. Chen, M. Yu, X. Ran, A. Mo, Y. Tang, and Y. Huang, “How ai processing delays foster creativity: Exploring research question co-creation with an llm-based agent,” arXiv:2310.06155, 2023.
[14] S. S. Kannan, V. L. Venkatesh, and B.-C. Min, “Smart-llm: Smart multi-agent robot task planning using large language models,” arXiv:2309.10062, 2023.
[15] X. Zeng, X. Wang, T. Zhang, C. Yu, S. Zhao, and Y. Chen, “Gesturegpt: Zero-shot interactive gesture understanding and grounding with large language model agents,” arXiv:2310.12821, 2023.
[16] D. Zhao, Z. Xing, X. Xia, D. Ye, X. Xu, and L. Zhu, “Seehow: Workflow extraction from programming screencasts through action-aware video analytics,” arXiv:2304.14042, 2023.
[17] C. Zhang, K. Yang, S. Hu, Z. Wang, G. Li, Y. Sun, C. Zhang, Z. Zhang, A. Liu, S.-C. Zhu et al., “Proagent: Building proactive cooperative ai with large language models,” arXiv:2308.11339, 2023.
[18] Z. Wang, S. Mao, W. Wu, T. Ge, F. Wei, and H. Ji, “Unleashing cognitive synergy in large language models: A task-solving agent through multi-persona self-collaboration,” arXiv:2307.05300, 2023.
[19] G. Li, H. A. A. K. Hammoud, H. Itani, D. Khizbullin, and B. Ghanem, “Camel: Communicative agents for” mind” exploration of large scale language model society,” arXiv:2303.17760, 2023.
[20] A. Zhao, D. Huang, Q. Xu, M. Lin, Y.-J. Liu, and G. Huang, “Expel: Llm agents are experiential learners,” arXiv:2308.10144, 2023.
[21] R. Schumann, W. Zhu, W. Feng, T.-J. Fu, S. Riezler, and W. Y. Wang, “Velma: Verbalization embodiment of llm agents for vision and language navigation in street view,” arXiv:2307.06082, 2023.
[22] C. Packer, V. Fang, S. G. Patil, K. Lin, S. Wooders, and J. E. Gonzalez, “Memgpt: Towards llms as operating systems,” arXiv:2310.08560, 2023.
[23] Y. Jin, X. Shen, H. Peng, X. Liu, J. Qin, J. Li, J. Xie, P. Gao, G. Zhou, and J. Gong, “Surrealdriver: Designing generative driver agent simulation framework in urban contexts based on large language model,” arXiv:2309.13193, 2023.
[24] H. Zhang, W. Du, J. Shan, Q. Zhou, Y. Du, J. B. Tenenbaum, T. Shu, and C. Gan, “Building cooperative embodied agents modularly with large language models,” arXiv:2307.02485, 2023.
[25] Y. Ye, X. Cong, S. Tian, J. Cao, H. Wang, Y. Qin, Y. Lu, H. Yu, H. Wang, Y. Lin et al., “Proagent: From robotic process automation to agentic process automation,” arXiv:2311.10751, 2023.
[26] J. Wei, X. Wang, D. Schuurmans, M. Bosma, F. Xia, E. Chi, Q. V. Le, D. Zhou et al., “Chain-of-thought prompting elicits reasoning in large language models,” NeurIPS’22, vol. 35, pp. 24 824–24 837, 2022.
[27] Z. Wang, Z. Liu, Y. Zhang, A. Zhong, L. Fan, L. Wu, and Q. Wen, “Rcagent: Cloud root cause analysis by autonomous agents with tool-augmented large language models,” arXiv:2310.16340, 2023.
[28] X. Wang, J. Wei, D. Schuurmans, Q. Le, E. Chi, S. Narang, A. Chowdhery, and D. Zhou, “Self-consistency improves chain of thought reasoning in language models,” arXiv:2203.11171, 2022.
[29] S. Yao, D. Yu, J. Zhao, I. Shafran, T. L. Griffiths, Y. Cao, and K. Narasimhan, “Tree of thoughts: Deliberate problem solving with large language models,” arXiv:2305.10601, 2023.
[30] J. S. Park, J. O’Brien, C. J. Cai, M. R. Morris, P. Liang, and M. S. Bernstein, “Generative agents: Interactive simulacra of human behavior,” in UIST’23, 2023, pp. 1–22.
[31] S. Yao, J. Zhao, D. Yu, N. Du, I. Shafran, K. Narasimhan, and Y. Cao, “React: Synergizing reasoning and acting in language models,” arXiv:2210.03629, 2022.
[32] A. Madaan, N. Tandon, P. Gupta, S. Hallinan, L. Gao, S. Wiegreffe, U. Alon, N. Dziri, S. Prabhumoye, Y. Yang et al., “Self-refine: Iterative refinement with self-feedback,” arXiv:2303.17651, 2023.
[33] T. Sumers, S. Yao, K. Narasimhan, and T. L. Griffiths, “Cognitive architectures for language agents,” arXiv:2309.02427, 2023.
[34] N. Shinn, B. Labash, and A. Gopinath, “Reflexion: an autonomous agent with dynamic memory and self-reflection,” arXiv:2303.11366, 2023.
[35] P.-L. Chen and C.-S. Chang, “Interact: Exploring the potentials of chatgpt as a cooperative agent,” arXiv:2308.01552, 2023.
[36] Y. Talebirad and A. Nadiri, “Multi-agent collaboration: Harnessing the power of intelligent llm agents,” arXiv:2306.03314, 2023.
[37] W. Huang, F. Xia, T. Xiao, H. Chan, J. Liang, P. Florence, A. Zeng, J. Tompson, I. Mordatch, Y. Chebotar et al., “Inner monologue: Embodied reasoning through planning with language models,” arXiv:2207.05608, 2022.
[38] G. Sarch, Y. Wu, M. J. Tarr, and K. Fragkiadaki, “Open-ended instructable embodied agents with memory-augmented large language models,” arXiv:2310.15127, 2023.
[39] G. Chen, S. Dong, Y. Shu, G. Zhang, J. Sesay, B. F. Karlsson, J. Fu, and Y. Shi, “Autoagents: A framework for automatic agent generation,” arXiv:2309.17288, 2023.
[40] N. Nascimento, P. Alencar, and D. Cowan, “Self-adaptive large language model (llm)-based multiagent systems,” arXiv:2307.06187, 2023.
[41] J. Ruan, Y. Chen, B. Zhang, Z. Xu, T. Bao, G. Du, S. Shi, H. Mao, X. Zeng, and R. Zhao, “Tptu: Task planning and tool usage of large language model-based ai agents,” arXiv:2308.03427, 2023.
[42] Y. Kong, J. Ruan, Y. Chen, B. Zhang, T. Bao, S. Shi, G. Du, X. Hu, H. Mao, Z. Li et al., “Tptu-v2: Boosting task planning and tool usage of large language model-based agents in real-world systems,” arXiv:2311.11315, 2023.
[43] T. Xie, F. Zhou, Z. Cheng, P. Shi, L. Weng, Y. Liu, T. J. Hua, J. Zhao, Q. Liu, C. Liu et al., “Openagents: An open platform for language agents in the wild,” arXiv:2310.10634, 2023.
[44] G. Wang, Y. Xie, Y. Jiang, A. Mandlekar, C. Xiao, Y. Zhu, L. Fan, and A. Anandkumar, “Voyager: An open-ended embodied agent with large language models,” arXiv:2305.16291, 2023.
[45] C. Qian, C. Han, Y. R. Fung, Y. Qin, Z. Liu, and H. Ji, “Creator: Disentangling abstract and concrete reasonings of large language models through tool creation,” arXiv:2305.14318, 2023.
[46] M. Chen, J. Tworek, H. Jun, Q. Yuan, H. P. d. O. Pinto, J. Kaplan, H. Edwards, Y. Burda, N. Joseph, G. Brockman et al., “Evaluating large language models trained on code,” arXiv:2107.03374, 2021.
[47] S. Hamilton, “Blind judgement: Agent-based supreme court modelling with gpt,” arXiv:2301.05327, 2023.
[48] C. Qian, X. Cong, C. Yang, W. Chen, Y. Su, J. Xu, Z. Liu, and M. Sun, “Communicative agents for software development,” arXiv:2307.07924, 2023.
[49] Y. Li, Y. Zhang, and L. Sun, “Metaagents: Simulating interactions of human behaviors for llm-based task-oriented coordination via collaborative generative agents,” arXiv:2310.06500, 2023.
[50] J. Chen, S. Yuan, R. Ye, B. P. Majumder, and K. Richardson, “Put your money where your mouth is: Evaluating strategic planning and execution of llm agents in an auction arena,” arXiv:2310.05746, 2023.
[51] Q. Zhao, J. Wang, Y. Zhang, Y. Jin, K. Zhu, H. Chen, and X. Xie, “Competeai: Understanding the competition behaviors in large language model-based agents,” arXiv:2310.17512, 2023.
[52] Y. Du, S. Li, A. Torralba, J. B. Tenenbaum, and I. Mordatch, “Improving factuality and reasoning in language models through multiagent debate,” arXiv:2305.14325, 2023.
[53] T. Liang, Z. He, W. Jiao, X. Wang, Y. Wang, R. Wang, Y. Yang, Z. Tu, and S. Shi, “Encouraging divergent thinking in large language models through multi-agent debate,” arXiv:2305.19118, 2023.
[54] H. Chen, W. Ji, L. Xu, and S. Zhao, “Multi-agent consensus seeking via large language models,” arXiv:2310.20151, 2023.
[55] X. Tang, A. Zou, Z. Zhang, Y. Zhao, X. Zhang, A. Cohan, and M. Gerstein, “Medagents: Large language models as collaborators for zero-shot medical reasoning,” arXiv:2311.10537, 2023.
[56] Q. Lu, L. Zhu, X. Xu, J. Whittle, D. Zowghi, and A. Jacquet, “Responsible ai pattern catalogue: A collection of best practices for ai governance and engineering,” CSUR, 2022.
[57] S. U. Lee, H. Perera, B. Xia, Y. Liu, Q. Lu, L. Zhu, O. Salvado, and J. Whittle, “Qb4aira: A question bank for ai risk assessment,” arXiv:2305.09300, 2023.
[58] B. Xia, Q. Lu, L. Zhu, S. U. Lee, Y. Liu, and Z. Xing, “From principles to practice: An accountability metrics catalogue for managing ai risks,” arXiv:2311.13158, 2023.
[59] M. Abbasian, I. Azimi, A. M. Rahmani, and R. Jain, “Conversational health agents: A personalized llm-powered agent framework,” arXiv:2310.02374, 2023.
[60] Z. Yang, S. S. Raman, A. Shah, and S. Tellex, “Plug in the safety chip: Enforcing constraints for llm-driven robot agents,” arXiv:2309.09919, 2023.
[61] Z. Wang, S. Cai, A. Liu, X. Ma, and Y. Liang, “Describe, explain, plan and select: Interactive planning with large language models enables open-world multi-task agents,” arXiv:2302.01560, 2023.
[62] S. Schwartz, A. Yaeli, and S. Shlomov, “Enhancing trust in llm-based ai automation agents: New considerations and future challenges,” arXiv:2308.05391, 2023.
[63] B. Xia, D. Zhang, Y. Liu, Q. Lu, Z. Xing, and L. Zhu, “Trust in software supply chains: Blockchain-enabled sbom and the aibom future,” arXiv:2307.02088, 2023.
[64] B. Xia, T. Bi, Z. Xing, Q. Lu, and L. Zhu, “An empirical study on software bill of materials: Where we stand and the road ahead,” arXiv:2301.05362, 2023.
[65] B. Y. Lin, Y. Fu, K. Yang, P. Ammanabrolu, F. Brahman, S. Huang, C. Bhagavatula, Y. Choi, and X. Ren, “Swiftsage: A generative agent with fast and slow thinking for complex interactive tasks,” arXiv:2305.17390, 2023.
[66] M. Nafreen, S. Bhattacharya, and L. Fiondella, “Architecture-based software reliability incorporating fault tolerant machine learning,” in RAMS’20, 2020, pp. 1–6.
[67] OpenAI, “Gpt-4 technical report,” 2023.
[68] F. Costantino, S. Colabianchi, and A. Tedeschi, “Human-technology integration with industrial conversational agents: A conceptual architecture and a taxonomy for manufacturing,” Available at SSRN 4285483.
[69] L. Bass, P. Clements, and R. Kazman, Software architecture in practice. Addison-Wesley Professional, 2003.

Towards Responsible Generative AI: A Reference Architecture for Designing Foundation Model based Agents