
Guidance for Successful Generative AI Prototypes & Projects
Tips on how to select a use case and design a project for a generative AI prototype or project, based on experience with several financial service clients
Steven Brucato
Amazon Employee
Published Oct 18, 2024
I work with many clients who have evaluated generative AI for various use-cases here at AWS, mainly in financial services. Many have completed initial prototypes and are finding that transitioning these into production is often more challenging than anticipated while others have found more rapid success. Based on those experiences, I’ve gathered some best practices to consider for your generative AI prototype or production project. Apologies if some of these seem obvious, but I’ve found it’s often useful to reassert the basics especially for senior business decision makers new to generative AI.
1. Work backwards from a real need for a real user
• Identify a data-driven use case that is a challenge for your business today
• Work with the primary stakeholder experiencing this challenge and work backwards from their needs
2. Your data is your differentiator
• Select a use-case that is data-driven
• Identify data sets that that differentiate your business – select key internal data assets over publicly available data
• Especially consider unstructured data (e.g. PDFs, emails, audio, video, images, ….) as generative AI offers programmatic extraction and interpretation where traditional analytics methods struggle
• Don’t be afraid to go big on data set size. A use case that can scale to larger data sets will demonstrate more value.
3. List the questions and expected answers your solution should support
• It’s likely a given that your PoC will include a natural language interface for an end user. List the questions that your solution should be able to answer as well as the expected answer for each
• Identify the format of the answer – text, graphical, numeric, …
• Identify the likely data sources required for each question to provide accurate answers
• Where quantitative data inputs or quantitative responses are expected, be prepared to put in extra effort on prompt engineering or embedded code – LLMs are less-great with numeric computations than with text processing. Useful reference here.
4. Define success criteria & create metrics
• Define what a successful prototype looks like and define measurable KPI. Avoid qualitative benefits, strive for measurable KPI quantitative benefits.
• KPIs might include cost reduction, revenue improvement, accuracy improvement, false positive and false negative reduction, etc….
• Establish a cross-disciplinary team that will evaluate results and measure the results of the PoC
5. Consider production implications early
• A major cause of slow to transition into production is data security. Consider secure data access and data privacy early – anticipate how these will be addressed in production
• For regulated industries such as financial services, you will likely have to document several aspects of your solution for compliance if you transition your solution into production. Such measures include Fairness, Explainability, Privacy & Security, Controllability, Veracity & Robustness, Governance, and Transparency. More info here.
• There are also current and emerging regulations to be considered such as the EU AI Act as well as many data privacy policies which apply
6. Prepare to put focused effort into “behind-the-scenes” prompt engineering
• When using a pre-trained LLM with access to one or more data sets, you will need to provide your solution the context and detailed guidance. These instructions are inserted along with each prompt from a user.
• Defining a role, context, and other detailed guidance in these injected prompts are critical to getting accurate and consistent answers – an avoiding select topic and hallucinations
• Agentic flows, where the LLM is providing some reasoning in how to break down complex questions, will need special attention and testing. You may find some unexpected behaviors initially.
In working with clients, I provide them with guidance on these points and often provide a spreadsheet to help them get started with their thinking. A list of applicable regulatory policies is also useful. Below is a list of AWS references you may find useful:
Use Case: Portfolio Analysis
Primary User: Portfolio Manager
Success Criteria: 1.) Improved returns and reduced risk measured at 3 mos, 2.) reduction in time spent in portfolio analysis (hrs/week), 3.) improved justification of investment strategies in clear text with supporting quant data (per expert review)
Prompt Guidance: You are an expert in portfolio analysis and design. You will combine quantitative and qualitative data to provide answers within the context of a financial advisor. You are aware of applicable financial regulations and will cite these as necessary as part of your responses. You are aware of internal corporate guidance including investment policies and data privacy policies and will respect these limitations in all of your responses - citing the applicable policy where appropriate.
Questions | Response & Format | Data Source |
---|---|---|
What are the notable trends for the sectors contained in the portfolio? | Textual summary, with important quantitative numbers included in the text | List of stocks in the portfolio, historical trade data, doc repository including expert calls, historical & current news, SEC filings, board mtg minutes, broker content, internal research, applicable regulatory documents, internal policy documents, internal data privacy policies |
What are the most significant risks facing the stocks in this portfolio? | Textual response with supporting metrics | same |
What are macro-economic factors that are most significant to the portfolio’s results in the past 12 mos?What are macro-economic factors that are most significant to the portfolio’s results in the past 12 mos? | Textual response with supporting metrics | same |
According to the information in the document repository, which sector will perform the best in the coming year? | Textual response with supporting metrics | same |
Which stocks not included in the current portfolio, but within the same sectors, should be considered for this portfolio given the investment goals? Justify your answers. | List of ticker symbols with supporting arguments and key metrics | + Security master of allowable stock for this portfolio |
What stock sectors are represented in this portfolio? | List of sectors with supporting arguments and key metrics | same |
Which stock in this portfolio is contributing the smallest gains over the last 4 years? | A ticker symbol with supporting arguments and key metrics | same |
Which are contributing the largest gains? | List of ticker symbols with supporting arguments and key metrics | same |
Which stock in this portfolio has the highest volatility over the last 6 mos? | A ticker symbol with supporting arguments and key metrics | same |
Which stocks would you add or remove to reduce volatility to a minimum given any other stocks in the same sectors as the those currently contained in the portfolio? | List of ticker symbols with supporting arguments and key metrics | +Investment goals for portfolio (document) |
Which stocks would you add and remove to maximize return regardless of volatility, given the investment goals? | List of ticker symbols with supporting arguments and key metrics | same |
Which stocks have financial results over the last 6 mos that contradict the expert analyses from 6 mos ago? | List of authors with summary of their correct predictions, list of applicable tickers, quant data | same |
Summarize the information of the most accurate expert analyses. | Textual summary (including quant data) for each authorTextual summary (including quant data) for each author | same |
Of the most accurate experts, what sectors or stock are likely to perform best over the next 6 mos based on information from those experts? | List of sectors with supporting arguments, citing author content and summarizing their reasons | same |
What has been the performance of that sector over the last year? Display the result in a line chart. | Line chart for sector with appropriate labeling | same |
Adjust the quantitative model predicted performance of the portfolio based on the information contained in the document repository and display the result in a line chart with OHLC predictions. Provide a summary of magnitude of the adjustments as well as summary of what data elements from the document had the most bearing on the adjustments.Adjust the quantitative model predicted performance of the portfolio based on the information contained in the document repository and display the result in a line chart with OHLC predictions. Provide a summary of magnitude of the adjustments as well as summary of what data elements from the document had the most bearing on the adjustments. | Line chart of original quant predictions, the updated predictions, and a textual summary citing what document content impacted the results and along with the quantitative magnitude of each factor | + quant model predictions for the portfolio |
List the stocks in the portfolio for which the experts and social media disagree on future valuation and summarize the arguments of each. | List of ticker symbols with supporting arguments and key metrics | +social media data |
List the stocks which are most exposed to potential retail activist trader price impacts and justify your reasons. | List of ticker symbols with supporting arguments and key metrics | same |
Any opinions in this post are those of the individual author and may not reflect the opinions of AWS.