SAM is a reference implementation of Tool4AI project https://github.com/vishalmysore/Tools4AI Basically showcasing how straight forward it is to build action oriented applications in 100% Java. In addition to action model SAM can be used as an autonomous agent by utilizing action scripts which are a specialized form of these intelligent systems, designed specifically for enterprise AI applications. While retaining the core capabilities of autonomy and adaptability, scripts can operates within a controlled framework, executing tasks and making decisions that align with predefined business rules and objectives
- Action Processor: Execute actions based on prompt (OpenAI, Gemini, Anthropic) here
- Image Processor: Trigger actions based on images here
- Autonomous Agent: Execute tasks based on scripts here
- Image to Text: Convert images to texthere
- Image to Pojo: Convert images to Pojo here
- Image to Json: Convert images to Json here
- Image to XML: Convert images to XMLhere
- Selenium Actions: Execute actions based on Selenium script here
- Spring Integration: Integrate with Spring Boot here
- Script Processor: Execute tasks based on scripts here
- Http Actions: Execute actions based on HTTP requests here
- Prompt Processor: Execute tasks based on prompts here
- Prompt Transformer: Convert prompts into various formats ( Java , Json , XML) here
- Custom GSON: Convert special values in prompts here
- Subprompt Processing: Break prompts into multiple subprompts here
- Human In Loop Validation here
- Explainablity here
- Kubernets Integration [here]( here
- Multi Command Processor here
- Hallucination Detector here
- Bias Detector here
- Database Actions
- Tibco Actions
- Custom Actions
- Custom HTTP Actions
- Custom Shell Actions
- Custom Swagger Actions
- Custom OpenAI Actions
- Custom Selenium Actions
Clone this project and then
mvn clean install
Action Processors are responsible for taking actions based on prompt. Actions can be written in Java Methods, Or could be
HTTP rest end points , could be Shell scripts or could be loaded directly from the Swagger HTTP configurations.
Inside Main.java
these 2 lines will predict the action and execute it using Gemini , you dont have to worry
about specifying the action, the action will be picked up based on Natural Language Processing semantic mapping
and will be executed.
String cookPromptSingleText = "My friends name is Vishal ," +
"I dont know what to cook for him today.";
GeminiActionProcessor processor = new GeminiActionProcessor();
String result = (String)processor.processSingleAction(cookPromptSingleText);
log.info(result);
This code will use OpenAI to predict the action and execute it
OpenAiActionProcessor opeAIprocessor = new OpenAiActionProcessor();
Sring result = (String)opeAIprocessor.processSingleAction('My friends name is Vishal ,he lives in tornto.I want save this info locally');
System.out.println(result);
Create custom action by using @Predict annotation and @Action annotation. Parameters of the method can be anything and any number of parameters are allowed You need to make sure parameters have meaningful name.
@Predict
public class SimpleAction {
@Action(description = "Provide persons name and then find out what does that person like")
public String whatFoodDoesThisPersonLike(String name) {
if("vishal".equalsIgnoreCase(name))
return "Paneer Butter Masala";
else if ("vinod".equalsIgnoreCase(name)) {
return "aloo kofta";
}else
return "something yummy";
}
}
or
@Log
@Predict
public class SearchAction {
@Action(description = "Search the web for information")
public String googleSearch(String searchString, boolean isNews) {
log.info(searchString+" : "+isNews);
HttpResponse<String> response = Unirest.post("https://google.serper.dev/search")
.header("X-API-KEY", PredictionLoader.getInstance().getSerperKey())
.header("Content-Type", "application/json")
.body("{\"q\":\""+searchString+"\"}")
.asString();
String resStr = response.getBody().toString();
return resStr;
}
}
Or add actions in Shell or HTTP config files
Trigger actions based directly on images! Yes, you read that right โ with the power of Java, we can now integrate function calling with image inputs.
Imagine a system so advanced that it can:
๐ Call an ambulance immediately after detecting an image of a car accident.
๐ณ Suggest recipes the moment it sees images of vegetables.
๐ฎ Alert the police when it captures an image of a traffic signal violation.
๐ Contacts the fire department immediately if it "sees" fire.
public class ImageActionExample {
public static void main(String[] args) throws AIProcessingException {
GeminiImageActionProcessor processor = new GeminiImageActionProcessor();
String imageDisription = processor.imageToText(args[0]);
GeminiV2ActionProcessor actionProcessor = new GeminiV2ActionProcessor();
Object obj = actionProcessor.processSingleAction(imageDisription);
String str = actionProcessor.summarize(imageDisription+obj.toString());
System.out.println(str);
}
}
If you have a complete script written in English , ScriptProcessor will process the script and provide consolidated results
ScriptProcessor script = new ScriptProcessor();
ScriptResult result = script.process("complexTest.action");
String resultsString = script.summarize(result)
log.info(resultsString)
Sample script is here
can you reserve the flight for Vishal from Toronto to Bangalore for 3 Days on 7th december
If flight booking is successful, can you reserve the car for Vishal from Bangalore to Toronto for 10 Days on 17th december
if car booking is successful and flight cost are less than $1000 then book the sight seeing attraction called 5 star palace
if car booking is successful and flight cost are more than $1000 then book the sight seeing attraction called peanut palace
You can add Human In Loop validation , Explainablity , Multi Command Processor, Hallucination Detector , Bias Detector , Database and Tibco actions as well please look at https://github.com/vishalmysore/Tools4AI for more information
Prompt Transformer, a core feature in the Tools4AI project, simplifies data transformation tasks. It effortlessly converts prompts into various formats like Java POJOs, JSON strings, CSV files, and XML. By enabling direct conversion of prompts into domain-specific objects, Prompt Transformer streamlines data processing tasks. It offers flexibility and ease of use for transforming data structures to meet diverse needs in modern applications.
Lets take the first scenario where you want to conver the prompt directly into Java Bean or Pojo
PromptTransformer builder = new PromptTransformer();
String promptTxt ="Sachin Tendulkar is very good cricket player, " +
"he joined the sports on 24032022, he has played 300 matches " +
"and his max score is 400";
//Convert the prompt to Pojo
Player player = (Player)builder.transformIntoPojo(promptTxt, Player.class.getName(),"Player","create player pojo");
log.info(player.toString());
The above will convert the prompt into this simple Pojo
import lombok.*;
import lombok.extern.java.Log;
import java.util.Date;
@NoArgsConstructor
@AllArgsConstructor
@Getter
@Setter
@EqualsAndHashCode
@ToString
public class Player {
int matches;
int maxScore;
String firstName;
String lastName;
Date dateJoined;
}
The transformer can also convert into complex Pojo ( where there are multiple objects inside the Pojo)
PromptTransformer builder = new PromptTransformer();
promptTxt = "can you book Maharaja restaurant in " +
"Toronto for 4 people on 12th may , I am Vishal ";
//Convert the prompt to Complex Pojo
RestaurantPojo pojo = (RestaurantPojo)builder.transformIntoPojo(promptTxt, RestaurantPojo.class.getName(),"RestaurantPojo","Build the pojo for restaurant");
log.info(pojo.toString());
This will create the Pojo Object of RestaurantPojo and also populate the internal objects ( not just primitive)
public class RestaurantPojo {
String name;
int numberOfPeople;
//Pojo inside Pojo
RestaurantDetails restaurantDetails;
boolean cancel;
String reserveDate;
If you expect some custom objects like Date etc in the prompt you can have custom Gson Builder
//Using Custom GSON to convert special values
GsonBuilder gsonBuilder = new GsonBuilder();
gsonBuilder.registerTypeAdapter(Date.class, new DateDeserializer("dd MMMM yyyy"));
Gson gson = gsonBuilder.create();
PromptTransformer customBuilder = new PromptTransformer(gson);
String prompt = "Sachin Tendulkar is very good cricket player, he joined the sports on 12 May 2008," +
"he has played 300 matches and his max score is 400";
player = (Player)customBuilder.transformIntoPojo(prompt, Player.class.getName(),"Player","create player pojo");
log.info(player.toString());
This will use Custom Date Serializer
public class DateDeserializer implements JsonDeserializer<Date> {
private final DateFormat dateFormat;
public DateDeserializer(String format) {
this.dateFormat = new SimpleDateFormat(format, Locale.ENGLISH);
}
@Override
public Date deserialize(JsonElement json, Type typeOfT, JsonDeserializationContext context) throws JsonParseException {
try {
return dateFormat.parse(json.getAsString().replaceAll("(st|nd|rd|th),", ","));
} catch (ParseException e) {
throw new JsonParseException(e);
}
}
}
If you want to convert the Prompt into the Json String
prompt = "Sachin Tendulkar is very good cricket player, he joined the sports on 12 May 2008," +
"he has played 300 matches and his max score is 400";
//Extract Json from the prompt
String jsonString = "{\"lastName\":\"String\",\"firstName\":\"String\"}";
jsonString = builder.transformIntoJson(jsonString,prompt,"player","get player details");
log.info(jsonString);
The result will be
"{\"lastName\":\"Tendulkar\",\"firstName\":\"Sachin\"}";
You can extract custom parameters from the prompt or can convert the entire prompt into JSON
Once you have the JSON you can convert the JSON into XML
JSONObject jsonObject = new JSONObject(jsonNode.toString());
String xmlString = XML.toString(jsonObject);
You can have a really long prompt with multiple actions , those prompts will be broken down in multiple subprompts
@Log
public class ActionExample {
public static void main(String[] args) throws AIProcessingException {
ActionProcessor processor = new ActionProcessor();
String multiPrmpt = "hey I am in Toronto do you think i can go out without jacket," +
" also save the weather information , City location and your suggestion in file, " +
"also include places to see";
String processed = processor.processMultipleActionDynamically
(multiPrmpt,
new LoggingHumanDecision(),
new LogginggExplainDecision());
log.info(processed);
}
}
Tools4AI will create a JSon from the prompt
{
"prmpt": [
{
"id": "1",
"subprompt": "What is the weather in Toronto?",
"depend_on": null
},
{
"id": "2",
"subprompt": "Do I need a jacket in this weather?",
"depend_on": "1"
},
{
"id": "3",
"subprompt": "Save the weather information, city location, and suggestion in a file.",
"depend_on": "2"
},
{
"id": "4",
"subprompt": "Suggest some places to see in Toronto.",
"depend_on": "3"
}
]
}
After that each Subprompt will be processd independently or in dependency order, if the prompts are dependent on each other then the result from previous prompt will be fed into the next one.
https://javadoc.io/doc/io.github.vishalmysore/tools4ai/latest/com/t4a/api/AIAction.html
https://repo1.maven.org/maven2/io/github/vishalmysore/tools4ai/