Author: saqibkhan

Means-Ends Analysis in Artificial Intelligence
We have studied the strategies which can reason either in forward or backward, but a mixture of the two directions is appropriate for solving a complex and large problem. Such a mixed strategy, make it possible that first to solve the major part of a problem and then go back and solve the small problems arise during combining the big parts of the problem. Such a technique is called Means-Ends Analysis.

Means-Ends Analysis is a problem-solving technique used in Artificial Intelligence for limiting search in AI programs. It is a mixture of Backward and forward search techniques. The MEA technique was first introduced in 1961 by Allen Newell and Herbert A. Simon was in their problem-solving computer program, which was named as General Problem Solver (GPS). The MEA analysis process centered on the evaluation of the difference between the current state and the goal state.

How means-ends analysis Works?

The means-ends analysis process can be applied recursively for a problem. It is a strategy to control search in problem-solving. Following are the main Steps which describes the working of MEA technique for solving a problem.
- First, evaluate the difference between the Initial State and the final State.
- Select the various operators that can be applied to each difference.
- Apply the operator at each difference, which reduces the difference between the current state and the goal state.
Operator Subgoaling

In the MEA process, we detect the differences between the current state and the goal state. Once these differences occur, we can apply an operator to reduce the differences. But sometimes it is possible that an operator cannot be applied to the current state. So we create the subproblem of the current state, in which operators can be applied, such type of backward chaining in which operators are selected, and then sub-goals are set up to establish the preconditions of the operator is called Operator Subgoaling.

Algorithm for Means-Ends Analysis:

Let’s we take Current state as CURRENT and Goal State as GOAL, then following are the steps for the MEA algorithm.
- Step 1: Compare CURRENT to GOAL, if there are no differences between both then return Success and Exit.
- Step 2: Else, select the most significant difference and reduce it by doing the following steps until the success or failure occurs.
  1. Select a new operator O which is applicable for the current difference, and if there is no such operator, then signal failure.
  2. Attempt to apply operator O to CURRENT. Make a description of two states.
    i) O-Start, a state in which O?s preconditions are satisfied.
    ii) O-Result, the state that would result if O were applied In O-start.
  3. If
    (First-Part <—— MEA (CURRENT, O-START)
    And
    (LAST-Part <—– MEA (O-Result, GOAL), are successful, then signal Success and return the result of combining FIRST-PART, O, and LAST-PART.
The above-discussed algorithm is more suitable for a simple problem and not adequate for solving complex problems.

Example of Mean-Ends Analysis:

Let’s take an example where we know the initial state and goal state as given below. In this problem, we need to get the goal state by finding differences between the initial state and goal state and applying operators.

Solution:

To solve the above problem, we will first find the differences between the initial states and goal states, and for each difference, we will generate a new state and apply the operators. The operators we have for this problem are:
- Move
- Delete
- Expand
1. Evaluating the initial state: In the first step, we will evaluate the initial state and will compare the initial and Goal state to find the differences between both states.

2. Applying Delete operator: As we can check the first difference is that in goal state there is no dot symbol which is present in the initial state, so, first we will apply the Delete operator to remove this dot.

3. Applying Move Operator: After applying the Delete operator, the new state occurs which we will again compare with goal state. After comparing these states, there is another difference that is the square is outside the circle, so, we will apply the Move Operator.

4. Applying Expand Operator: Now a new state is generated in the third step, and we will compare this state with the goal state. After comparing the states there is still one difference which is the size of the square, so, we will apply Expand operator, and finally, it will generate the goal state.

Uses of Means-Ends Analysis

Artificial intelligence (AI) systems commonly employ means-end analysis. Since the computational procedures involved in the analysis mimic some features of human cognition and problem-solving abilities, it is a crucial goal-based problem-solving technique in the development of AI systems that behave like humans. Using a combination of backward and forward search strategies, AI systems also employ means-ends analysis to limit searches within programs by assessing the difference between the problem’s current state and the desired state.

Implementation

We will now do the simulation of moving from a start number to a goal number by applying operations like add, subtract, or multiply, that will helps in understanding teh MEA.

Code:
1. # Importing libraries
2. import networkx as nx
3. import matplotlib.pyplot as plt
4. import time
6. # basic graph function
7. def createMyGraph():
8. g = nx.Graph()
10. # These are the places (nodes)
11. placeNames = [‘Start’, ‘point_A’, ‘point_B’, ‘point_C’, ‘point_D’, ‘point_E’, ‘Goal’]
12. for p in placeNames:
13. g.add_node(p)
15. # now putting the roads (edges) with distances
16. g.add_weighted_edges_from([
17. (‘Start’, ‘point_A’, 2),
18. (‘Start’, ‘point_B’, 4),
19. (‘point_A’, ‘point_C’, 2),
20. (‘point_B’, ‘point_D’, 1),
21. (‘point_C’, ‘point_E’, 3),
22. (‘point_D’, ‘point_E’, 1),
23. (‘point_E’, ‘Goal’, 2)
24. ])
26. return g
28. # used to find how far a node is from the goal
29. def distanceGuess(graphx, frm, to):
30. try:
31. dist = nx.shortest_path_length(graphx, frm, to, weight=’weight’)
32. except:
33. dist = float(‘inf’)
34. return dist
36. # The actual means-end logic goes here
37. def meaSolver(grph, st, gl):
38. cur = st
39. trail = [cur]
40. counter = 0
42. print(“=== STARTING MEA ===\n”)
44. while cur != gl:
45. neigh = list(grph.neighbors(cur))
46. best = None
47. bestH = float(‘inf’)
49. for x in neigh:
50. guess = distanceGuess(grph, x, gl)
51. if guess < bestH:
52. best = x
53. bestH = guess
55. if best is None:
56. print(“No more moves :(“)
57. return trail
59. print(f”Move {counter+1}: {cur} –> {best} (h={bestH})”)
60. cur = best
61. trail.append(cur)
62. showGraph(grph, trail, cur, gl, counter)
63. time.sleep(1)
64. counter += 1
66. print(“\nReached the goal “)
67. return trail
69. # This draws the graph at each step
70. def showGraph(g, visited, now, goal, stepN):
71. pos = nx.spring_layout(g, seed=1)
72. plt.figure(figsize=(8,6))
73. nx.draw(g, pos, with_labels=True, node_size=700, node_color=’lightblue’, edge_color=’grey’)
75. nx.draw_networkx_nodes(g, pos, nodelist=[now], node_color=’orange’, node_size=800)
76. nx.draw_networkx_nodes(g, pos, nodelist=[goal], node_color=’green’, node_size=800)
78. # show path so far
79. path_edges = list(zip(visited[:-1], visited[1:]))
80. nx.draw_networkx_edges(g, pos, edgelist=path_edges, edge_color=’blue’, width=2)
82. labels = nx.get_edge_attributes(g, ‘weight’)
83. nx.draw_networkx_edge_labels(g, pos, edge_labels=labels)
85. plt.title(f”Step {stepN+1}: at {now}”)
86. plt.axis(‘off’)
87. plt.pause(1)
88. plt.close()
90. # running it
91. myG = createMyGraph()
92. res = meaSolver(myG, ‘Start’, ‘Goal’)
94. print(“\nFinal Path:”)
95. for i, pt in enumerate(res):
96. print(f”{i+1}: {pt}”)
97. <textarea>
Output:

Move 1: Start -> point_B (h=4)

Move 2: point_B -> point_D (h=3)

Move 3: point_D -> point_E (h=2)

Move 4: point_E -> Goal (h=0)
```
Reached the goal
Final Path:
1: Start
2: point_B
3: point_D
4: point_E
5: Goal
```
Starting with the node marked “Start,” the algorithm analyzes every nearby node at each step and picks the one that seems nearest to the target depending on the estimated shortest path (heuristic). In the given instance, the path taken was Start-> point_A->point_C-> point_E-> Goal. This indicates that the agent consistently picked the direction with the least expected remaining cost, hence methodically closing the gap to the target. The graph allows easy observation of the decision-making process as the process is visibly traced with each movement emphasized.
January 20, 2026
Hill Climbing Algorithm in Artificial Intelligence
Hill climbing algorithm is a local search algorithm that continuously moves in the direction of increasing elevation/value to find the peak of the mountain or the best solution to the problem. It terminates when it reaches a peak value where no neighbor has a higher value.

It is a technique which is used for optimizing the mathematical problems. One of the widely discussed examples of Hill climbing algorithm is Traveling-salesman Problem in which we need to minimize the distance traveled by the salesman.

It is also called greedy local search as it only looks to its good immediate neighbor state and not beyond that. A node of hill climbing algorithm has two components which are state and value. Hill Climbing is mostly used when a good heuristic is available. In this algorithm, we don’t need to maintain and handle the search tree or graph, as it only keeps a single current state.

Advantages of Hill climb algorithm:

The merits of the Hill Climbing algorithm are given below.
- The first part of the paper is based on Hill’s diagrams that can easily put things together, ideal for complex optimization problem. A hill climbing algorithm is first invention of hill climbers.
- It uses much less RAM for the current problem state and the solutions located around it than comparing the algorithm to a tree search method, which will require inspecting the entire tree. Consequently, this reduces the total memory resources to be used. Space is what matters; solutions should occupy a convenient area to consume as little memory as possible.
- When it comes to the acceleration of the hill up, most of the time it brings a closure in the local maximum straight away. This is the route if having quickly getting a quick solution, outshining acquiring a global maximum, is an incentive.
Disadvantages of Hill Climbing Algorithm
- Concerning hill climbing, it seems that some solutions do not find the optimum point and remain stuck at a local peak, particularly where the optimization needs to be done in complex environments with many objective functions.
- It is also superficial because it just seeks for the surrounding solution and does not get farther than that. It could be on a wrong course which is based on a locally optimal solution, and consequently Godbole needs to move far away from current position in the first place.
- It is highly likely that the end result will largely depend on the initial setup and state, with a precedent of it being the most sensitive factor. It implies that in this case, time is the perimeter of the sphere within which people develop their skills dynamically, determining their success.
Features of Hill Climbing:

The following are some main features of the Hill Climbing Algorithm:
- Generate and Test variant: Hill Climbing is a variant of the Generate and Test method. The Generate and Test method produces feedback that helps to decide which direction to move in the search space.
- Greedy approach: Hill-climbing algorithm search moves in the direction that optimizes the cost.
- No backtracking: It does not backtrack the search space, as it does not remember the previous states.
- Deterministic Nature: Hill Climbing is a deterministic optimization algorithm, which means that given the same initial conditions and the same problem, it will always produce the same result. There is no randomness or uncertainty in its operation.
- Local Neighborhood: Hill Climbing is a technique that operates within a small area around the current solution. It explores solutions that are closely related to the current state by making small, gradual changes. This approach allows it to find a solution that is better than the current one, although it may not be the global optimum.
State-space Diagram for Hill Climbing:

The state-space landscape is a graphical representation of the hill-climbing algorithm which is showing a graph between various states of algorithm and Objective function/Cost.

On Y-axis we have taken the function which can be an objective function or cost function, and state-space on the x-axis. If the function on Y-axis is cost then, the goal of search is to find the global minimum and local minimum. If the function of Y-axis is Objective function, then the goal of the search is to find the global maximum and local maximum.

Different regions in the state space landscape:
- Local Maximum: Local maximum is a state which is better than its neighbor states, but there is also another state which is higher than it.
- Global Maximum: Global maximum is the best possible state of state space landscape. It has the highest value of objective function.
- Current State: It is a state in a landscape diagram where an agent is currently present.
- Flat local maximum: It is a flat space in the landscape where all the neighboring states of the current state have the same value.
- Shoulder: It is a plateau region that has an uphill edge.
Types of Hill Climbing Algorithm:
- Simple hill Climbing:
- Steepest-Ascent hill-climbing:
- Stochastic hill Climbing:
1. Simple Hill Climbing:

Simple hill climbing is the simplest way to implement a hill-climbing algorithm. It only evaluates the neighbor node state at a time and selects the first one that optimizes the current cost, and sets it as the current state. It only checks its one successor state, and if it finds a better state than the current state, then it moves, else it remains in the same state. This algorithm has the following features:
- Less time-consuming
- A less optimal solution, and the solution is not guaranteed
Algorithm for Simple Hill Climbing:
- Step 1: Evaluate the initial state. If it is the goal state, then return success and Stop.
- Step 2: Loop until a solution is found or there is no new operator left to apply.
- Step 3: Select and apply an operator to the current state.
- Step 4: Check the new state:
  1. If it is the goal state, then return success and quit.
  2. Else if it is better than the current state, then assign the new state as the current state.
  3. Else if not better than the current state, then return to step 2.
- Step 5:
2. Steepest-Ascent hill climbing:

The steepest ascent algorithm is a variation of the simple hill climbing algorithm. This algorithm examines all the neighboring nodes of the current state and selects one neighbor node that is closest to the goal state. This algorithm consumes more time as it searches for multiple neighbors.

Algorithm for Steepest-Ascent hill climbing:
- Step 1: Evaluate the initial state. If it is the goal state, then return success and stop; else make the current state as initial state.
- Step 2: Loop until a solution is found or the current state does not change.
  1. Let SUCC be a state such that any successor of the current state will be better than it.
  2. For each operator that applies to the current state:
    
    Apply the new operator and generate a new state.
    
    Evaluate the new state.
    
    If it is the goal state, then return it and quit; else, compare it to the SUCC.
    
    If it is better than SUCC, then set the new state as SUCC.
    
    If the SUCC is better than the current state, then set the current state to SUCC.
- Step 5:
3. Stochastic hill climbing:

Stochastic hill climbing does not examine for all its neighbor before moving. Rather, this search algorithm selects one neighbor node at random and decides whether to choose it as a current state or examine another state.

Hybrid Methods

Although Hill Climbing Algorithm is a very efficient and useful local search technique, there are several drawbacks including plateaus, ridges, and local maxima trapping. To surpass these limitations and produce better optimization solutions, experts and practitioners usually employ hybrid approaches combining hill climbing with other algorithms addressing its shortcomings.

1. Hill climbing combined with simulated annealing

Controlled randomness is introduced into the search process using a technique called simulated annealing. Simulated annealing rejects states with poorer fitness according to a random probability function; hill climbing only rejects ones that improve fitness (in this case, a higher value). Simulated annealing can push through a local peak if hill climbing reaches one by accepting a small value drop, and allow the search to explore other areas of the solution space.

2. Mountain Climbing and Genetic Algorithms

Mutation, crossover, and selection are three types of techniques used by genetic algorithms (GAs). Hill climbing is one local optimizer in genetic algorithms. After a new one is created with crossover and mutation, hill climbing can be employed to iterate or " polish” that solution for enhancement. Together, GAs provide great better quality solution and faster convergence time; hill climbing local search together can.

3. Memetic Algorithms’ Hill Climbing

Sophisticated hybrids of evolutionary algorithms, memetic algorithms combine other kinds of individual learning or local search with local refining methods like hill climbing. Before it enters the next generation, every candidate solution in Memetic Algorithms is subject to local improvement by hill climbing. Since this method is balanced, the search process is more intelligent: the GA permits exploration (by random search for mutations and crossover) and the hill climbing provides the exploitation (by local search).

Problems in Hill Climbing Algorithm:

1. Local Maximum: A local maximum is a peak state in the landscape which is better than each of its neighboring states, but there is another state also present which is higher than the local maximum.

Solution: Backtracking technique can be a solution of the local maximum in state space landscape. Create a list of the promising path so that the algorithm can backtrack the search space and explore other paths as well.

2. Plateau: A plateau is the flat area of the search space in which all the neighbor states of the current state contain the same value, because of this algorithm does not find any best direction to move. A hill-climbing search might be lost in the plateau area.

Solution: The solution for the plateau is to take big steps or very little steps while searching, to solve the problem. Randomly select a state which is far away from the current state so it is possible that the algorithm could find non-plateau region.

3. Ridges: A ridge is a special form of the local maximum. It has an area which is higher than its surrounding areas, but it has a slope, and cannot be reached in a single move.

Solution: With the use of bidirectional search, or by moving in different directions, we can improve this problem.

Simulated Annealing:

A hill-climbing algorithm that never makes a move towards a lower value is guaranteed to be incomplete because it can get stuck on a local maximum. And if algorithm applies a random walk, by moving a successor, then it may complete but not efficient. Simulated Annealing is an algorithm which yields both efficiency and completeness.

In mechanical terms, Annealing is a process of hardening a metal or glass to a high temperature, then cooling gradually, so this allows the metal to reach a low-energy crystalline state. The same process is used in simulated annealing, in which the algorithm picks a random move, instead of picking the best move. If the random move improves the state, then it follows the same path. Otherwise, the algorithm follows the path that has a probability of less than 1, or it moves downhill and chooses another path.

Applications of Hill Climbing Algorithm

The hill-climbing technique has seen widespread usage in artificial intelligence and optimization, respectively. It methodically solves those problems via coupled research activities by systematically testing options and picking out the most appropriate one. Some of the applications are as follows:

Some of the application are as follows:

1. Machine Learning:

Fine tuning of machine learning models frequently is doing the hyper parameter optimization that provides the model with guidance on how it learns and behaves. Another exercise which serves the same purpose is hill training. Gradual adjustment of hyperparameters and their evaluation according to the respectively reached the essence of the hill climbing method.

2. Robotics:

In robotics, hill climbing technique turns out to be useful for an artificial agent roaming through a physical environment where its path is adjusted before arriving at the destination.

3. Network Design:

The tool may be employed for improvement of network forms, processes, and topologies in the telecommunications industry and computer networks. This approach erases the redundancy thus the efficiency of the networks are increased by studying and adjusting their configurations. It facilitates better cooperation, efficiency, and the reliability of diverse communication system.

4. Game playing:

Altough the hill climbing can be optimal in game playing AI by developing the strategies which helps to get the maximum scores.

5. Natural language processing:

The software assists in adjusting the algorithms to enable the software to be efficient at dealing with the tasks at hand such as summarizing text, translating languages and recognizing speech. These abilities owing to it as a significant tool for many applications.

Implementation

Now we will try to maximize the function with the help of the Hill Climbing algorithm: f (x) = -x² + 5
```
# Importing the libraries  
import random  
import matplotlib.pyplot as plt  
  
# Defining the function to maximize  
def function_target(val):  
return -val**2 + 5  # A parabola with maximum at x = 0    
# Visualizing the hill-climbing algorithm   
def hill_climbing_with_plot(  
steps_max=1000, step_range=0.1, x_min=-10, x_max=10  ):  
# Starting with some random solution  
point_current = random.uniform(x_min, x_max)  
value_current = function_target(point_current)    
# Tracking the  progress for  the purpose of visualization  
history_x = &#91;point_current]  
history_y = &#91;value_current]    
for step in range(steps_max):  
    # Generating a neighboring point by adding a small random step  
    point_neighbor = point_current + random.uniform(-step_range, step_range)    
    # Ensuring that  the neighbor stays within limits  
    point_neighbor = max(min(point_neighbor, x_max), x_min)  
    value_neighbor = function_target(point_neighbor)    
    # If the neighbor is better, move to it  
    if value_neighbor > value_current:  
        point_current = point_neighbor  
        value_current = value_neighbor    
    # Recording the current best point  
    history_x.append(point_current)  
    history_y.append(value_current)    
return point_current, value_current, history_x, history_y    
# Running the algorithm  
opt_x, opt_y, path_x, path_y = hill_climbing_with_plot()  
  
# Getting the final result  
print(f"Optimal x: {opt_x:.4f}, f(x): {opt_y:.4f}")  
  
# Plotting the function and the hill climbing path  
x_vals = [x / 100.0 for x in range(-1000, 1000)]  
y_vals = [function_target(x) for x in x_vals]  
  
plt.figure(figsize=(10, 6))  
plt.plot(x_vals, y_vals, label="Target Function: f(x) = -x² + 5", color='blue')  
plt.plot(path_x, path_y, marker='o', color='red', label='Search Path')  
plt.title("Hill Climbing Optimization Path")  
plt.xlabel("x")  
plt.ylabel("f(x)")  
plt.legend()  
plt.grid(True)  
plt.show()  
<textarea></div>  
<p><strong>Output:</strong></p>  
<div class="codeblock"><pre>Optimal x: -0.0001, f(x): 5.0000</pre></div>  
<img alt="Hill Climbing Algorithm in AI" src="https://images.tpointtech.com/tutorial/ai/images/hill-climbing-algorithm-in-ai5.png" />  
<p>This shows the algorithm has found a value of x close to 0, where the function achieves its maximum.</p>  
        &lt;hr />  
        &lt;div class="nexttopicdiv">&lt;span class="nexttopictext">Next Topic&lt;/span>&lt;span class="nexttopiclink">&lt;a href="means-ends-analysis-in-ai">Means-Ends Analysis in AI&lt;/a>&lt;/span>&lt;/div>  
             
        &lt;div id="bottomnext">&lt;a class="next" href="ai-informed-search-algorithms" style="float:left">← prev&lt;/a> &lt;a class="next" href="means-ends-analysis-in-ai" style="float:right">next →&lt;/a>&lt;/div>  
        &lt;br />  
         &lt;/td>  
    &lt;/tr>  
&lt;/tbody>  </table>  
</div>  
```
January 20, 2026

A* Search Algorithm in Artificial Intelligence

A* (pronounced “A-star”) is a powerful graph traversal and pathfinding algorithm widely used in artificial intelligence and computer science. It is mainly used to find the shortest path between two nodes in a graph, given the estimated cost of getting from the current node to the destination node. The main advantage of the algorithm is its ability to provide an optimal path by exploring the graph in a more informed way compared to traditional search algorithms such as Dijkstra’s algorithm.

Algorithm A* combines the advantages of two other search algorithms: Dijkstra’s algorithm and Greedy Best-First Search. Like Dijkstra’s algorithm, A* ensures that the path found is as short as possible but does so more efficiently by directing its search through a heuristic similar to Greedy Best-First Search. A heuristic function, denoted h(n), estimates the cost of getting from any given node n to the destination node.

Formula of A* Search Algorithm

As we know, A* Search Algorithm is used to identify the most cost-effective and the shortest path from one node to another node in a graph.

Let’s take a look at the formula that we use to find the total cost of exploring the shortest path from one node to another node:

f(n)=g(n)+h(n)

Where,

f(n): The total cost that occurs from the initial node to the final state via the current node n is represented by f(n).
g(n): the actual cost to get from the initial node to node n. It represents the sum of the costs of node n outgoing edges.
h(n): Heuristic cost (also known as “estimation cost”) from node n to destination node n. This problem-specific heuristic function must be acceptable, meaning it never overestimates the actual cost of achieving the goal. The evaluation function of node n is defined as f(n) = g(n) h(n).

Algorithm A* selects the nodes to be explored based on the lowest value of f(n), preferring the nodes with the lowest estimated total cost to reach the goal. The A* algorithm works:

Key components

Let’s understand the key components of the A* Search Algorithm:

Nodes: Nodes are the points in the graph( where connections meet)
Edges: These are connections between the Nodes
Path Cost: The Cost of traversing from one node to another node.
Heuristic: Estimated cost of traversing to the final or goal node from the current node
Search Space: It is the group of all possible paths that can be explored

How does the A* Search Algorithm work?

A* search blends the principles of Dijkstra’s algorithm with those of the Greedy Best-First Search. This gives the A* search algorithms the best of both worlds.

Let’s take a quick look at the explanation of both algorithms:

Dijkstra’s Algorithm: Dijkstra’s Algorithm helps to find the minimum-cost paths from a given source node to every other node in a graph.
Greedy Best-First Search Algorithm: In the Greedy Best-First Search algorithm, the node estimated to be nearest to the goal state is chosen for expansion.

The best of both worlds combines to form A* search, which makes it more efficient and accurate than they were individually.

Step-by-Step Implementation of the A* Algorithm

Initialization: In the first step, the initial nodes are placed in the priority queue. The priority queue consists of all nodes that are pending evaluation. It is also known as the open list.
Evaluation: In the second step, the node with the lowest f(n) value is selected from the queue to be processed next.
Expansion: The node with the lowest f(n) value that was chosen in the previous evaluation step is compared with its neighbours.
Update: After the comparison, if a neighbouring node has a lower f(n) value than the one previously recorded, then it is updated and added to the open list.
Goal Check: In this step, the goal check is done to determine whether the goal node is achieved. This can also be confirmed if the open list is vacant, which tells that there is nothing more for further traversal.
Reconstruction: In the final step, the path is reconstructed by backtracking or tracing the steps from the final or goal node or the initial node.

Pseudocode of A* Search Algorithm

Before seeing the actual implementation of A* Search Algorithm in Python, let’s see the Pseudocode to understand its working.

function A_star_search(begin, goal):
// here we are initializing open and close list
Open_List = [begin] // Nodes pending to be evaluated
Closed_List = [] // The nodes are already evaluated
// here we are initializing the node properties
begin.g = 0 // it is the cost from begin is 0
begin.h = heuristic(begin, goal) // Estimate to goal
begin.f = begin.g + begin.h // Total estimated cost
begin.parent = null // For path r_construction
while Open_List is not vacant:
// Getting the node having lowest f value
// for faster retrieval of the best node
//curr=current
curr = node in Open_List with lowest f value
// checking whether we have reached the goal
if curr = goal:
return r_construct_path(curr)
// moving the curr node from Open_List to Closed_List
remove curr from Open_List
add curr to Closed_List
// here we checking the neighboring nodes
for each neighbor of curr:
if neighbor in Closed_List:
continue // Skip already evaluated nodes
// here we are calculating tent g score and tent=tentative
tent_g = curr.g + distance(curr, neighbor)
if neighbor not in Open_List:
add neighbor to Open_List
else if tent_g >= neighbor.g:
continue // This path isn’t better
// This known best path till now
neighbor.parent = curr
neighbor.g = tent_g
neighbor.h = heuristic(neighbor, goal)
neighbor.f = neighbor.g + neighbor.h
return failure // if no path exists
function r_construct_path(curr):
path = []
while curr is not null:
add curr to beginning of path
curr = curr.parent
return path

Now we are ready to look at the implementation of A* Search Algorithm in Python.

Implementing the A* search algorithm in Python

Let’s see the implementation of the A* search algorithm in Python through the code.

Code:

#importing the required modules
import heapq
import math
#defining the class Node
class Node:
def __init__(self, position, parent=None):
self.position = position
self.parent = parent
self.g = 0 # Distance from start node
self.h = 0 # Heuristic to goal
self.f = 0 # Total cost
def __eq__(self, other):
return self.position == other.position
def __lt__(self, other):
return self.f < other.f
def __repr__(self):
return f”({self.position}, f: {self.f})”
#defining the heuristic function
def heuristic(a, b):
return math.sqrt((a[0] – b[0])**2 + (a[1] – b[1])**2)
#defining the A* algorithm
def astar(maze, start, end):
open_list = []
closed_list = set()
start_node = Node(start)
end_node = Node(end)
heapq.heappush(open_list, start_node)
while open_list:
current_node = heapq.heappop(open_list)
closed_list.add(current_node.position)
if current_node == end_node:
path = []
while current_node:
path.append(current_node.position)
current_node = current_node.parent
return path[::-1]
for new_position in [(0, -1), (0, 1), (-1, 0), (1, 0), (-1, -1), (-1, 1), (1, -1), (1, 1)]:
node_position = (current_node.position[0] + new_position[0], current_node.position[1] + new_position[1])
if node_position[0] > (len(maze) – 1) or node_position[0] < 0 or node_position[1] > (len(maze[len(maze)-1]) – 1) or node_position[1] < 0:
continue
if maze[node_position[0]][node_position[1]] != 0:
continue
new_node = Node(node_position, current_node)
if new_node.position in closed_list:
continue
new_node.g = current_node.g + 1
new_node.h = heuristic(new_node.position, end_node.position)
new_node.f = new_node.g + new_node.h
if add_to_open(open_list, new_node):
heapq.heappush(open_list, new_node)
return None
def add_to_open(open_list, neighbor):
for node in open_list:
if neighbor == node and neighbor.g > node.g:
return False
return True
#inputting the values
maze = [
[0, 0, 1, 1, 1, 0, 0],
[1, 0, 1, 0, 0, 0, 1],
[1, 0, 0, 0, 1, 0, 0],
[1, 1, 1, 0, 1, 1, 0],
[0, 0, 0, 0, 0, 0, 0]
]
start = (0, 1)
end = (2, 6)
path = astar(maze, start, end)
print(path)

Output:

[(0, 1), (1, 1), (2, 2), (2, 3), (1, 4), (2, 5), (2, 6)]

Explanation:

In the above example, we showed an example of the A* star search algorithm using the Maze representation, which is an Euclidean distance heuristic function.

In the maze representation, 0 is walkable and 1 is blocked. There are 8 movements allowed, which are up, down, right, left, and diagonals. It uses the heapq library for an open list to select the list with the lowest f(n), and the formula is f(n)= g(n) + h(n). We defined start and end nodes.

Finally, it returned the optimal and efficient path as a list of coordinates.

Understanding Heuristic Search h(n)

Heuristic function h(n) plays a crucial role in the A* search algorithm as it helps to find the estimated cost of traversing to the final or goal node from the current node.

There are several types of Heuristic search. Let’s understand them in detail:

Manhattan Distance: The Manhattan Distance is calculated as the sum of the net differences between the current position and the goal position of x and y coordinates.

Sounds confusing, right? Let’s understand this with a formula that you might have learned in geometry.

Formula:

h(n) = |x _current – x _goal| + |y _current – y _goal|

Where,

h(n) = Heuristic Function

Euclidean Function: The Euclidean distance is calculated as the square root of the sum of the squares of the differences between the x and y coordinates of the current position and the goal position.

Let’s see the above statement in the form of a formula.

Formula:

h(n) = √(x _current – x _goal)² + (y _current – y _goal)²

Diagonal Distance: The Diagonal is the distance that is measured when the number of movements of directions possible is 8. For example, a Robot vacuum cleaner or a King in the game of chess.

Let’s see the above statement in the form of a formula.

Formula:

h(n) = max(|x_current – x_goal|, |y_current– y_goal|

Advantages of A* Search Algorithm in Artificial Intelligence

The A* search algorithm offers several advantages in artificial intelligence and problem-solving scenarios:

Optimal solution: A* ensures finding the optimal (shortest) path from the start node to the destination node in the weighted graph given an acceptable heuristic function. This optimality is a decisive advantage in many applications where finding the shortest path is essential.
Completeness: If a solution exists, A* will find it, provided the graph does not have an infinite cost This completeness property ensures that A* can take advantage of a solution if it exists.
Efficiency: A* is efficient ifan efficient and acceptable heuristic function is used. Heuristics guide the search to a goal by focusing on promising paths and avoiding unnecessary exploration, making A* more efficient than non-aware search algorithms such as breadth-first search or depth-first search.
Versatility: A* is widely applicable to variousproblem areas, including wayfinding, route planning, robotics, game development, and more. A* can be used to find optimal solutions efficiently as long as a meaningful heuristic can be defined.
Optimized search: A* maintains a priority order to select the nodes with the minor f(n) value (g(n) and h(n)) for expansion. This allows it to explore promising paths first, which reduces the search space and leads to faster convergence.
Memory efficiency: Unlike some other search algorithms, such as breadth-first search, A* stores only a limited number of nodes in the priority queue, which makes it memory efficient, especially for large graphs.
Tunable Heuristics: A*’s performancecan be fine-tuned by selecting different heuristic functions. More educated heuristics can lead to faster convergence and less expanded nodes.
Extensively researched: A* is a well-established algorithm with decades of research and practical applications. Many optimizations and variations have been developed, making it a reliable and well-understood troubleshooting tool.
Web search: A* can be used for web-based path search, where the algorithm constantly updates the path according to changes in the environment or the appearance of new It enables real-time decision-making in dynamic scenarios.

Disadvantages of A* Search Algorithm in Artificial Intelligence

Although the A* (letter A) search algorithm is a widely used and powerful technique for solving AI pathfinding and graph traversal problems, it has disadvantages and limitations. Here are some of the main disadvantages of the search algorithm:

Heuristic accuracy: The performance of the A* algorithm depends heavily on the accuracy of the heuristic function used to estimate the cost from the current node to the If the heuristic is unacceptable (never overestimates the actual cost) or inconsistent (satisfies the triangle inequality), A* may not find an optimal path or may explore more nodes than necessary, affecting its efficiency and accuracy.
Memory usage: A* requires that all visited nodes be kept in memory to keep track of explored paths. Memory usage can sometimes become a significant issue, especially when dealing with an ample search space or limited memory resources.
Time complexity: AlthoughA* is generally efficient, its time complexity can be a concern for vast search spaces or graphs. In the worst case, A* can take exponentially longer to find the optimal path if the heuristic is inappropriate for the problem.
Bottleneck at the destination: In specific scenarios, the A* algorithm needs to explore nodes far from the destination before finally reaching the destination region. This the problem occurs when the heuristic needs to direct the search to the goal early effectively.
Cost Binding: A* faces difficulties when multiple nodes have the same f-value (the sum of the actual cost and the heuristic cost). The strategy used can affect the optimality and efficiency of the discovered path. If not handled correctly, it can lead to unnecessary nodes being explored and slow down the algorithm.
Complexity in dynamic environments: In dynamic environments where the cost of edges or nodes may change during the search, A* may not be suitable because it does not adapt well to such changes. Reformulation from scratch can be computationally expensive, and D* (Dynamic A*) algorithms were designed to solve this
Perfection in infinite space : A* may not find a solution in infinite state space. In such cases, it can run indefinitely, exploring an ever-increasing number of nodes without finding a solution. Despite these shortcomings, A* is still a robust and widely used algorithm because it can effectively find optimal paths in many practical situations if the heuristic function is well-designed and the search space is manageable. Various variations and variants of A* have been proposed to alleviate some of its limitations.

Applications of the A* Search Algorithm in Artificial Intelligence

The search algorithm A* (letter A) is a widely used and robust pathfinding algorithm in artificial intelligence and computer science. Its efficiency and optimality make it suitable for various applications. Here are some typical applications of the A* search algorithm in artificial intelligence:

Pathfinding in Games: A* is oftenused in video games for character movement, enemy AI navigation, and finding the shortest path from one location to another on the game map. Its ability to find the optimal path based on cost and heuristics makes it ideal for real-time applications such as games.
Robotics and Autonomous Vehicles: A* is used in robotics and autonomous vehicle navigation to plan anoptimal route for robots to reach a destination, avoiding obstacles and considering terrain costs. This is crucial for efficient and safe movement in natural environments.
Maze solving: A* can efficiently find the shortest path through a maze, making it valuable in many maze-solving applications, such as solving puzzles or navigating complex structures.
Route planningand navigation: In GPS systems and mapping applications, A* can be used to find the optimal route between two points on a map, considering factors such as distance, traffic conditions, and road network topology.
Puzzle-solving: A* can solve various diagram puzzles, such as sliding puzzles, Sudoku, and the 8-puzzle problem. Resource Allocation: In scenarios where resources must be optimally allocated, A* can help find the most efficient allocation path, minimizing cost and maximizing efficiency.
Network Routing: A* can be usedin computer networks to find the most efficient route for data packets from a source to a destination node.
Natural Language Processing (NLP): In some NLP tasks, A* can generate coherent and contextualresponses by searching for possible word sequences based on their likelihood and relevance.
Path planningin robotics: A* can be used to plan the path of a robot from one point to another, considering various constraints, such as avoiding obstacles or minimizing energy consumption.
Game AI: A* is also used to makeintelligent decisions for non-player characters (NPCs), such as determining the best way to reach an objective or coordinate movements in a team-based game.

Comparison of A* Search Algorithm with Other Search Algorithms

Algorithm	Optimality	Use of Heuristics	Efficiency in Large Graphs	Suitable Use Cases
BFS	Guarantees the shortest path for unweighted graphs	None	Inefficient in large graphs	Unweighted graph traversal
DFS	Does not guarantee the shortest path	None	May explore irrelevant paths	Depth-based exploration, maze solving
Dijkstra	Guarantees the shortest path for weighted graphs	None	Slower due to exhaustive search	Weighted networks, road mapping
A*	Guarantees the shortest path for admissible heuristics	Utilizes h(n)h(n)	Efficient with good heuristics	Pathfinding with varying weights, AI

Conclusion

Being a powerful and versatile tool in the artificial intelligence domain, the A* search algorithm has famous accolades on its shoulders for its superior efficiency in pathfinding and graph traversal problems. A* intelligently blends actual cost from g(n) with an estimated cost h(n) to the goal and optimally explores, while still being complete under certain conditions.

It acts as a bridge between solely exhaustive search strategies like Breadth first search or Dijkstra’s algorithm, with heuristic developed techniques like Greedy Best first search. The structure of this balance allows A* to achieve optimality with low exploration.

Frequently Asked Questions (FAQs)

1. What is the A* Search Algorithm?

A* Search Algorithm is used to identify the most cost-effective and the shortest path of traversing from one node to another node in a graph.

Here is the formula we use:

f(n)=g(n)+h(n)

2. What is g(n) in A*?

g(n) represents what it takes to reach the initial node from the last node. It’s equal to all the costs of node n that are transferred to other nodes.

3. What is h(n) in A*?

The heuristic cost represents the estimated distance or effort required to move from a given node n to the target node. An admissible heuristic is one that never fails to calculate the actual cost of reaching the goal.

4. What are the common heuristics used in A*?

The most common heuristics used in A* are:

Manhattan Distance – The Manhattan Distance is calculated as the sum of the net differences between the current position and the goal position of x and y coordinates.
Euclidean Distance – The Euclidean distance is calculated as the square root of the sum of the squares of the differences between the x and y coordinates of the current position and the goal position.
Diagonal Distance – The Diagonal is the distance that is measured when the possible number of movements in directions is 8. For example, a Robot vacuum cleaner or a King in the game of chess.

5. Where is A* used in real life?

A* search algorithm is used in the following areas:

GPS and maps (Google Maps)
Robotics navigation
AI in video games
Puzzle solvers (e.g., 8-puzzle, Rubik’s cube)
Network routing

January 20, 2026

Uninformed Search Algorithms
An uninformed search is one in which the search systems do not use any clues about the suitable area, but it depends on the random nature of the Search. Nevertheless, they begin the exploration of search space (all possible solutions) synchronously. The search operation begins from the initial state and provides all possible next steps arrangements until the goal is reached.

These are mostly the simplest search strategies, but they may not be suitable for complex paths that involve irrelevant or even irrelevant components. These algorithms are necessary for solving basic tasks or providing simple processing before passing on the data to more advanced search algorithms that incorporate prioritized information.

Following are the various types of uninformed search algorithms:
1. Breadth-first Search:
- Breadth-first Search is the most common search strategy for traversing a tree or graph. This algorithm searches breadthwise in a tree or graph, so it is called breadth-first Search.
- The BFS algorithm starts searching from the root node of the tree and expands all successor nodes at the current level before moving to nodes of the next level.
- The breadth-first search algorithm is an example of a general-graph search algorithm.
- Breadth-first Search was implemented using a FIFO queue data structure.
Advantages:
- BFS will provide a solution if any solution exists.
- If there is more than one solution for a given problem, then BFS will provide the minimal solution that requires the least number of steps.
- It also helps find the shortest path in the goal state since it needs all nodes at the same hierarchical level before moving to nodes at lower levels.
- It is also very easy to comprehend. With this, we can assign a higher rank among path types.
Disadvantages:
- It requires lots of memory since each level of the tree must be saved in memory to expand to the next level.
- BFS needs lots of time if the solution is far away from the root node.
- It can be a very inefficient approach for searching through deeply layered spaces, as it needs to thoroughly explore all nodes at each level before moving on to the next.
Example:

In the tree structure below, we show the traversing of the tree using the BFS algorithm from root node S to goal node K. The BFS search algorithm traverses in layers, so it will follow the path which is shown by the dotted arrow, and the traversed path will be:

S—> A—>B—->C—>D—->G—>H—>E—->F—->I—->K

Time Complexity: The time Complexity of the BFS algorithm can be obtained by the number of nodes traversed in BFS until the shallowest Node. Where the d= depth of shallowest solution and b is a node at every state.

T (b) = 1+b²+b³+…….+ b^d= O (b^d)

Space Complexity: The space complexity of the BFS algorithm is determined by the memory size of the frontier, which is O(b^d).

Completeness: BFS is complete, which means if the shallowest goal node is at some finite depth, then BFS will find a solution.

Optimality: BFS is optimal if path cost is a non-decreasing function of the depth of the Node.

2. Depth-first Search
- Depth-first Search is a recursive algorithm for traversing a tree or graph data structure.
- It is called the depth-first Search because it starts from the root node and follows each path to its greatest depth node before moving to the next path.
- DFS uses a stack data structure for its implementation.
- The process of the DFS algorithm is similar to that of the BFS algorithm.
Note: Backtracking is an algorithm technique that uses recursion to find all possible solutions.

Advantages:
- DFS requires very little memory as it only needs to store a stack of the nodes on the path from the root node to the current Node.
- It takes less time to reach the goal node than the BFS algorithm (if it traverses in the right path).
- With the help of this, we can store the route that is being tracked in memory to save time, as it only needs to keep one at a particular time.
Disadvantage:
- There is the possibility that many states keep re-occurring, and there is no guarantee of finding a solution.
- DFS algorithm goes for deep-down searching, and sometimes, it may go to the infinite loop.
- The depth-first Search (DFS) algorithm does not always find the shortest path to a solution.
Example:

In the below search tree, we have shown the flow of the depth-first Search, and it will follow the order:

Root node—>Left node —-> Right Node.

It will start searching from root node S and traverse A, then B, then D, and E; after traversing E, it will backtrack the tree as E has no other successor, and the goal node is still not found. After backtracking, it will traverse node C and then G, and here, it will terminate as it found the goal node.

Completeness: The DFS search algorithm is complete within finite state space as it will expand every Node within a limited search tree.

Time Complexity: The time complexity of DFS will be equivalent to the Node traversed by the algorithm. It is given by:

T(n)= 1+ n²+ n³ +………+ n^m=O(n^m)

Where m= maximum depth of any node, and this can be much larger than d (Shallowest solution depth)

Space Complexity: The DFS algorithm needs to store only a single path from the root node. Hence, the space complexity of DFS is equivalent to the size of the fringe set, which is O(bm).

Optimal: The DFS search algorithm is non-optimal, as it may generate a large number of steps or high Cost to reach the goal node.

3. Depth-Limited Search Algorithm:

A depth-limited search algorithm is similar to a depth-first Search with a predetermined limit. Depth-limited Search can solve the drawback of the infinite path in the Depth-first search. In this algorithm, the Node at the depth limit will be treated as having no successor nodes further.

Depth-limited Search can be terminated with two Conditions of failure:
- Standard failure value: It indicates that the problem does not have any solution.
- Cutoff failure value:It defines no solution for the problem within a given depth limit.
Advantages:

Depth-limited Search will restrict the search depth of the tree. Thus, the algorithm will require fewer memory resources than the straight BFS (Breadth-First Search) and IDDFS (Iterative Deepening Depth-First Search). After all, this implies the automatic selection of more segments of the search space and the consequent reason for the consumption of the resources. Due to the depth restriction, DLS omits the predicament of holding the entire search tree within memory, which contemplatively leaves room for a more memory-efficient vice for solving a particular kind of problem.
- When there is a leaf node depth that is as large as the highest level allowed, do not describe its children, and then discard it from the stack.
- Depth-limited Search does not explain the infinite loops that can arise in classical when there are cycles in a graph of cities.
Disadvantages:
- Depth-limited Search also has the disadvantage of incompleteness.
- It may not be optimal if the problem has more than one solution.
- The effectiveness of the Depth-Limited Search (DLS) algorithm is largely dependent on the depth limit specified. If the depth limit is set too low, the algorithm may fail to find the solution altogether.
Example:

Completeness: The DLS search algorithm is complete if the solution is above the depth limit.

Time Complexity: The time complexity of the DLS algorithm is O(b^ℓ), where b is the branching factor of the search tree, and l is the depth limit.

Space Complexity: The space complexity of the DLS algorithm is O(b×ℓ), where b is the branching factor of the search tree, and l is the depth limit.

Optimal: Depth-limited Search can be viewed as a special case of DFS, and it is also not optimal even if ℓ>d.

4. Uniform-cost Search Algorithm:

Uniform-cost Search is a searching algorithm used for traversing a weighted tree or graph. This algorithm is used when a different cost is available for each edge. The primary goal of the uniform-cost Search is to find a path to the goal node that has the lowest cumulative Cost. Uniform-cost Search expands nodes according to their path costs form the root node.

It can be used to solve any graph/tree where the optimal Cost is in demand. The priority queue implements a uniform-cost search algorithm. It gives maximum priority to the lowest cumulative Cost. A uniform cost search is equivalent to the BFS algorithm if the path cost of all edges is the same.

Advantages:
- A uniform cost search is optimal because, at every state, the path with the least cost is chosen.
- It is efficient when the edge weights are small, as it explores the paths in an order that ensures that the shortest path is found early.
- It’s a fundamental search method that is not overly complex that makes it accessible for many users.
- It is a type of comprehensive algorithm that will find a solution if one exists. This means the algorithm is complete and ensures that it can locate a solution whenever a viable one is available. The algorithm covers all the necessary steps to arrive at a resolution.
Disadvantages:
- It does not care about the number of steps involved in searching and is only concerned about path cost. Due to this, this algorithm may be stuck in an infinite loop.
- When in operation, UCS shall know all the edge weights to start the Search.
- This Search holds constant the list of the nodes that it has already discovered in a priority queue. Such is a much weightier thing if you have a large graph. The algorithm allocates the memory by storing the path sequence of prioritizes, which can be memory-intensive as the graph gets larger.
- With the help of a Uniform cost search, we can end up with the problem if the graph has edge cycles with a smaller cost than that of the shortest path.
- The Uniform cost search will keep deploying a priority queue so that the paths explored can be stored in any case as the graph size can be even bigger, which can eventually result in too much memory being used.
Example:

Completeness:

Uniform-cost Search is complete; if there is a solution, UCS will find it.

Time Complexity:

Let C* be the Cost of the optimal solution, and ε is each step to get closer to the goal node. Then the number of steps is = C*/ε+1. Here, we have taken +1, as we start from state 0 and end with C*/ε.

Hence, the worst-case time complexity of the Uniform-cost search is (b^{1 + [C*/ε]})/.

Space Complexity:

The same logic is for space complexity, so the worst-case space complexity of Uniform-cost search is O(b^{1 + [C*/ε]}).

Optimal:

Uniform-cost Search is always optimal as it only selects a path with the lowest path cost.

5. Iterative Deepening Depth-first Search (IDDFS):

The iterative deepening algorithm is a combination of DFS and BFS algorithms. This search algorithm finds out the best depth limit and does it by gradually increasing the limit until a goal is found.

This algorithm performs a depth-first Search up to a certain “depth limit”, and it keeps increasing the depth limit after each iteration until the goal node is found.

This Search algorithm combines the benefits of Breadth-first search’s fast Search and depth-first Search’s memory efficiency.

The iterative search algorithm is useful for uninformed Search when the search space is large, and the depth of the goal node is unknown.

Steps for Iterative deepening depth-first search algorithm:
- Set the depth limit to 0.
- Perform DFS to the depth limit.
- If the goal state is found, return it.
- If the goal state is not found and the maximum depth has not been reached, increment the depth limit and repeat steps 2-4.
- If the goal state is not found and the maximum depth has been reached, terminate the search and return failure.
Advantages:
- It combines the benefits of BFS and DFS search algorithms in terms of fast search and memory efficiency.
- It is a type of straightforward which is used to put into practice since it builds upon the conventional depth-first search algorithm.
- It is a type of search algorithm that provides guarantees to find the optimal solution as long as the Cost of each edge in the search space is the same.
- It is a type of complete algorithm, and the meaning of this is it will always find a solution if one exists.
- The Iterative Deepening Depth-First Search (IDDFS) algorithm uses less memory compared to Breadth-First Search (BFS) because it only stores the current path in memory rather than the entire search tree.
Disadvantages:

The main drawback of IDDFS is that it repeats all the work of the previous phase.

Example:

The following tree structure shows the iterative deepening depth-first Search. The IDDFS algorithm performs various iterations until it cannot find the goal node. The iteration performed by the algorithm is given as:

1’st Iteration—–> A

2’nd Iteration—-> A, B, C

3’rd Iteration——>A, B, D, E, C, F, G

4’th Iteration——>A, B, D, H, I, E, C, F, K, G

In the fourth iteration, the algorithm will find the goal node.

Completeness:

This algorithm is complete if the branching factor is finite.

Time Complexity:

Let’s suppose b is the branching factor, depth is d, and then the worst-case time complexity is O(b^d).

Space Complexity:

The space complexity of IDDFS will be O(bd).

Optimal:

IDDFS algorithm is optimal if path cost is a non-decreasing function of the depth of the Node.

6. Bidirectional Search Algorithm:

A bidirectional search algorithm runs two simultaneous searches, one form the initial state called forward-search and the other from the goal node called backward-search, to find the goal node. Bidirectional Search replaces one single search graph with two small subgraphs in which one starts the Search from an initial vertex and the other starts from a goal vertex. The Search stops when these two graphs intersect each other.

Bidirectional Search can use search techniques such as BFS, DFS, DLS, etc.

Advantages:
- Bidirectional Search is fast.
- Bidirectional Search requires less memory.
- The graph can be extremely helpful when it is very large, and there is no way to make it smaller. In such cases, using this tool becomes particularly useful.
- The Cost of expanding nodes can be high in certain cases. In such scenarios, using this approach can help reduce the number of nodes that need to be expanded.
Disadvantages:
- Implementation of the bidirectional search tree is difficult.
- In bidirectional Search, one should know the goal state in advance.
- Finding an efficient way to check if a match exists between search trees can be tricky, which can increase the time it takes to complete the task.
Example:

In the below search tree, the bidirectional search algorithm is applied. This algorithm divides one graph/tree into two sub-graphs. It starts traversing from node 1 in the forward direction and starts from goal node 16 in the backward direction. The algorithm terminates at node 9, where two searches meet.

Completeness: Bidirectional Search is complete if we use BFS in both searches.

Time Complexity: The time complexity of bidirectional Search using BFS is O(b^d).

Space Complexity: The space complexity of bidirectional Search is O(b^d).

Optimal: Bidirectional Search is Optimal.

Applications of Uninformed Search Algorithms

Blind search algorithms, also referred to as uninformed search algorithms, have quite a few applications in various domains. Even though they look and smell simple and lack typical domain-specific heuristics, these algorithms are remarkably powerful for solving fundamental problems in Artificial Intelligence, network routing, puzzle solving, and web crawling.

Artificial Intelligence
- Pathfinding in Games: This means that characters or agents must traverse through virtual environments in games. For instance, in the type of games in which you must solve mazes, algorithms like Breadth First Search (BFS) and Depth First Search (DFS) are meant for finding paths between different points. Such methods are simple but effective in deterministic environments where the map is completely known.
- Robotics: For uninformed Search in robotics, autonomous systems can use it to explore an environment. For example, Uniform Cost Search (UCS) is a strategy that a robot performing consecutive moves through a warehouse grid might use to find the path of the shortest distance to a target location.
- Planning: Planning with AI is a task that involves finding a sequence of actions to realize a desired goal. With tasks such as scheduling and logistics and making simple decisions in constrained environments, the Depth First Search (DFS) and Iterative Deepening Depth First Search (IDDFS) are commonly used.
Network Routing
- Shortest Path Problems: Uniform-Cost Search (UCS) is the perfect algorithm for finding the cheapest path if we have two network nodes. It is applied to the routing of data packets in computer networks, to find optimal transportation routes, etc., and to manage power grids.
- Connection Troubleshooting: It helps in tracing a path to debug a network to trace all nodes reachable from a fixed source using BFS.
Puzzle Solving
- Sliding Puzzles: Sliding tile puzzles like the 8 puzzle or 15 puzzles are puzzles where tiles need to be arranged into some specific order and solved by finding a sequence of moves to do it. BFS guarantees that the solution will be optimal, and DFS can quickly find any solution.
- N-Queens Problem: The N-Queens Problem is to place exactly N queens on an N x N chessboard with the constraint that no two queens can attack each other. Systematic exploration of board configurations is done with Depth Limited Search and IDDFS.
- Sudoku Solving: While an uninformed search is not the most efficient approach for solving a Sudoku problem, it can help us complete enumerations of possibilities and check for solutions.
Web Crawling
- Efficient Traversal: In particular, BFS is used when web crawling pages to make sure all reachable pages from a starting URL are explored. This method is particularly useful when used to index/search on search engines or during exhaustive Search through connected web pages.
- Depth Management: To control the level of exploration in large web graphs, Depth Limited Search is used to limit the number of links followed so systems are not overloaded.
Strengths of Uninformed Search Algorithms
- Universality: All of it is given in terms of uninformed search algorithms that can be applied to any graph or tree problem. The obvious general-purpose nature of them makes them suitable for general-purpose problem-solving without requiring such problem-specific heuristics.
- Simplicity: They are easy to formulate and even easier to implement. Larger popular examples like BFS and DFS have very little conceptual content that allows you to understand them without much work, making them ideal as both building blocks for other, more sophisticated techniques as well as educational tools.
- Guaranteed Solution (When Applicable): BFS is an algorithm that is complete; it guarantees that a solution is found if one exists, and, in the case of the Search, space is finite. A major advantage of this reliability is in the case where a solution must be ensured, which is not always true.
- Optimality (In Specific Cases): Unlike BFS, to find the shortest path to the goal in unweighted graphs, BFS guarantees to find the shortest path. In cases where the path cost is uniform or irrelevant, this property provides a benefit.
Weaknesses of Uninformed Search Algorithms
- High Resource Consumption (Time and Space): In the worst case, such uninformed search algorithms often explore vast regions of the search space and, therefore, have exponential time complexity. A simple example is that BFS needs to maintain a queue of all frontier nodes, and this will overspend memory in large search spaces.
- Inefficiency in Large or Complex Search Spaces: Algorithms such as DFS are likely to fall into infinite loops in environments with many nodes or having deep solutions. This is because they use no heuristic information for steering, and so they have biased behavior in exploring irrelevant paths.
- Lack of Domain Knowledge Utilisation: Without taking such problem-specific insights into account, uninformed algorithms tend to result in an order of magnitude increase in computational effort. The algorithms, such as A*, use heuristics to guide the Search more efficiently, while in contrast to them, the algorithm is not informed.
- Not Always Optimal: For example, DFS does not ensure a shortest path, as it is concerned with depth before minimizing Cost. For weighted machines, BFS is not optimal either.
January 20, 2026
Search Algorithms in Artificial Intelligence
Search algorithms in AI are the algorithms that are created to aid the searchers in getting the right solution. The search issue contains search space, first start and end point. Now, by performing simulations of scenarios and alternatives, searching algorithms help AI agents find the optimal state for the task.

Logic used in algorithms processes the initial state and tries to get the expected state as the solution. Because of this, AI machines and applications just functioning using search engines and solutions that come from these algorithms can only be as effective as the algorithms.

AI agents can make the AI interfaces usable without any software literacy. The agents that carry out such activities do so with the aim of reaching an end goal and developing action plans that, in the end, will bring the mission to an end. Completion of the action is gained after the steps of these different actions. The AI agents find the best way through the process by evaluating all the alternatives that are present. Search systems are a common task in artificial intelligence by which you are going to find the optimum solution for the AI agents.

Problem-solving Agents:

In Artificial Intelligence, Search techniques are universal problem-solving methods. Rational agents or Problem-solving agents in AI mostly use these search strategies or algorithms to solve a specific problem and provide the best result. Problem-solving agents are the goal-based agents and use atomic representation. In this topic, we will learn various problem-solving search algorithms.

Properties of Search Algorithms:

Following are the four essential properties of search algorithms to compare the efficiency of these algorithms:
- Completeness: A search algorithm is said to be complete if it guarantees to return a solution if at least any solution exists for any random input.
- Optimality: If a solution found for an algorithm is guaranteed to be the best solution (lowest path cost) among all other solutions, then such a solution is said to be an optimal solution.
- Time Complexity: Time complexity is a measure of time for an algorithm to complete its task.
- Space Complexity: It is the maximum storage space required at any point during the search, as the complexity of the problem.
Importance of Search Algorithms in Artificial Intelligence

Here are some important factors of the role of search algorithms used by AI as follows.

1. Solving problems:

“Workflow” logical search methods like describing the issue, getting the necessary steps together and specifying an area to search help AI search algorithms get better at solving problems. Take, for instance, the development of AI search algorithms that support applications like Google Maps by finding the fastest way or shortest route between given destinations. These programs basically search various options to find the best solution possible.

2. Search programming:

Many AI functions can be designed as search oscillations, which thus specify what to look for in formulating the solution to the given problem.

3. Goal-based agents:

Instead, the goal-directed and high-performance systems use a wide range of search algorithms to improve the efficiency of AI. Though they are not robots, these agents look for the ideal route for action dispersion so as to avoid the most impacting steps that can be used to solve a problem. It is their main aims to come up with an optimal solution which takes into account all possible factors.

4. Support production systems:

AI Algorithms in search engines for systems manufacturing help them run faster. These programmable systems assist AI applications with applying rules and methods, thus making an effective implementation possible. Production systems involve learning of artificial intelligence systems and their search for canned rules that lead to the wanted action.

5. Neural network systems:

Beyond this, employing neural network algorithms is also of importance to neural network systems. The systems are composed of these structures: a hidden layer, an input layer, an output layer, and interconnected nodes. One of the most important functions offered by neural networks is to address the challenges of AI within any given scenario. AI is somehow able to navigate the search space to find the connection weights that will be required in the mapping of inputs to outputs. This is made better by search algorithms in AI.

Search Algorithm Terminologies
- Search: Searching is a step-by-step procedure to solve a search problem in a given search space. A search problem can have three main factors:
  - Search Space: Search space represents a set of possible solutions that a system may have.
  - Start State: It is a state from where the agent begins the search.
  - Goal test: It is a function that observes the current state and returns whether the goal state is achieved or not.
- Search Tree: A tree representation of a search problem is called a Search tree. The root of the search tree is the root node, which corresponds to the initial state.
- Actions: It describes all the available actions to the agent.
- Transition Model: A description of what each action does can be represented as a transition model.
- Path Cost: It is a function that assigns a numeric cost to each path.
- Solution: It is an action sequence that leads from the start node to the goal node.
- Optimal Solution: If a solution has the lowest cost among all solutions.
Types of Search Algorithms

Based on the search problems we can classify the search algorithms into uninformed (Blind search) search and informed search (Heuristic search) algorithms.

Uninformed/Blind Search

The uninformed search does not contain any domain knowledge, such as closeness or the location of the goal. It operates in a brute-force way as it only includes information about how to traverse the tree and how to identify leaf and goal nodes. Uninformed search applies a way in which a search tree is searched without any information about the search space, like initial state operators and tests for the goal, so it is also called blind search. It examines each node of the tree until it achieves the goal node.

It can be divided into six main types:
- Breadth-first search
- Uniform cost search
- Depth-first search
- Depth limited search
- Iterative deepening depth-first search
- Bidirectional Search
Informed Search

Informed search algorithms use domain knowledge. In an informed search, problem information is available, which can guide the search. Informed search strategies can find a solution more efficiently than an uninformed search strategy. An informed search is also called a Heuristic search.

A heuristic is a way that might not always be guaranteed for the best solutions but guaranteed to find a good solution in a reasonable time.

Informed search can solve many complex problems that could not be solved in another way.

An example of informed search algorithms is a travelling salesman problem.
- Greedy Search
- A* Search
Conclusion

Search algorithms in Artificial Intelligence (AI) are crucial tools designed to solve complex search problems by navigating through a defined search space that includes a start state and a goal state. These algorithms play a vital role in addressing various AI challenges and also enhance the performance of related systems such as neural networks and manufacturing processes. A typical search algorithm first defines the problems, which involve elements like the initial state, target state, state space, and path cost. It then executes systematic operations to identify the optimal solution.

In AI, search algorithms are broadly categorized into informed and uninformed types. Uninformed algorithms-such as Breadth-First Search, Depth-First Search, and Uniform-Cost Search, explore the search space without any prior knowledge of the goal. In contrast, informed algorithms, including Greedy Search, A* Tree Search, and A* Graph Search, use heuristic information to guide the search more efficiently. These algorithms are widely applied in real-world domains such as vehicle routing, nurse scheduling, document retrieval, and various industrial processes, making them foundational components of intelligent systems.
January 20, 2026
What is Peas in AI
While there are many types of agents running in Artificial Intelligence (AI), these are all agents that serve different purposes. On the level of performance, environment, actuators, and sensors, the PEAS system is a major framework for these types of agents. To understand how different AI agents excel in most environments, there is a system called PEAS.

Assuming such are available, one of them, such as Rational Agents, is the most efficient since they will always take the optimal path for maximum efficiency.

Performance Measure

Defining Performance Measures in AI

Artificial intelligence performance measures are the measures that are used to appraise a strong framework’s progress. These actions are quantitative or subjective matters of assessing how well the framework does the tasks assigned to be done by it. The importance of the decision of the execution measures lies in the fact that it will determine whether an artificial intelligence framework is valid and capable of doing the job it was designed for.

Types of Performance Measures

Depending on the particular idea of the artificial intelligence system as well as the nature of its specific assignments, there are different sorts of performance measures. The normal exhibition measures incorporate exactness, accuracy, review, F1-score, mistake rate, and proficiency. The appropriate measure for the artificial intelligence framework will be determined based on the goals of the artificial intelligence framework and the properties of the problem it is trying to solve.

Role of Performance Measures in AI

Performance measures are indispensable elements in the planning and improvement of an artificial intelligence framework. The optimisation step is guided by them, which means the AI (computer-based intelligence) design can quantify the framework to deliver better results. Additionally, execution estimates enable the connection of various artificial intelligence models and computations so as to pick the best methodology for a specific issue.

Types of Environments

Different kinds of circumstances exist for the artificial intelligence frameworks, and they can perform their operations in both controlled and deterministic, and dynamic and unusual conditions. Although some artificial intelligence applications, for example, advanced mechanics, perform in real, factual situations, there are applications such as normal language handling, managing particular or virtual or computerised spaces. Such an environment has a sizable impact on the intricacy of a framework’s artificial intelligence-related activities.
- Fully Observable & Partially Observable
- Episodic & Sequential
- Static & Dynamic
- Discrete & Continuous
- Deterministic & Stochastic
Significance of Environment Modelling

For artificial intelligence frameworks to make powerful decisions, this modelling of the environment is crucial. The more an AI framework understands its current situation, the more likely it is to achieve the goals that it set for itself. Usually, ecological displaying involves assembling information, feeling tactile information streams, and portraying the environmental components the sound system framework can utilise to direct.

1. Fully Observable & Partially Observable:
- Fully Observable:The specialist can see instantly the condition of the environment in a noticeable environment at a certain time.
- Somewhat Perceptible:The specialist cannot see entirely the environment conditions in a slightly discernible environment. It had deficient or boisterous perceptions.
2. Static and Dynamic:
- Static:The climate in a static environment does not change while the specialist is thinking. It does not vary over time.
- Dynamic:The climate (environmental) can change in a powerful environment, especially when the specialist is operating to decide on their activities. There were the activities of the specialist or external variables that could cause changes to occur.
3. Discrete and Nonstop:
- Discrete:The state space as well as the activity space are both limited and countable in a discrete environment.
- Continuous:In a constant environment, the state space, activity space, or both are stable (inconsistent with discrete qualities), that is, it values the realm of values on it in contrast to discrete qualities.
4. Deterministic and Stochastic:
- Deterministic:This circumstance of the environment, however, is not set in stone (given by the present status and action made by the specialist) in a deterministic environment.
- Stochastic:This includes vulnerability in a stochastic environment. Irregularity or likelihood affects the following condition of the environment, regardless of whether the present status and activity are known.
Actuators

The artificial intelligent system includes parts or some components that perform the activities or secondary reactions defined by the smart system as actuators in artificial intelligence. These are the ways through which the artificial intelligence framework interacts with the environment. Depending on the use case, actuators are available in different shapes and configurations.

Types of Actuators

Functional attributes of actuators are a field of Artificial Intelligence, which can be classified in different ways. For example, they can be actuators (engines or servos) that control the development of robot attachments in advanced mechanics. Actuators can be programmed parts that function to create text, speech, or visual results (as a consequence) under virtual conditions.

The Job of Actuators in an Artificial Intelligence System?

The extension of dynamic cycles of an artificial intelligence framework and the effect of these cycles on the environment is the basis of actuators. In light of how the artificial intelligence system comprehends the climate and what performance measures DWORD intended to achieve, they execute the activities or commands created by the artificial intelligence system. The actuator’s viability and the exactness of the actuators are significant in determining the whole performance of artificial intelligence applications.

Sensors

Sensing the Environment in AI

These are instrumental parts that gather information and data about environmental elements and use artificial intelligence technology as input for the same. Urgent information that they provide them outfit the environment intelligence framework to build a picture of what it is seeing and understanding about it, what is going on in its environment. These are the artificial intelligence system’s sensory apparatus working with all-around informed independent direction.

The Significance of Sensors in AI

An artificial intelligence system depends on sensors as they provide the crude information that drives dynamic cycles. Basic elements to the exactness and unwavering quality of the sensors, as any inconsistency or mistake made in the sensor data will confound the artificial intelligence system to perform flawed exercises. Then, calibration and sensor blend techniques are employed to make the sensors even more accurate.

Integrating PEAS Components

1. Achieving AI Intelligence through Integration

What it means is that AI systems exhibit intelligent behaviour because it is the efficient integration of the PEAS components. Performance metrics guide the AI’s decision-making, and it understands the environment such that it can adapt to changing conditions. When sensors and necessary operations supply the required input are carried out with actuators, and the system is said to be a closed-loop system.

2. Challenges in PEAS Integration

Integrating PEAS components in a sophisticated AI system is often infeasible. Actuators should be designed and tested thoroughly so they react appropriately to the AI decision, and sensors provide accurate data. The next crucial step is to pick appropriate performance metrics that match the goals of the AI system.

3. Case Studies for the Integration of PEAS

As an example, below I considered a self-driving automobile to illustrate the idea of PEAS integration. In this case, the performance of the car to its destination with a quick and safe ride can be used as the performance metric. The environment is the weather, the traffic, and the road.

The vehicle’s actuating elements (also known as actuators) control the vehicle’s braking, steering, and acceleration, and the vehicle’s sensing elements (also known as sensors) obtain data from GPS, LiDAR, and cameras so that all the form data can be used to guide navigation and make decisions.

Types of Agents Using PEAS

The use of the PEAS framework (Performance Measure, Environment, Actuators, Sensors) is required for building and understanding different types of AI agents. Based on the complexity and the extent of use of the PEAS model, the agents are broadly categorised.

Reactive Agents: Simple and Reflexive Applications of PEAS

The key to our AI agent is a reactive agent. Specifically, they take raw sensory tokens to outputs without holding a state and without planning. Such agents are interfaced with conditions or predefined rules to respond to the environment.

How PEAS Applies?
- Performance Measure: It emphasises correctness or speed up which minimises errors or maximises speed. An example was given that a vacuum cleaner agent could have its performance measure be the amount of dirt it cleans every time, for example.
- Environment: Typically, they are employed in static or semi-static ones with changes regularly or not at all, for instance, cleaning a room or recording the temperature of a controlled area.
- Actuators: For example, it can turn on/ off switches, move left or right, or trigger (e.g., start suction of a vacuum cleaner).
- Sensors: For a robotic vacuum that depends on the external dirt sensor, for a thermostat that measures temperature, or for basic sensors that can sense the current state of the environmental conditions.
Examples of Reactive Agents:
- Automatic vacuum cleaners (e.g., Roomba)
- Basic thermostats
- Motion-detection security systems
Advantages:
- Fast and efficient in predictable environments.
- Simple and cost-effective to implement.
Limitations:
- The ability to change or to learn from experience; not the ability to become adaptable to changes that are not expected or to experience effects from not being able to understand.
- Limited scope of functionality in dynamic or complex environments.
Deliberative Agents: PEAS in Planning and Decision-Making

Reactive agents are less basic than deliberative agents. They can have an internal environment model that helps in learning and making informed decisions. In fact, there are often agents who actually exhibit directed behaviour towards an objective and, in fact, use planning algorithms.

How PEAS Applies?
- Performance Measure: In this, it has a set of long-term objectives to complete a task in the shortest possible time or in the least possible efficient manner to reach a destination.
- Environment: Typically, more complex, dynamic, and partially observable. They could include, such as navigation to the city, delivery of packages, etc.
- Actuators: Advanced activities, route selection, task execution, manipulation of object(s) according to planned strategy.
- Sensors: The third generation sensors are what we call the cameras and other things that generate super detailed input from the visual and location sources, respectively.
Examples of Deliberative Agents
- Autonomous vehicles plan optimal routes.
- Delivery drones.
- Scheduling systems for manufacturing processes.
Advantages
- Capable of handling complex environments with dynamic changes.
- It can decide the goals available along with the resources currently at hand.
Limitations
- Computationally intensive due to planning and decision-making processes.
- Perhaps it must know the environment.
Learning Agents: Integration of Feedback for Evolving PEAS Configurations

It is therefore the highest class of the most adaptable and intelligent agents. And consequently, they will become better and can learn from what they have done. Furthermore, this feedback from the environment makes them more able to make their decisions and take their actions.

How PEAS Applies?
- Performance Measure: Metrics that improve with learning, for instance, decreasing error rate, accuracy, or task time minimum, etc.
- Environment: They are complex and dynamic spaces that have no predictable names.
- Actuators: The Flexibility and adaptability of these actions are based on the first ones, which are learning algorithms. For example, if we are thinking about some near-gameplay adaptive actions (if thinking of some game-play related actions), or if we are thinking of an alternate trading strategy for a game (or updating a trading algorithm).
- Sensors: Continuously and rich and diverse input sensors that draw from the video feed as well as other video feeds and user interaction.
Examples of Learning Agents:
- Recommendation systems (e.g., Netflix, Amazon).
- Gaming applications of AI (e.g., AlphaGo and Chess entries).
- Personal assistants (e.g., Siri, Google Assistant).
Advantages:
- Over time, it can change greatly, and it can merge into other places.
- Handles complex and unpredictable scenarios effectively.
Limitations:
- High computational and data requirements.
- Risk of overfitting or incorrect learning in the absence of proper feedback mechanisms.
AI PEAS Examples

To demonstrate the PEAS framework, I will examine some instances.

Driverless Cars
- Performance Measure:The measures for driverless cars are to guarantee passenger safety and punctual arrivals, safety navigation, and effective route planning.
- Environment:In addition to the roads, traffic patterns, pedestrians, and weather, the environment must be traversed by the automobile.
- Actuators:The braking, steering, and accelerating systems of the car’s parts are carried by actuators that do what the AI has prescribed.
- Sensors:Sensors on the car, such as Lidar, cameras, GPS, radar, and more, let the vehicle see and respond to the world around it in real time.
Industrial Robots
- Performance Measure: You have to make sure that you have reached the maximum productivity. Secondly, you need to get the maximum precision of your processes, the third is to minimise the energy demand, and the last is to minimise the downtime.
- Environment: Factory floors, conveyor belts, machines, raw materials, and other robots or workers.
- Actuators: These can be grippers, robotic arms, conveyor motors, welding, or painting tools, etc.
- Sensors: Cameras, proximity sensors, force sensors, temperature sensors, and motion detectors.
Virtual Assistants (e.g., Alexa, Siri)
- Performance Measure: It finds out and responds to user queries correctly and executes the command as soon as possible, thus making the user happy.
- Environment: Smart devices like voice inputs and smart home devices, mobile apps, and cloud.
- Actuators: Speakers, app interfaces, text-to-speech converters, and smart device controllers all produce the output of text-to-speech converters.
- Sensors: Data sources via microphones through the voice input and APIs.
Gaming AI (e.g., Chess Engines, Video Game NPCs)
- Performance Measure: The game will become quite a hard yet pleasant game, where players wish to win the game or just raise their score for players.
- Environment: Game board (e.g., chess), virtual worlds, other players, and in-game rules.
- Actuators: In addition, having the ability to move chess pieces, control in-game characters, trigger animations, or events.
- Sensors: A player action, the game state, and environmental conditions (enemy position in an RPG).
January 20, 2026
What is the Turing Test in AI
In 1950, Alan Turing introduced a test to check whether a machine can think like a human or not; this test is known as the Turing Test. In this test, Turing proposed that a computer can be said to be intelligent if it can mimic human responses under specific conditions.

Turing introduced the Turing Test in his 1950 paper, “Computing Machinery and Intelligence,” which considered the question, “Can machines think?”

The Turing test is based on a party game, “Imitation Game,” with some modifications. This game involves three players, in which one player is the Computer, another player is the human responder, and the third player is the human Interrogator, who is isolated from the other two players and whose job is to find which player is the machine among the two of them.

Consider Player A is a computer, Player B is human, and Player C is an interrogator. The interrogator is aware that one of them is a machine, but he needs to identify this on the basis of questions and their responses.

The conversation between all players is via keyboard and screen, so the result would not depend on the machine’s ability to convert words into speech.

The test result does not depend on each correct answer, but only on how closely it responds like a human answer. The computer is permitted to do everything possible to force a wrong identification by the interrogator.

The questions and answers can be like:

Interrogator: Are you a computer?

Player A (Computer): No

Interrogator: Multiply two large numbers, such as (256896489*456725896)

Player A: Long pause and gives the wrong answer.

In other words, if an interrogator cannot tell which is human and which is a machine, the computer passes the test, and it is said to be intelligence or a thinking machine.

The prize competition was announced in 1991 by New York businessman Hugh Loebner, offering $100,000 to the first computer to pass the Turing test. But no AI program to date has yet passed an undiluted Turing test.’

History of the Turing Test

In the history of Artificial Intelligence (AI), the 1950s Turing Test introduced by Alan Turing is a very remarkable milestone. In his paper ‘Computing Machinery and Intelligence,’ he came to light. To answer this profound question, Turing sought to replicate machine intelligence that is comparable to the human kind.

Turing had become interested in the idea of creating thinking machines that display intelligent behavior; this curiosity was the basis of it. The Turing Test was his practical method for deciding whether such a machine could converse naturally with a human being enough to be considered human.

This test was the foundation of AI research and the first discussion of machine intelligence because of Turing’s work on it. This offered a basis for the evaluation of AI systems. The Turing Test has changed structurally and continues to form a hobby relative to improvement and the need for debate. Nevertheless, it was of huge historical importance in developing AI and continues to be of motivation to current researchers and a benchmark to measure AI progress.

Variations of the Turing Test

New versions of the Turing Test have been proposed throughout the years in an attempt to overcome these limitations and reflect on the true capabilities of an AI:
1. Total Turing Test: It is an extended version of the Turing test that is not restricted to text-based conversation. It determines how well the machine understands and reacts to not only words, but also visual and physical cues that the interrogator provides. It includes seeing the things shown to it and taking the desired actions about them. Essentially, it tried to determine whether the AI can walk in the world in a way that reflects a more in-depth understanding of the world.
2. Reverse Turing Test: Here, the roles are reversed, with a twist on the traditional Turing Test. Here, it is the machine itself that acts as the interrogator. It is supposed to distinguish humans from other machines based on the responses it gets. With this reversal, the AI is put to the test to evaluate the intelligence of your kind, as it becomes possible to detect artificial intelligence.
3. Multimodal Turing Test: The Multimodal Turing Test is a concept to assess AI’s capability to process and respond to various modes of communication at the same time in a world where communication can be in many ways. The thing looks into whether or not AI can effortlessly process and reply to text, speech, images, and maybe even other modalities all at once. This was a variation that accepted the many ways in which we communicate and asked if the AI can run with the complex ways in which we engage.
Chatbots to attempt the Turing test

ELIZA: Joseph Weizenbaum was the creator of ELIZA, a Natural language processing computer program. The reason for being was to prove the ability of machines to communicate with humans. The Turing Test always implied an attempt to bring forth one of the first chatterbots.

Parry: Kenneth Colby created a chatterbot named Parry in 1972. Parry was designed to simulate a schizophrenic individual (the most common chronic mental disorder). They described Parry as ‘ELIZA with attitude’. In the early 1970s, Parry was tested using a variant of the Turing Test.

Eugene Goostman: Eugene Goostman, a chatbot, was created in Saint Petersburg in 2001 and participated in various versions of the Turing Test. Goostman won the competition declared to be the largest Turing test competition in the world, with 29 percent of judges being fooled into thinking it was human. Goostman resembled a 13-year-old virtual boy.

The Chinese Room Argument

Many philosophers were against the whole concept of Artificial Intelligence. I heard of the most famous argument on this list, that of the ‘Chinese room’.

In 1980, John Searle produced the thought experiment in his paper ‘Mind, Brains, and Program’ called the ‘Chinese Room’, and it contradicted the notion of Turing’s test. In his argument, he said, Programming a computer will make a computer capable of talking in words, but a real understanding of words and consciousness of a computer.

Whatever those devices (Machines such as ELIZA and Parry) might pass the Turing test with ease by playing with the keywords and symbols, they at once had no real understanding of language, he added. Therefore, it is not even a ‘thinking’ capability, such as a human.

Features Required for a Machine to Pass the Turing Test
- Natural language processing: But in this case, the most frequent format for the Interrogator’s communication with us is in a language that would be used by humans in general, for example, in English.
- Knowledge representation: As a means of storage and retrieval of information during the test.
- Automated reasoning: Answering these is a matter of using the information already stored.
- Machine learning: Adapters that should be able to allow the business services to adapt to the new changes (through the new collaboration models) and the patterns (that are inherent in business services).
- Vision (For total Turing test): A test is something that recognises the action of the interrogator and other objects.
- Motor Control (For total Turing test): This: To do if what is asked is triggered.
Limitations of the Turing Test
- Not a True Measure of Intelligence: That’s not only that; it’s not even machine intelligence, and even less so machine consciousness just because it passes the Turing Test. This type of criticism of the computer’s power to recreate human-like responses without or with understanding or consciousness is John Searle’s ‘Chinese Room’ objection.
- Simplicity of Test Scenarios: As to the Turing test, the world of human attention that will be occupied in text-based human interaction will lack all of what a machine can and cannot do to see and respond to the world.
Applications of the Turing Test in Artificial Intelligence

Role in Chatbot and Virtual Assistant Development

Another fascinating type of AI application is Chatbots and virtual assistants such as Chatgpt, Alexa, Siri, etc., which try to replicate the same human communication with all its merits.
- Designing Human-like Conversations: Developers often create chatbots so that the chatbot can as much as possible pass the Turing Test, or pass as a human. This is really about being in context, changing tone, and returning vehicles, etc, of a given context.
- The Evaluation of User Engagement: When searching for such AI systems that meet the Turing Test criteria, they are found to be smarter in dealing with the human desire to have human interest between humans and humans. For example, most of the historical performance of the virtual assistants has included several conversational innuendoes joined together to form a ‘personalised’ nature, i.e., humour, empathy, etc.
- Iterative Improvement Based on Feedback: This is done by methods inspired by the Turing Test, following the form of Continuous Refinement Through Feedback. Systems that are refined are those that do not rely on their ability to handle ambiguous queries, while at the same time being able to detect humour and give intelligent responses.
Advancing Natural Language Understanding

The development of natural language understanding (NLU) plays a key role in AI at a speed due to the Turing test.
- Contextual Understanding: The idea of Turing is about contextual understanding. For example, it is also in such demand that AI systems have had to invent sophisticated processing algorithms to run in the background and generate flowing, following conversations.
- Semantic Analysis and Disambiguation: Therefore, AI models have been trained such that they can mitigate the close semantic difference between different senses of polysemous words and sentences over a context in order to pass the Turing Test.
- Enhanced Machine Translation: The need for the transmission of information in a human way, in such a way that its accuracy and cultural importance, beyond being compliant with the natural language patterns, encourage machine translation.
Benchmarking Human-like Behaviour in AI Systems

The Turing Test, therefore, becomes a sense measure for the behaviour of an AI system as to whether or not it is ‘human’.
- Evaluating Conversational Competence: For example, it is assessed by its ability to engage in human forms of thought in interaction. If we do so, it is a good sign because we are working on something to get human-equivalent intelligence for some of the tasks related to that.
- Comparative Analysis across AI Systems: This test can classify the area(s) of strengths and the areas where the AI systems need to improve. It encourages greater competition and innovation on the part of developers to streamline more and better models.
- Setting Goals for General AI: While the inspired long-term goals for developing general AI systems that can perform a variety of human-like tasks are specific to the domain, even for current AI systems, it is still too many.
Advantages of the Turing Test
- Simplicity and Intuitive Understanding: The Tutoring Test is an easy-to-understand and simple process. Defined as testing something that is around human interaction, something that is intuitive to the point that even an average person with technical knowledge can test the machine’s intelligence.
- Focus on Practical Outcomes: In the end, this test will be judged to be desirable as it is simply a conceived mesh system of humans and machines.
- Encourages Natural Language Processing (NLP) Development: Since it is an NLP-based test, it has good development in NLP, so it’s a necessary part of chatbots, virtual assistants, and basically all conversational AI systems.
- Universal Appeal: Whether solved using a problem or not, and whether respecting a Turing test or not, we can talk about the Turing Test not needing any problem solution as a dependency.
- Historical and Philosophical Significance: The Turing Test became a sort of hot point of philosophical discussion on the topic of intelligence and consciousness, at the same time being a most popular area of AI and cognitive science research.
Disadvantages of the Turing Test
- Ambiguity in Defining Intelligence: The test does not examine other types of intelligence, e.g., creativity, problem-solving, or ethical reasoning, but tests the machine’s capacity to imitate human behaviour.
- Focus on Deception: Once one has passed the Turing Test, it often means tricking the human judge rather than having real intelligence. And it is precisely this reliance on deception that the superficial results from AI do not tell of true progress.
- Bias toward Human-Like Behaviour: The problem here is that it assumes human-like behaviour of intelligence, and there is no reason that we cannot be as an intelligent or indeed more so than nonhuman beings than humans.
- Limited Scope: Talking is what is meant by the Turing Test, and little or nothing else is involved. Physical tidy-up, perceiving, and reasoning.
- Vulnerability to Predefined Scripts: Such script responses or loopholes circumvent the test to mislead about the actual power of AI systems, possibly.
- Ethical and Philosophical Critiques: Philosophers like John Searle (Chinese Room Argument) have questions about whether passing the Turing Test stood in for intelligence, understanding, or awareness and, by extension, whether it is a valid test of intelligence.
January 20, 2026
Agent and Environment in Artificial Intelligence
In Artificial Intelligence, an ‘agent’ is an intelligent system with sensors and actuators that operates in some environment and is attempting to satisfy certain objectives. The environment refers to the outside world around which the agent lives and describes the context in which the agent takes action.

The agents can vary from basic thermometers to sophisticated robots. Each one follows a perception-action control cycle, acting based on the information it extracts from the environment, and no two are alike. This is the unique thing about AI: interaction-being able to sense an environment as some agent and act adequately, being able to perceive what is important.

An agent in AI is an autonomous object in the world that can perceive and respond. That’s because when you have an agent in your environment, you have to adapt to the climate as well essentially and the agent is going to have to make some series of decisions.

Examples of Agents:
- Robotic Vacuum Cleaners: They can identify the presence of obstacles and clean the floors effectively with the help of sensors used in them.
- Self-Driving Cars: Self-driving cars use their senses to perceive the road environment and make decisions on the road to drive safely based on the current conditions.
- Virtual Assistants: Virtual assistants like Alexa or Siri interpret the inputs from the users, comprehend commands, and perform tasks ranging from sending reminders to even operating smart devices.
What is an Environment in AI?

The environment in the context of Artificial Intelligence, therefore, refers to the conditions surrounding an agent in order to achieve a given task. It is the circumstance in which the agent performs and which furnishes it with feedback. A physical space or a virtual space can be designed to model real processes or to model concepts.

The agent gets feedback from the environment in terms of its action, and the environment itself determines the amount of rewards it will gain from the completion of its goals.

Examples of Environments
- A maze for a robot navigating toward a goal.
- A virtual environment in which an AI can play a game and confront other players.
- Real-life environment of AVs, such as traffic conditions, weather, and road conditions, among others.
- By this, AI systems are developed to work in a certain way to overcome such issues and accomplish their aims in any relevant environment.
Agent Terminology

It is also vital to get familiar with the key opinions regarding agents to understand better how they work in their environments:
- Percept: Percept refers to the signal that an agent receives from its environment through its sensors or input that an agent can receive or acquire from its surroundings. For instance, a robotic vacuum cleaner infers the layout of the area and/or objects that it may encounter within the region.
- Percept Sequence: The entire record of all the percepts an agent has perceived since a certain time. It assists the agent in making decisions based on the past inputs it has received. For example, a self-driving car utilizes a percept sequence to make changes in its decision-making process based on previous experience in traffic patterns.
- Action: The action is the response or output of the perceived percept or percept sequence that the agent initiates. Actions are taken through appendages that include a robot’s arm, fingers, and other executing instruments in an AI-assisted system, such as making a move in a game of chess.
Types of Intelligent Agents

Intelligent agents are classified according to their functions and capabilities, as well as the intelligence that they possess.

Simple Reflex Agents

These agents operate within a current state and are not cognizant of historical data. Answers come from the event-condition-action in which the users trigger an event, and next, the agent will look to the event-condition-action list and the corresponding predefined actions.

Model-based Reflex Agents

These agents act like reflex agents, but their knowledge of their surroundings is more extensive. Subsequently, a model of the world is built into the internal system that includes the agent’s history.

Goal-based Agents

These are the agents that are also known as the rational agents since they contain, in addition to the information stored by the model-based agents, goal information or information describing a desirable world.

Utility-based agents

These agents are similar to goal-based agents, but they have an extra utility range that compares each of them based on the goal and chooses the best action. Some examples of the rating criteria include the probability of success or the necessary resources.

Types of Environment in AI

Several kinds of AI environments are regularly applied. Some of the types of environments in AI include deterministic, stochastic, fully observable, partially observable, continuous, discrete, episodic, sequential, static, dynamic, single-agent, multi-agent, competitive, and collaborative.

Fully Observable vs Partially Observable Environment

Based on how much information the agent has about the status of the environment at any given moment, the first type of environment in AI can be either fully or partially observed.

The condition in which the agent has complete knowledge of the environment’s present state is known as the fully observable environment. Specializing environment: The agent alone has access to every aspect of the environment needed for making decisions. Checkers, chess, and other games are instances of fully visible environments.

A partially observable environment is one where an agent cannot get a full view of the environment at a specific moment. The agent can only interact with a part of the environment, and the part of the environment that is inaccessible to the agent may be a number of things. A few examples of partially observable environments are when you are trying to drive your car through traffic.

Deterministic vs Stochastic

The environment in artificial intelligence can be categorized as stochastic or deterministic based on the predictability of the results from an agent’s action. An environment is said to be deterministic if the result of an action can be predicted with absolute certainty. In fact, the outcome of an agent’s behavior is entirely determined by the condition of the environment. When an agent’s actions directly result in consequences, the environment is said to be deterministic.

Examples of deterministic environments are cases where the environment has clear and predictable responses, such as simple mathematical problems where the result of each arithmetic operation is distinctly described.

A stochastic environment, on the other hand, is one where the results of an action are not guaranteed but include probability. The environment just has a part in producing the result of an agent’s task, and there is an element or probability involved. Some examples of stochastic situations include games of chance, such as the play of card games, such as poker, or games that involve the spin of a wheel, such as roulette.

Competitive vs. Collaborative

In types of environment in AI, another one will be classified under either the competitive or cooperative type, based on the nature of the relationships that the agents display, where the agents may be in direct competition with each other or cooperate towards the completion of a shared objective.

A competitive environment is a situation where many agents are fighting to achieve different objectives. The performances of all the individual agents are dependent on bottlenecks and other agents, and the agents have to cooperate and compete in an endeavor to execute their goals. One good example of a competitive environment is games such as chess.

In a collaborative environment, there are a number of agents involved in some form of cooperative project. Agent success only comes with the success of the other agents, and to accomplish the goals and targets developed, the agents need to work together cohesively. Some of the works related to collaborative environments include factors such as search and rescue.

Single-agent vs Multi-agent

The environment in Artificial Intelligence may be categorized as either a single or multi-agent environment based on the number of entities that are in the environment.

A single-agent environment refers to a system where an agent has to act to accomplish a certain task and has to take action alone. Some examples of single-agent environments include puzzles and maze games. The agent has to apply some relevant search algorithms or planning methods to find its way to the goal state.

A multi-agent environment is a setting where the different agents engage in transactions and act upon the surroundings with a view to attaining personal or mutual objectives. Classical examples of multi-agent environments are multiplayer games and traffic simulations. Thus, to decide on the agent’s behavior, the agents must apply game theory or call on multi-agent reinforcement learning.

Static vs Dynamic

Another way of categorizing the environment in AI is through changing or non-changing environments, that is, the type of change it undergoes.

A static environment is constant and unchanging. The environment’s state is immutable, and the agent’s activities won’t alter it. Examples of static environments are problems in a math class or puzzles like a Rubik’s cube. The agent could, for instance, use the search algorithm or a decision tree to improve the way the agent behaves.

An environment that is always changing is said to be dynamic. The environment’s state is immutable, and the agent’s activities won’t alter it. Examples of dynamic environments are video games or robotics applications. Thus, the agent has to employ methods such as planning or reinforcement learning with the purpose of improving its activity in accordance with the new environment.

Discrete vs Continuous

The environment within Artificial Intelligence can be categorized into discrete or continuous based on the state and action space.

The collection of all potential states that the environment may be in is known as the state space. For example, the state space in a chess game would be a collection of all possible places for pieces on the checkered board. In a robotic control job, the state space can contain data on the location or velocity of the robot and its surroundings, among others.

The action space is the action set from which the agent selects the action to be taken in any of the states of the environment. For example, if one is in a game such as chess, the action space would be defined as the set of moves that one can make in the game. When performing a robotic control job, the action space may contain orders to alter the robot’s speed or direction.

It is an environment where both the state and action spaces are countable or are always discrete. A discrete environment is where the section of space occupied by the participants of a game is strictly deterministic, for instance, a board game like chess or checkers. The agent’s decision can be made using some method, such as a search algorithm or a decision tree.

On the other hand, in a continuous environment, both state and action spaces are constant and infinite. Continuous environments include robotics or control systems since the function up to the point of discontinuity is continuous.

As seen above, the control of the state-transition model in a constant environment requires that the agent’s decision-making process is also continuous in the sense that the state and action spaces are continuous. It has to incorporate skills such as reinforcement learning or optimization so as to learn and adapt as well.

Episodic vs Sequential

In AI, based on the given work and the mapping between the agent’s action and environment, the environment can be episodic or sequential.

An episodic environment describes a scenario in which an agent taking an action cannot change the future states of an environment. It is aimed at maximizing the immediate shifting received at the end of each episode by the agent as opposed to the N-step. Chess is one of the examples of games that are played in an episodic environment. The agent can also employ models such as the Monte Carlo method or Q-learning to ensure the best policy for each episode.

However, in the sequential environment, the decisions of the agent impact the future states of the environment. Such an agent aims to finally achieve the highest total of the obtained rewards after several interactions. Sequential environments are, for instance, Robotics applications or video games.

Due to the existence of a look-ahead, the agent has to employ dynamic planning methods such as dynamic programming or reinforcement learning in order to attain the best policy for many steps.

Known vs Unknown

The environment in artificial intelligence can be and is categorized on the basis of the amount of information about the environment, such as a known environment and an unknown environment.

An environment is considered known when the agent is fully aware of its payoffs, transition functions, and environmental regulations. The agent is always fully aware of the set of actions open to it; the result of each action is fully predictable. Typically known environments include games such as chess or tic-tac-toe. In a known environment, the agent is able to combine appropriate strategies, such as search algorithms or decision trees, in its operation.

An example of an unknown environment is one where the agent has no understanding or very little understanding of the rules of the environment, the state transition, and the rewards that may be expected from any action. The agent may not know the actions allowable in this particular state, or the result of the action may be unpredictable.

It should be noted that unknown environments are, for example, exploration tasks or real-life applications. Techniques such as reinforcement learning or the exploration-exploitation dilemma must be applied to optimize the agent’s behavior when it is in an environment that it cannot identify.

However, it is crucial to understand that the distinction between Known vs Unknown and fully observable vs partially observable environments is orthogonal. For instance, an environment can be recognized as a known but partially observable environment, or it can be identified as an unknown, fully observable environment.

Turing Test and Environment Interaction

The Turing Test is a test that measures an agent’s capability and efficiency in basing their actions on a normal human being. As stated earlier, this type of test is sensitive to the complexity of the environment, and the extent to which the agent can meet it is indicative of its adaptability in this environment.

In basic settings, an agent is just able to follow specific procedures, and in an environment that is complex, unpredictable, and ethereal, the agent needs to be able to reason, solve problems, learn, and choose in the best interest.

In fact, knowledge about the environment is one of the most important requirements for the design of agents to perform optimally in any given conditions.
January 20, 2026

Intelligent Agent in AI

An AI system can be defined as the study of the rational agent and its environment. The agents sense the environment through sensors and act on their environment through actuators. An AI agent can have mental properties such as knowledge, belief, intention, etc.

What is an Agent?

An agent can be anything that perceiveits environment through sensors and act upon that environment through actuators. An Agent runs in the cycle of perceiving, thinking, and acting. An agent can be:

Human-Agent: A human agent has eyes, ears, and other organs which work for sensors and hand, legs, vocal tract work for actuators.
Robotic Agent: A robotic agent can have cameras, infrared range finder, NLP for sensors and various motors for actuators.
Software Agent: Software agent can have keystrokes, file contents as sensory input and act on those inputs and display output on the screen.

Hence the world around us is full of agents such as thermostat, cellphone, camera, and even we are also agents.

Before moving forward, we should first know about sensors, effectors, and actuators.

Sensor: Sensor is a device which detects the change in the environment and sends the information to other electronic devices. An agent observes its environment through sensors.

Actuators: Actuators are the component of machines that converts energy into motion. The actuators are only responsible for moving and controlling a system. An actuator can be an electric motor, gears, rails, etc.

Effectors: Effectors are the devices which affect the environment. Effectors can be legs, wheels, arms, fingers, wings, fins, and display screen.

Intelligent Agents:

An intelligent agent is an autonomous entity which act upon an environment using sensors and actuators for achieving goals. An intelligent agent may learn from the environment to achieve their goals. A thermostat is an example of an intelligent agent.

Following are the main four rules for an AI agent:

Rule 1: An AI agent must have the ability to perceive the environment.
Rule 2: The observation must be used to make decisions.
Rule 3: Decision should result in an action.
Rule 4: The action taken by an AI agent must be a rational action.

Rational Agent:

A rational agent is an agent which has clear preference, models uncertainty, and acts in a way to maximize its performance measure with all possible actions.

A rational agent is said to perform the right things. AI is about creating rational agents to use for game theory and decision theory for various real-world scenarios.

For an AI agent, the rational action is most important because in AI reinforcement learning algorithm, for each best possible action, agent gets the positive reward and for each wrong action, an agent gets a negative reward.

Note: Rational agents in AI are very similar to intelligent agents.

Rationality:

The rationality of an agent is measured by its performance measure. Rationality can be judged on the basis of following points:

Performance measure which defines the success criterion.
Agent prior knowledge of its environment.
Best possible actions that an agent can perform.
The sequence of percepts.

Note: Rationality differs from Omniscience because an Omniscient agent knows the actual outcome of its action and act accordingly, which is not possible in reality.

Structure of an AI Agent

The task of AI is to design an agent program which implements the agent function. The structure of an intelligent agent is a combination of architecture and agent program. It can be viewed as:

Agent = Architecture + Agent program

Following are the main three terms involved in the structure of an AI agent:

Architecture: Architecture is machinery that an AI agent executes on.

Agent Function: Agent function is used to map a percept to an action.

f:P* → A

Agent program: Agent program is an implementation of agent function. An agent program executes on the physical architecture to produce function f.

PEAS Representation

PEAS is a type of model on which an AI agent works upon. When we define an AI agent or rational agent, then we can group its properties under PEAS representation model. It is made up of four words:

P: Performance measure
E: Environment
A: Actuators
S: Sensors

Here performance measure is the objective for the success of an agent’s behavior.

PEAS for self-driving cars:

Let’s suppose a self-driving car then PEAS representation will be:

Performance: Safety, time, legal drive, comfort

Environment: Roads, other vehicles, road signs, pedestrian

Actuators: Steering, accelerator, brake, signal, horn

Sensors: Camera, GPS, speedometer, odometer, accelerometer, sonar.

Example of Agents with their PEAS representation

Agent	Performance measure	Environment	Actuators	Sensors
1. Medical Diagnose	Healthy patientMinimized cost	PatientHospitalStaff	TestsTreatments	Keyboard (Entry of symptoms)
2. Vacuum Cleaner	CleannessEfficiencyBattery lifeSecurity	RoomTableWood floorCarpetVarious obstacles	WheelsBrushesVacuum Extractor	CameraDirt detection sensorCliff sensorBump SensorInfrared Wall Sensor
3. Part -picking Robot	Percentage of parts in correct bins.	Conveyor belt with parts,Bins	Jointed ArmsHand	CameraJoint angle sensors.

January 20, 2026

Types of Agents in AI

Any entity with sensors for sensing its environment and actuators for acting on its environment is an agent in the field of artificial intelligence (AI). Agents are a key component in the design of an intelligent system because agents have the potential to learn, adapt, make choices, and interact with the environment. AI agents are of many types based on their complexity, capabilities, and tasks; they can be as simple as a reflex agent or as sophisticated as a learning agent.

Agents can be grouped into five classes based on their degree of perceived intelligence and capability. All these agents can improve their performance and generate better actions over time. These are given below:

Simple Reflex Agent
Model-based reflex agent
Goal-based agents
Utility-based agent
Learning agent

1. Simple Reflex agent:

The Simple reflex agents are the simplest agents. These agents make decisions on the basis of the current percepts and ignore the rest of the percept history. These agents only succeed in a fully observable environment. The Simple reflex agent does not consider any part of the percepts’ history during their decision and action process. The Simple reflex agent works on a Condition-action rule, which means it maps the current state to an action. Such as a Room Cleaner agent, it works only if there is dirt in the room.

Problems with the simple reflex agent design approach:

They have very limited intelligence
They do not have knowledge of non-perceptual parts of the current state
Mostly too big to generate and to store.
Not adaptive to changes in the environment.

Here is an implementation of a simple reflex agent.

Code:

import matplotlib.pyplot as plt  

import matplotlib.patches as patches  

from time import sleep  

  

# Defining  the simple reflex agent rule  

def agent_Simple_Reflex(loc, stat):  

    if stat == "Dirty":  

        return "Suck"  

    elif loc == "A":  

        return "Move Right"  

    elif loc == "B":  

        return "Move Left"  

  

# Visualizing with the function  

def environment_draw(loc, environment, step, act):  

    fig, ax = plt.subplots(figsize=(6, 3))  

  

    # Draw two rooms, A and B  

    ax.add_patch(patches.Rectangle((0, 0), 3, 3, fill=True, color='gray' if environment["A"] == "Dirty" else 'lightgreen'))  

    ax.text(1.5, 1.5, "A\n" + environment["A"], ha='center', va='center', fontsize=12)  

  

    ax.add_patch(patches.Rectangle((3, 0), 3, 3, fill=True, color='gray' if environment["B"] == "Dirty" else 'lightgreen'))  

    ax.text(4.5, 1.5, "B\n" + environment["B"], ha='center', va='center', fontsize=12)  

  

    # Drawing agent  

    X_agent = 1.5 if loc == "A" else 4.5  

    ax.plot(X_agent, 2.5, marker="o", markersize=20, color="blue")  

    ax.text(X_agent, 2.9, "🤖", ha='center', va='center', fontsize=14)  

  

    # Text info  

    ax.set_title(f"Step {step}: {act}")  

    ax.axis("off")  

    plt.pause(1)  

    plt.close()  

  

# Simulating the  environment and visualizing  

def run_vacuum_world_with_visualization(start_loc, environment):  

    loc = start_loc  

    for step in range(1, 6):  # Run for 5 steps  

        stat = environment[loc]  

        act = agent_Simple_Reflex(loc, stat)  

  

        # Visualization  

        environment_draw(loc, environment, step, act)  

  

        # Updating the environment  

        if act == "Suck":  

            environment[loc] = "Clean"  

        elif act == "Move Right":  

            loc = "B"  

        elif act == "Move Left":  

            loc = "A"  

  

#environment setup  

state_environment = {"A": "Dirty", "B": "Dirty"}  

run_vacuum_world_with_visualization("A", state_environment.copy())  

<textarea>

Output:

The agent cleaned the rooms based on the current perception without memory or planning, but failed to act optimally when both rooms were clean or when actions required memory.

2. Model-based reflex agent

The Model-based agent can work in a partially observable environment, and track the situation.
A model-based agent has two important factors:
- Model: It is knowledge about “how things happen in the world,” so it is called a Model-based agent.
- Internal State: It is a representation of the current state based on percept history.
These agents have the model, “which is knowledge of the world” and based on the model they perform actions.
Updating the agent state requires information about:
1. How the world evolves
2. How the agent’s action affects the world.

Here is an implementation of a model based reflex agent.

Code:

import matplotlib.pyplot as plt  

import matplotlib.patches as patches  

  

# Function of Model-based reflex agent   

def model_based_reflex_agent(loc, percept_stat, model_internal):  

    # Updating the internal model with current percept  

    model_internal[loc] = percept_stat  

  

    # Decision rules  

    if percept_stat == "Dirty":  

        return "Suck"  

    elif model_internal["A"] == "Dirty":  

        return "Move Left"  

    elif model_internal["B"] == "Dirty":  

        return "Move Right"  

    else:  

        return "NoOp"  # Do nothing if everything is clean  

  

# Visualizing with function  

def environment_draw(loc, state_env, step, act):  

    fig, ax = plt.subplots(figsize=(6, 3))  

  

    # Room A  

    ax.add_patch(patches.Rectangle((0, 0), 3, 3, color='gray' if state_env["A"] == "Dirty" else 'lightgreen'))  

    ax.text(1.5, 1.5, "A\n" + state_env["A"], ha='center', va='center', fontsize=12)  

  

    # Room B  

    ax.add_patch(patches.Rectangle((3, 0), 3, 3, color='gray' if state_env["B"] == "Dirty" else 'lightgreen'))  

    ax.text(4.5, 1.5, "B\n" + state_env["B"], ha='center', va='center', fontsize=12)  

  

    # Agent  

    X_agent = 1.5 if loc == "A" else 4.5  

    ax.plot(X_agent, 2.5, marker="o", markersize=20, color="blue")  

    ax.text(X_agent, 2.9, "🤖", ha='center', va='center', fontsize=14)  

  

    ax.set_title(f"Step {step}: {act}")  

    ax.axis("off")  

    plt.pause(1)  

    plt.close()  

  

# Running the model-based reflex agent simulation  

def run_model_based_agent(start_loc, environment):  

    loc = start_loc  

    model_internal = {"A": "Unknown", "B": "Unknown"}  

  

    for step in range(1, 7):  # Run for 6 steps  

        stat_curr = environment[loc]  

        act = model_based_reflex_agent(loc, stat_curr, model_internal)  

  

        environment_draw(loc, environment, step, act)  

  

        # Update environment and loc  

        if act == "Suck":  

            environment[loc] = "Clean"  

        elif act == "Move Right":  

            loc = "B"  

        elif act == "Move Left":  

            loc = "A"  

        elif act == "NoOp":  

            break  # Stop when everything is clean  

  

# Start the environment  

state_environment = {"A": "Dirty", "B": "Dirty"}  

run_model_based_agent("A", state_environment.copy())  

<textarea>

Output:

The agent successfully cleaned both rooms using internal memory to track the environment state, allowing it to behave more intelligently than the simple reflex agent.

3. Goal-based agents

The knowledge of the current state environment is not always sufficient to decide for an agent to decide what to do. The agent needs to know its goal which describes desirable situations. Goal-based agents expand the capabilities of the model-based agent by having the “goal” information.

They choose an action so that they can achieve the goal. These agents may have to consider a long sequence of possible actions before deciding whether the goal is achieved or not. Such considerations of different scenarios are called searching and planning, which makes an agent proactive.

Here is an implementation of a Goal Based agent.

Code:

import matplotlib.pyplot as plt  

import matplotlib.patches as patches  

  

# Planning the logic for goal-based agent  

def goal_based_agent(loc, world_state, goal):  

    # Plan: Clean both rooms  

    plan = []  

  

    # If current room is dirty, clean it  

    if world_state[loc] == "Dirty":  

        plan.append("Suck")  

    else:  

        # Check if the other room is dirty and go there  

        other = "B" if loc == "A" else "A"  

        if world_state[other] == "Dirty":  

            move_act = "Move Right" if loc == "A" else "Move Left"  

            plan.append(move_act)  

        else:  

            plan.append("NoOp")  # Do nothing if goal is achieved  

    return plan[0]  

  

# Visualization function  

def draw_vacuum_world(loc, world_state, step, act):  

    fig, ax = plt.subplots(figsize=(6, 3))  

  

    # Room A  

    ax.add_patch(patches.Rectangle((0, 0), 3, 3, color='gray' if world_state["A"] == "Dirty" else 'lightgreen'))  

    ax.text(1.5, 1.5, "A\n" + world_state["A"], ha='center', va='center', fontsize=12)  

  

    # Room B  

    ax.add_patch(patches.Rectangle((3, 0), 3, 3, color='gray' if world_state["B"] == "Dirty" else 'lightgreen'))  

    ax.text(4.5, 1.5, "B\n" + world_state["B"], ha='center', va='center', fontsize=12)  

  

    # Agent  

    X_agent = 1.5 if loc == "A" else 4.5  

    ax.plot(X_agent, 2.5, marker="o", markersize=20, color="blue")  

    ax.text(X_agent, 2.9, "🤖", ha='center', va='center', fontsize=14)  

  

    ax.set_title(f"Step {step}: {act}")  

    ax.axis("off")  

    plt.pause(1)  

    plt.close()  

  

# Simulate the Goal-Based Agent  

def run_goal_based_agent(start_loc, environment):  

    loc = start_loc  

    goal = {"A": "Clean", "B": "Clean"}  # Define goal  

  

    for step in range(1, 7):  

        stat_curr = environment[loc]  

        act = goal_based_agent(loc, environment, goal)  

  

        draw_vacuum_world(loc, environment, step, act)  

  

        # Update environment  

        if act == "Suck":  

            environment[loc] = "Clean"  

        elif act == "Move Right":  

            loc = "B"  

        elif act == "Move Left":  

            loc = "A"  

        elif act == "NoOp":  

            break  # Stop if goal is achieved  

  

# Initial state  

env_initironment = {"A": "Dirty", "B": "Dirty"}  

run_goal_based_agent("A", env_initironment.copy())  

<textarea>

Output:

By organizing its activities according to the current and goal states and halting when the goal was reached, the agent was able to accomplish its objective of cleaning both rooms.

4. Utility-based agents

These agents are similar to the goal-based agent but provide an extra component of utility measurement, which makes them different by providing a measure of success at a given state. Utility-based agents act based not only on goals but also the best way to achieve the goal.

The Utility-based agent is useful when there are multiple possible alternatives, and an agent has to choose in order to perform the best action. The utility function maps each state to a real number to check how efficiently each action achieves the goals.

Here is an implementation of a Utility Based agent.

Code:

# Importing Libraries  

import matplotlib.pyplot as plt  

import matplotlib.patches as patches  

  

def utility_calcu(state_env, act):  

    utility = 0  

    if state_env["A"] == "Clean" and state_env["B"] == "Clean":  

        utility += 100  

    elif state_env["A"] == "Dirty" or state_env["B"] == "Dirty":  

        utility += 10  

  

    if act == "Suck":  

        utility += 0  

    elif act.startswith("Move"):  

        utility -= 5  

    elif act == "NoOp":  

        utility -= 1  

  

    return utility  

  

# decision-making on the base of utility  

def utility_based_agent(loc, state_env):  

    possible_acts = ["Suck", "Move Right", "Move Left", "NoOp"]  

    best_act = None  

    utility_best = float('-inf')  

  

    for act in possible_acts:  

        state_simulated = state_env.copy()  

        sloc_simulated = loc  

  

        # taking the effect of action  

        if act == "Suck":  

            state_simulated[sloc_simulated] = "Clean"  

        elif act == "Move Right":  

            sloc_simulated = "B"  

        elif act == "Move Left":  

            sloc_simulated = "A"  

  

        utility = utility_calcu(state_simulated, act)  

        if utility > utility_best:  

            utilityutility_best = utility  

            best_act = act  

  

    return best_act  

  

# Visualizing with function  

def environment_draw(loc, state_env, step, act):  

    fig, ax = plt.subplots(figsize=(6, 3))  

  

    # Drawing Room A  

    ax.add_patch(patches.Rectangle((0, 0), 3, 3, color='gray' if state_env["A"] == "Dirty" else 'lightgreen'))  

    ax.text(1.5, 1.5, "A\n" + state_env["A"], ha='center', va='center', fontsize=12)  

  

    # Drawing Room B  

    ax.add_patch(patches.Rectangle((3, 0), 3, 3, color='gray' if state_env["B"] == "Dirty" else 'lightgreen'))  

    ax.text(4.5, 1.5, "B\n" + state_env["B"], ha='center', va='center', fontsize=12)  

  

    # Drawing the Agent  

    X_agent = 1.5 if loc == "A" else 4.5  

    ax.plot(X_agent, 2.5, marker="o", markersize=20, color="blue")  

    ax.text(X_agent, 2.9, "🤖", ha='center', va='center', fontsize=14)  

  

    ax.set_title(f"Step {step}: {act}")  

    ax.axis("off")  

    plt.pause(1)  

    plt.close()  

  

# Running the  utility-based simulation  

def run_utility_agent(start_loc, state_env):  

    loc = start_loc  

    for step in range(1, 7):  

        stat_curr = state_env[loc]  

        act = utility_based_agent(loc, state_env)  

  

        environment_draw(loc, state_env, step, act)  

  

        # Performing the  action  

        if act == "Suck":  

            state_env[loc] = "Clean"  

        elif act == "Move Right":  

            loc = "B"  

        elif act == "Move Left":  

            loc = "A"  

        elif act == "NoOp":  

            break  # stop if action is do nothing  

  

# Starting the  simulation  

env_init = {"A": "Dirty", "B": "Dirty"}  

run_utility_agent("A", env_init.copy())  

<textarea>

Output:

By effectively balancing cleaning with minimal movement, the agent chose actions that maximized its utility and avoided pointless operations after utility was maximized.

5. Learning Agents

A learning agent in AI is a type of agent that can learn from its past experiences, or it has learning capabilities. It starts to act with basic knowledge and then is able to act and adapt automatically through learning. A learning agent has mainly four conceptual components, which are:

Learning element: It is responsible for making improvements by learning from the environment
Critic: Learning element takes feedback from the critic, which describes how well the agent is doing with respect to a fixed performance standard.
Performance element: It is responsible for selecting external action
Problem generator: This component is responsible for suggesting actions that will lead to new and informative experiences.
Hence, learning agents are able to learn, analyze performance, and look for new ways to improve performance.

Here is an implementation of a Learning agent.

Code:

import random  

import matplotlib.pyplot as plt  

import matplotlib.patches as patches  

  

# Agent that learns cleaning strategy over time  

class SmartVacuum:  

    def __init__(self):  

        # Stores learned value of each (room, state) pair  

        self.memory = {  

            ("A", "Dirty"): 10,  

            ("B", "Dirty"): 10,  

            ("A", "Clean"): 0,  

            ("B", "Clean"): 0  

        }  

        self.rate = 0.5    # How much new info changes memory  

        self.discount = 0.9  # Not used in this version  

  

    def decide(self, room, condition):  

        # Very basic policy: clean if dirty, else switch room  

        if condition == "Dirty":  

            return "Clean"  

        return "Go Right" if room == "A" else "Go Left"  

  

    def update_memory(self, room, condition, points):  

        state = (room, condition)  

        old_val = self.memory.get(state, 0)  

        self.memory[state] = old_val + self.rate * (points - old_val)  

  

# Simple visual feedback  

def draw_rooms(agent_spot, room_state, count, move):  

    fig, ax = plt.subplots(figsize=(6, 3))  

  

    # Show room A  

    ax.add_patch(patches.Rectangle((0, 0), 3, 3, color='gray' if room_state["A"] == "Dirty" else 'lightgreen'))  

    ax.text(1.5, 1.5, f"A\n{room_state['A']}", ha='center', va='center', fontsize=12)  

  

    # Show room B  

    ax.add_patch(patches.Rectangle((3, 0), 3, 3, color='gray' if room_state["B"] == "Dirty" else 'lightgreen'))  

    ax.text(4.5, 1.5, f"B\n{room_state['B']}", ha='center', va='center', fontsize=12)  

  

    # Draw agent  

    bot_x = 1.5 if agent_spot == "A" else 4.5  

    ax.plot(bot_x, 2.5, marker="o", markersize=20, color="navy")  

    ax.text(bot_x, 2.9, "🤖", ha='center', va='center', fontsize=14)  

  

    ax.set_title(f"Turn {count}: {move}")  

    ax.axis("off")  

    plt.pause(0.8)  

    plt.close()  

  

# Running the agent through steps  

def run_simulation():  

    # Start with random dirt in rooms  

    rooms = {  

        "A": random.choice(["Clean", "Dirty"]),  

        "B": random.choice(["Clean", "Dirty"])  

    }  

  

    bot_place = "A"  

    cleaner = SmartVacuum()  

  

    for i in range(1, 11):  

        condition = rooms[bot_place]  

        next_move = cleaner.decide(bot_place, condition)  

  

        draw_rooms(bot_place, rooms, i, next_move)  

  

        # Action results in reward or penalty  

        if next_move == "Clean":  

            reward = 10  

            rooms[bot_place] = "Clean"  

        elif "Go" in next_move:  

            reward = -1  

            bot_place = "B" if bot_place == "A" else "A"  

        else:  

            reward = 0  

  

        # Agent updates its value estimate  

        cleaner.update_memory(bot_place, rooms[bot_place], reward)  

  

# Start the program  

run_simulation()  

<textarea>

Output:

Through experience-based behavior modification and reward-based learning, the agent gradually enhanced its performance, leading to more intelligent and effective cleaning.

January 20, 2026

Author: saqibkhan

How means-ends analysis Works?

Operator Subgoaling

Algorithm for Means-Ends Analysis:

Example of Mean-Ends Analysis:

Solution:

Uses of Means-Ends Analysis

Implementation

Advantages of Hill climb algorithm:

Disadvantages of Hill Climbing Algorithm

Features of Hill Climbing:

State-space Diagram for Hill Climbing:

Different regions in the state space landscape:

Types of Hill Climbing Algorithm:

1. Simple Hill Climbing:

Algorithm for Simple Hill Climbing:

2. Steepest-Ascent hill climbing:

Algorithm for Steepest-Ascent hill climbing:

3. Stochastic hill climbing:

Hybrid Methods

1. Hill climbing combined with simulated annealing

2. Mountain Climbing and Genetic Algorithms

3. Memetic Algorithms’ Hill Climbing

Problems in Hill Climbing Algorithm:

Simulated Annealing:

Applications of Hill Climbing Algorithm

Implementation

Formula of A* Search Algorithm

Key components

How does the A* Search Algorithm work?

Step-by-Step Implementation of the A* Algorithm

Pseudocode of A* Search Algorithm

Implementing the A* search algorithm in Python

Understanding Heuristic Search h(n)

Advantages of A* Search Algorithm in Artificial Intelligence

Disadvantages of A* Search Algorithm in Artificial Intelligence

Applications of the A* Search Algorithm in Artificial Intelligence

Comparison of A* Search Algorithm with Other Search Algorithms

Conclusion

Frequently Asked Questions (FAQs)

1. Breadth-first Search:

Advantages:

Disadvantages:

Example:

2. Depth-first Search

Note: Backtracking is an algorithm technique that uses recursion to find all possible solutions.

Advantages:

Disadvantage:

Example:

3. Depth-Limited Search Algorithm:

Advantages:

Disadvantages:

Example:

4. Uniform-cost Search Algorithm:

Advantages:

Disadvantages:

Example:

5. Iterative Deepening Depth-first Search (IDDFS):

Steps for Iterative deepening depth-first search algorithm:

Advantages:

Disadvantages:

Example:

Completeness:

6. Bidirectional Search Algorithm:

Advantages:

Disadvantages:

Example:

Applications of Uninformed Search Algorithms

Artificial Intelligence

Network Routing

Puzzle Solving

Strengths of Uninformed Search Algorithms

Weaknesses of Uninformed Search Algorithms

Problem-solving Agents:

Properties of Search Algorithms:

Importance of Search Algorithms in Artificial Intelligence

Search Algorithm Terminologies

Types of Search Algorithms

Uninformed/Blind Search

Informed Search