Help With COMP3620/6320 Assignment 3Help With Python Assignment

COMP3620/6320 Artificial Intelligence

Assignment 3: SAT-Based Planning

The Australian National University

Semester 1, 2019

May 7, 2019

1 Background

SAT-based planning is a powerful approach to solve planning problems that relies on unrolling a propo-

sitional logic theory over time and checking whether or not a parallel plan exists.

Early SAT encodings of planning problems were generated by hand, but now SAT based planners provides

automatic grounding for the actions, reachability analysis (to prune provably unreachable propositions

or actions), and automatic translation to the corresponding formula.

Today SAT-based planning techniques do best when solving planning problems that allow for a high

degree of parallelism and where the number of time steps to reach a goal state is not very high.

In this assignment you will implement various automatic translations of STRIPS planning instances

(supplied as PDDL files) into SAT. The tedious part of the work has already been done (grounding the

PDDL into objects representing STRIPS actions and propositions, generating a plangraph that computes

fluent mutex relationships, calling the SAT solver with the encodings you generate, and validating the

resulting plans). This should allow you to focus on the fun parts, that is generating CNF encodings of

planning problems and interpreting the solutions found by the SAT solver.

The figure at the top of this handout is a graphical representation of the precondition and effect clauses

in a five-step encoding of a small logistics problem. The blue nodes represent fluents and the red nodes

represent actions. There is an arc between an action and a fluent if they appear in a precondition or

effect clause together.

2 Preliminary: The Planning System

In this assignment you will complete parts of a SAT-based planning system implemented in Python.

This system uses some pre-compiled binaries for grounding the planning problem and solving the SAT

instances your encodings will create.

The planner takes a planning problem specified as a domain PDDL file and a problem PDDL file. It then

uses the selected encoding (and other options) to generate and solve CNF SAT instances with planning

horizons chosen by the selected query (evaluation) strategy. If one of these instances is satisfiable, the

system extracts and attempts to validate a plan from the satisfying assignment returned by the SAT

solver.

Due to the dependence on pre-compiled binaries for grounding and SAT solving and Unix-specific system

calls to run these binaries and manage the temporary files created by the system, the system is only

guaranteed to work on x64 Linux and Mac machines. If you are using Windows 10, we suggest doing

this assignment inside the Windows Subsystem for Linux.

If you have trouble with the supplied binaries gringo and precosat, you can obtain other binaries and

source from https://potassco.org/ and http://fmv.jku.at/precosat/.

To display details about how to run the system, use python planner.py -h. The planner is run with

the command:

python3 planner.py DOMAIN PROBLEM EXPNAME HORIZON [options]

where

• DOMAIN is the PDDL domain file.

• PROBLEM is the PDDL problem file.

• EXPNAME is an arbitrary string used to store temporary files of the experiment.

• HORIZONS is used to set a maximum number of time steps. Different options can be selected:

– If the fixed query strategy is chosen, then the horizon should be a list of planning horizons

separated by : characters. For example, 1:5:7 would plan for the horizons 1, 5, and 7.

– If the ramp query strategy is chosen, then the horizon should be three numbers start:end:step

– the starting horizon, end horizon, and horizon step size. For example, 2:8:2 would plan at

the horizons 2, 4, 6, and 8.

See below on how to select the query strategy.

• -o OUTPUT specifies the file in which the resulting plan is stored (default: None).

• -q QUERY specifies the query strategy to be used: either fixed (default) or ramp.

• -p PLANGRAPH is a boolean specifying whether graphplan preprocessing is used or not (default:

false).

• -l PGCONS specifies what constraints should be included in encodings from the plangraph (default:

both):

– fmutex includes just the fluent mutex axioms.

– reachable includes just reachable action axioms.

– both includes both sets of axioms.

• -x EXECSEM specifies the execution semantics (default: parallel):

– serial means that at most one action can be executed per time step.

– parallel means that multiple actions per time step can be selected as long as any order is

a valid one.

• -e ENCODING specifies the CNF encoding to be used (default: basic). You can also select lo-

gistics to activate the advanced exercise.

• -s SOLVER selects the SAT solver to use. There is only one installed with the system currently, so

ignore this option.

• -t TIMEOUT specifies an optional timeout (in seconds) for each run of the SAT solver (default:

None).

• -d DBGCNF is a boolean that specifies if the system should generate a CNF file annotated with

variable names for you to use to debug your encodings. If set to true, it outputs a .cnf_dbg file

into the tmp_files directory. If there’s an error in your implementation and you’re not sure what

the problem is, turn this flag to true and manually examine the debug file. (default: false).

• -r REMOVETMP is a boolean that specifies whether the system has to remove the temporary files

generated (default: false).

Here’s an example command:

python3 planner.py benchmarks/miconic/domain.pddl benchmarks/miconic/problem01.pddl miconic1 4

Some hints and implementation notes:

• The important information to help you write your encodings is located in strips_problem.py.

There you will find the data structures used to represent the STRIPS planning problems.

• The directory benchmarks contains planning problems in PDDL format. Planning problems come

in two pieces: a domain describing the model of the actions and a problem file describing the

initial state, goal and objects of your interest.

• Use the small problems to test your code as the big ones can take centuries to be solved as long as

you do not have a good action encoding or specific domain knowledge.

• The solver won’t think a correct plan has been found until Exercise 9 has been completed.

• The comments in cnf_encodings/basic.py provide further instructions on how to answer Exer-

cise 1–9. Please read them carefully.

3 From Actions to CNF formulas

The first part of this assignment is on the generation of a CNF encoding for your STRIPS problem. The

CNF has to encode the set of possible state transitions up to a maximum horizon. Please have a look

at the lecture slides for more details, before starting the assignment.

We are going to encode grounded STRIPS planning problems. Such a problem is a tuple < P,A, I,G >

where P is the set of propositions, A is a set of actions (with their preconditions and effects), I ⊆ P the

initial state expressed in the closed world form, and G ⊆ P is a set of atoms that needs to be true at

the end of the plan execution.

The system only allows you to add CNF clauses to your encodings. So, you will need to translate the

planning axioms to implement into clauses (on paper, in your head, etc.) and then write code which

generates and adds these clauses. Here is a brief recap of the main transformation steps that you need

to do in order to turn any formula into a CNF representation:

1. Re-write all (A↔ B) as (A→ B) ∧ (B → A).

2. Re-write all (A→ B) as (¬A ∨B).

3. Translate the formula into NNF (negation normal form) by pushing negations “inwards”, so that

there is no negation next to a symbol other than propositional symbols (actions and fluents).

This will involve applying the double negation elimination and De Morgan’s Laws. For example,

¬(A ∧B) becomes (¬A ∨ ¬B) and ¬(A ∨B) becomes (¬A ∧ ¬B).

4. Distribute over disjunctions. For example, (A ∧ B) ∨ (C ∧D) becomes (A ∨ C) ∧ (A ∨D) ∧ (B ∨

C) ∧ (B ∨D).

5. Finally, remove False and ¬True literals from clauses.

6. Remove clauses with True and ¬False literals.

Note that you don’t need to write any code to transform axioms to CNF. You just need to encode the

final CNF formulae directly in your Python code.

4 Exercises

4.1 Exercise 1: Action and Fluent Variables (5 marks)

For this question you need to implement the propositional variables which will be used to represent your

encoding. Where k is the planning horizon, the variables you need to generate are:

• p@t for each proposition p ∈ P and t ∈ [0, k]

p@t is a fluent denoting that p holds at step t, e.g. on(A,B)@3

• a@t for each a ∈ A and t ∈ [0, k − 1]

a@t is an action fluent denoting that a occurs at step t, e.g. stack(A,B)@2

For example, if you have 3 propositions and 4 actions, and you are looking for a plan which has a

maximum length of 10, you will have (3× 11 + 4× 10) = 73 variables.

Add these variables in the method make_variables in cnf_encodings/basic.py. In the code, each

variable is represented internally with an integer. For example, proposition 1 in step 0 might be repre-

sented with the integer 1, while its negation is represented with the integer -1.

4.2 Exercise 2: Initial State and Goal Axioms (5 marks)

For this question and all subsequent exercises, you need to use the propositional variables you made in

Exercise 1 in clauses representing the various axioms.

• All propositions that are contained in the initial state and only these must hold at step 0 (Closed

World Assumption): ∧

p∈s0

p@0 ∧

∧

p 6∈s0

¬p@0

• The goal condition must be true after k steps:∧

p∈g

p@k

Add these clauses in the method make_initial_state_and_goal_axioms in cnf_encodings/basic.py.

4.3 Exercise 3: Precondition and Effect Axioms (5 marks)

For each action a ∈ A, if a occurs at step t then:

• Its preconditions must be true at step t:

a@t→

∧

p∈pre(a)

p@t

• Its positive effects are true at step t+ 1:

a@t→

∧

p∈eff+(a)

p@t+1

• Its negative effects are false at step t+ 1:

a@t→

∧

p∈eff−(a)

¬p@t+1

In this planning system, we will consider any action that adds and deletes the same proposition to be

invalid. Such actions are weeded out during the grounding process.

Add these clauses in the method make_precondition_and_effect_axioms in cnf_encodings/basic.py.

4.4 Exercise 4: Explanatory Frame Axioms (10 marks)

These clauses state that the only way a fluent can change truth value is via the execution of an action

that changes it.

For each proposition p ∈ P , if p occurs at step t < k then:

• If a fluent becomes true, then an action must have added it:

(¬p@t ∧ p@t+1)→

∨

a∈A

p∈eff+(a)

a@t

• If a fluent becomes false, then an action must have deleted it:

(p@t ∧ ¬p@t+1)→

∨

a∈A

p∈eff−(a)

a@t

Add these clauses in the method make_explanatory_frame_axioms in cnf_encodings/basic.py.

4.5 Exercise 5: Serial Mutex Axioms (10 marks)

These clauses prevent any actions whatsoever from being executed in parallel. For each pair of actions

(a, a′) ∈ A, where a 6= a′ and both a and a′ occurs at step t:

• The actions cannot occur in parallel: ∧

a,a′∈A2,a6=a′

¬a@t ∨ ¬a′@t

You should notice that some actions cannot be executed in parallel, even without adding explicit mutex

clauses. To get full marks for this question, only add mutex clauses for pairs of actions which are not

already ruled out by inconsistent effects.

Note that since serial mutex axioms remove parallelism which typically leads to encodings that are

less efficient to solve, they are not normally used in SAT planners unless one really wants to produce

sequential plans.

Add these clauses in the method make_serial_mutex_axioms in cnf_encodings/basic.py.

4.6 Exercise 6: Interference Mutex Axioms (10 marks)

These clauses ensure that two actions a and a′ cannot be executed in parallel at a step t if they interfere.

The clauses have the same form as those generated in Q5, but they only apply to inteferring actions.

Two actions a and a′ interfere if there is a proposition p, such that p ∈ EFF−(a) and p ∈ PRE(a′) or

vice versa. To get full marks, you should not add clauses for interfering actions if their parallel execution

is already prevented by effect clauses due to inconsistent effects. Also, take the necessary precautions so

to avoid adding duplicate clauses.

Add these clauses in the method make_interference_mutex_axioms in cnf_encodings/basic.py.

4.7 Exercise 7: Reachable Action Axioms (5 marks)

The planner computes and uses the plangraph to further improve the encoding. A side-effect of computing

the plangraph is getting sound bounds on the first level at which actions can be executed. For each action

a, if we know that a cannot be executed before step t, then we can add the following clause for each step

t′ < t:

¬a@t′

Look in the method make_reachable_action_axioms in cnf_encodings/basic.py to see how to get

this reachability information.

4.8 Exercise 8: Fluent Mutex Axioms (5 marks)

Another side-effect of computing the plangraph is obtaining a set of fluent mutex relationships. These

tell us that certain pairs of propositions cannot both be true at a given step.

These clauses are not needed for correctness, but in some cases they can make planning much more

efficient!

Assert these mutex relationships with clauses along the lines of those for the action mutex relationships

in Questions 5 and 6.

See method make_fluent_mutex_axioms in cnf_encodings/basic.py for details.

4.9 Exercise 9: Extracting a Plan (5 marks)

Once the SAT solver has found a CNF instance to be satisfiable it returns a satisfying assignment to

the variables in this instance. As you created these variables, you are in a position to interpret this

satisfying assignment and build a plan from it!

This is as simple as finding the true action variables and inserting the corresponding actions into a plan

in an order which is consistent with its time step indices.

See the method build_plan in cnf_encodings/basic.py for details.

4.10 Exercise 10: Using control knowledge in SAT (20 Marks)

If you have successfully completed the first part of this assignment and tested the planning system

on a number of larger benchmark problem instances, you will probably agree that domain-independent

planning is hard! Of course, in general classical planning is PSPACE-complete, but even solving bounded

length instances is NP-complete.

However, for some domains there are domain-specific procedures to find (usually sub-optimal) plans in

polynomial time. For example, in Blocksworld we can find a plan which is guaranteed to be within

a factor of 2 of the length of an optimal plan by simply stacking all blocks onto the table and then

re-stacking them correctly.

In this part of the assignment you are going to leverage the hard work you have already done. You are

going to add some constraints on top of the basic encoding you developed in questions 1-6, and represent

domain-specific control knowledge for the Logistics domain (benchmarks/logistics).

Often control knowledge for planning problems is based on LTL (Linear Temporal Logic) and you might

get inspired by studying this. However, we do not expect you to implement an automatic compilation of

arbitrary LTL into SAT. LTL formulae can be proven to be useful to find plans quickly, see this reference

for more details.

With good control knowledge many problems can become easier to solve, at the expense of generality,

optimality and sometimes, even completeness when plans exist (the specified control knowledge may

well be conveying constraints on action execution that cannot be satisfied). On the other hand, control

knowledge can also be used to obtain plans of better quality by constraining the search space only to

seek for solutions conforming to specific constraints. Hopefully, once you have completed this part of

the assignment, your planner will be able to solve larger instances of the Logistics domain than was

possible with the basic encoding alone or to produce better plans on the average, or both.

Control Knowledge in Logistics

The Logistics domain is about finding a plan to use trucks and planes to move packages around. Each

package starts at some location and must be moved to some other goal location. There is a set of regions

(or cities), consisting of locations. Trucks can move packages around the locations within each region,

but airplanes are needed to move packages between regions.

The STRIPS Logistics domain abstracts away some of the complexities of this problem, but still leaves

an interesting and challenging planning domain.

For this exercise you will implement some additional planning constraints in the file

cnf_encodings/logistics_control.py

specifically for the Logistics problem. You can assume that the actions and propositions in the Problem

instance come from this planning domain (see benchmarks/logistics) for details.

For example in this domain, control knowledge rules can be used to restrict the way trucks, packages

and airplanes can move, but they should preserve SOME solution (the problems might be very easy to

solve if you added a contradiction, but wholly uninteresting!).

As an example rule to get you started, you could assert that if a package was at its destination, then

it cannot leave. That is you could iterate over the goal of the problem to find the propositions which

talk about where the packages should end up and make some constraints asserting that if one of these

propositions is true at step t then it must still be true at step t+ 1.

If you need to determine the type of an object or what cities locations are in, you can use string matching

against the names of objects (e.g. package1, city4-1). Have a look inside the .pddl files for more

details.

You will be marked based on the correctness and inventiveness of the control knowledge you devise, as

well as its effectiveness in conjunction with the basic encoding (exercises 1-6) without plangraph mutexes.

You should aim to make at least three different control rules. Feel free to leave in (but comment out)

rules which you abandon if you think they are interesting and want us to look at them.

Use the flag -e logistics to select this encoding when running the planner, and use the flag -p false

to disable plangraph mutexes. The execution semantics and options will work as normal. For example:

python3 planner.py benchmarks/logistics/domain.pddl benchmarks/logistics/problem05.pddl logistic1 13 -e

logistics -p false

The way you provide your solution can be both corroborated by theoretical results (by providing guar-

antee on the way the control rules limit the search space) or empirically by showing evidence that the

strategy you are proposing is effective for the planner to scale up (increase the number of Logistics

problems solved or coverage) or/and produce on the average better plans (plan length) with comparable

computational effort. Our evaluation is not merely quantitative but also and more importantly quali-

tative. You should motivate your decisions and highlight both their advantages and/or limitations (if

any).

Remember to put comments in your code and in report.pdf to explain your approaches.

4.11 Exercise 11: Understanding Planning Problems (20 Marks)

In this exercise you need to carry out an experimental analysis on a set of benchmark domains. The

empirical part in planning is a fundamental step in the development of a planning system as what is

really interesting is the behavior of the system on common situations, rather than in general. There

could in fact be no general guarantees as planning is still a PSPACE-Complete problem.

In this exercise you will study the various encoding techniques developed. In particular, you need to

consider serial vs parallel planning. For each type, there are four sub-configurations:

• No fluent mutex or reachable action axioms;

• Only fluent mutex axioms;

• Only reachable action axioms;

• Both fluent mutex and reachable action axioms.

You need to select three domains from the benchmark folder (blocks, depot, logistics, miconic, pipesworld,

and rovers). For each domain, you must understand and evaluate the implications of the various configu-

rations on the total time spent and the quality of the produced plans. Each problem instance corresponds

to solving a number of bounded planning problems. The number of time steps starts from 0 and keeps

going until a plan is found.

Note that there are 8 configurations to consider. We require you to select 3 domains for which you

will try to solve the first 10 instances. This task amounts at solving 240 planning problems, so we

highly recommend you to have some kind of python or bash script to launch the experiments. If you set

a 100-second timeout, you will gather all the experimental results in 6 hours in the worst case. To set

the timeout, you can use the timeout command in Linux. For example:

timeout 100 python3 -u planner.py benchmarks/depot/domain.pddl benchmarks/depot/problem05.pddl depot_temp

1:30:1 -x parallel -l both -p false -q ramp | tee depot-1-05.log

The tee command writes the output to a log file. You need Python’s -u argument for tee to work. For

Mac, replace timeout with gtimeout. If gtimeout is not yet installed, run brew install coreutils.

It is up to you to figure out the best way of explaining this behaviour. Make use of graphs and tables if

needed. Marks will be awarded according to the quality of the presentation, the thoroughness of details,

and depth of analysis.

Another thing to consider is the query strategy which decides what is the sequence of horizons to try in

order to find a plan. Is the serial query strategy always the right answer? Can we do something more

intelligent?

Put your experimental results and discussion in report.pdf. Also submit a script called run_experiments.py

or run_experiments.sh that can be used to reproduce the numbers in your report.

And that’s the end of Assignment 3 :-)

QQ：99515681
WeChat：codinghelp
Email：99515681@qq.com
Work Time：8:00-23:00

Hots

Ghostwriter Cs1b Spring 2024 Tth Hw08h... 2024-04-19
Help With Managing Financial Risk Prob... 2024-04-19
Ghostwriter Cs 0449 – Project 5: /Dev/ 2024-04-19
Ghostwriter Elec 2141 Digital Circuit ... 2024-04-19
Help With Csc171 — Videogame Projecthe 2024-04-19
Help With Comp3411 Artificial Intellig 2024-04-19
Help With Stat3061: Random Processes &... 2024-04-19
Ghostwriter Accounting 452, Spring 202... 2024-04-19
Ghostwriter Finc5001 Foundations In Fi... 2024-04-19
Ghostwriter 7Ssmm712 – Topics In Appli 2024-04-19
Help With Com 337 - Film Studies For T... 2024-04-19
Ghostwriter Mes202tc - Digital Vlsi Sy... 2024-04-19
Ghostwriter Geography 2041B Distance S... 2024-04-19
Ghostwriter Ecos3006 International Tra... 2024-04-19
Help With Fit5225 2024 Sm1 Creating An... 2024-04-19
Help With Cit 593: Introduction To Com... 2024-04-19
Help With Math 4931: Take Home Examgho... 2024-04-19
Ghostwriter Csci 547|Info 533: Systems... 2024-04-19
Ghostwriter Cs536-S24 Intro To Pls And... 2024-04-19
Help With Fit5212 - Assignment 1Ghostw... 2024-04-19

Programming Assignment Help！