The required reading is the relevant sections from
http://rakaposhi.eas.asu.edu/cse494/notes/ch2.pdf
you are also welcome to read the optional readings or Kleinberg's JACM paper and Page/Brin/Motwani's unplublished pagerank paper.
rao
Class blog for CSE 574 Planning & Learning (ASU)
Bring your slides for today on a memory stick. We will load them on
the class computer in the beginning so the presentations can be
streamlined.
thanks
rao
rao
---------- Forwarded message ----------
From: William Cushing <william.cushing@gmail.com>
Date: Tue, Apr 29, 2008 at 8:34 AM
Subject: Fwd: Real world blocksworld
To: "Plan-Yochan@Parichaalak. Eas. Asu. Edu"
<plan-yochan@parichaalak.eas.asu.edu>
Indeed, Hector took the picture. "Optimizing the Steel Plate Storage
Yard Crane Selection Problem", ICAPS 2007 (DC Poster). I believe a
short paper version exists in the proceedings somewhere...
picture: http://picasaweb.google.es/hectorpal/Icaps2007/photo#5116899415236524914
that group's publication list:
http://www2.imm.dtu.dk/pubdb/public/publications.php?cmd=full_view&pubtype=§ion=7
A related journal article on blockstacking (no joke, that's one of the
keywords):
http://www.springerlink.com/content/f60826852715v545/
(I attached the pdf if you don't have access to springer)
-Will
---------- Forwarded message ----------
From: William Cushing <william.cushing@gmail.com>
Date: Tue, Oct 2, 2007 at 7:07 PM
Subject: Real world blocksworld
To: plan-yochan <plan-yochan@parichaalak.eas.asu.edu>
http://picasaweb.google.es/hectorpal/Icaps2007/photo#5116899415236524914
This poster is on a domain which is very blocksworld-ish. Also has
towers-of-hanois-ish in that there is no table of infinite capacity;
only stacks. However, there appear to be no stacking restrictions
(objects don't have different weights or sizes that require sorted
stacks).
The steel plates *do* have individual names which distinguish them, a
highly important technical point (in the comparison with blocksworld).
The goal state is not a stacking configuration; the goal is to
minimize movement over time (as I understand it), each day one is to
deliver certain plates to a conveyer at the far right of the
coordinate system. One has a nominal schedule when plates will
arrive and when they will be requested, allowing one to subgoal on
intermediate stacking configurations (or rather a sequence of stacking
configurations) which minimizes movements while achieving those
schedules. (and then there is some online planning for each day as
requests and supplies deviate from nominal).
So many interesting components that bar applying FF directly to the
problem...but....a big core part of the technical problem *is*
blocksworld. On ~1000 "blocks", in case anyone likes to think of
planning + scaling.
-Will
thanks
rao
For 4/29 (i.e., next Tuesday's class), I will take about 10 min to
do a "here is what we did" spiel. The rest of the class will be spent
with
each of you making a 5-min presentation about your project findings.
This will also serve as a quick summary of your report (that you will
also
bring to the class that day). Ideally, you make this presentation as
an advertisement to draw people to read your report (you will also
give me electronic--
pdf and ppt versions) of the report and talk.
I think there is a reasonable amount of diversity in the projects that
it is worth everyone hearing summaries of what others did).
thanks
Rao
-------------------
Subbarao Kambhampati
http://rakaposhi.eas.asu.edu/rao.html
http://www2.parc.com/isl/members/minhdo/publications/2008/do.pdf
rao
---------- Forwarded message ----------
From: Subbarao Kambhampati <subbarao2z2@gmail.com>
Date: Wed, Apr 23, 2008 at 2:26 PM
Subject: [CSE574 Planning & Learning] Reading for tomorrow (as well as
slides and audi...
To: subbarao2z2@gmail.com
Folks
For Minh Do's guest lecture tomorrow, you might want to read the
following paper
http://www.aaai.org/Conferences/AAAI/2008/aaai08nectar.php
(it is
only 5 pages!)
(Additional related papers are on his homepage at
http://www2.parc.com/isl/members/minhdo/ )
----------
Also, for those of you who missed Biplav Srivastava's talk on web
service composition today, his slides
and audio are up on the class notes page http://rakaposhi.eas.asu.edu/cse574
see you tomorrow.
Rao
--
Posted By Subbarao Kambhampati to CSE574 Planning & Learning at
4/23/2008 02:26:00 PM
For Minh Do's guest lecture tomorrow, you might want to read the
following paper
http://www.aaai.org/Conferences/AAAI/2008/aaai08nectar.php
(it is
only 5 pages!)
(Additional related papers are on his homepage at
http://www2.parc.com/isl/members/minhdo/ )
----------
Also, for those of you who missed Biplav Srivastava's talk on web
service composition today, his slides
and audio are up on the class notes page http://rakaposhi.eas.asu.edu/cse574
see you tomorrow.
Rao
thanks
rao
---------- Forwarded message ----------
From: James Collofello <JAMES.COLLOFELLO@asu.edu>
Date: Wed, Apr 23, 2008 at 9:59 AM
Subject: teaching evaluations
To: "DL.WG.CEAS.Faculty" <DL.WG.CEAS.Faculty@mainex1.asu.edu>
Cc: Ann Zell <ann.zell@asu.edu>
Colleagues,
The Spring 2008 teaching evaluations became available to students on
Mon 4/21, around 10:30 a.m. and will close at Wed 4/30 (reading day)
at 12:00 midnight. Students will be able to access the evaluation
tool at: https://fultonapps.asu.edu/eval
Please encourage your students to complete the evaluations, especially
the students who enjoy your class, or they will face several nagging
email requests.
jim
James S. Collofello
Associate Dean for Academic and Student Affairs
Ira A. Fulton School of Engineering
FYI, I revised and expanded the plan recognition slides from the
first lecture based on the actual discussion in the class.
rao
Speaker:
Dr. Biplav Srivastava
IBM Research
Time: 11AM--12AM; Wednesday April 23rd; BY 510
Abstract:
Divide-and-conquer or working with complex systems from their
basic building blocks is one of the basic tenets of modern engineering.
While its applicability to Information Technology has always been
felt – example Object Oriented Methodology, its success has been
limited. There is a resurgence of interest in componentization of
IT systems and services through focus on Service Oriented Architecture,
and Web Services as its most popular form. Consequently, Web Services
has received wide attention in both academia and IT industry over
the past 5-7 years. The attractiveness of this technology lies in
the fact that the specifications of the building
blocks (i.e., services) are openly available in a registry and
so are the building blocks themselves. So, the promise is that a user
can build (or modify) an application by composing (or re-composing)
components whose specification it discovers from the registry
and whose capabilities it can access whenever needed. Depending on
what is defined as a service, web services composition can enable
many IT issues -- Mashups, Asset Reuse, Business-to-IT alignment,
Business-to-Business and Enterprise Application Integration, ...
In this talk, we will look at where the hardness of automatically
composing web services comes from in practice and how traditional
Computer Science techniques (notably planning) have fared. While
the original myth was that composition would be hard, in reality,
most composition scenarios did not demand scalability of the
top-of-the-line planning algorithms. However, what has turned
out to be harder than composition is how to set up the composition
problem as a traditional Computer Science (notably planning) problem.
Two trends are emerging to address this: the composition problem is
often cast as a plan reuse and modification problem in the context of
richer domain models (e.g., Industry Business Processes), and new
composition/ planning paradigms like model-lite planning which are
resilient to impoverished domain models.
-----------
About Biplav:
Dr. Biplav Srivastava is a Research Staff Member at IBM Research since
February 2001. Though based at IBM's India Research Laboratory, Biplav
is on assignment to IBM's T.J.Watson Research Center in Hawthorne, NY, USA.
Biplav's research interests are in planning, scheduling, policies,
learning and information management, and their practical applications
in services -- infrastructure and software (web services), semantic web,
autonomic computing and societal domains. Prior to IBM Research, he
was Core Technology Architect at an erstwhile Silicon Valley
start-up, Bodha, eventually acquired by SAP (2000-2001; process integration),
Staff Software Engineer at VLSI/ Philips Semiconductors (1996-2000;
electornic design automation) and Assistant System Analyst at TCS,
India (1993-1994).
cheers
Rao
This is the first of the two application talks planned for this week.
The second one will be in class on Thursday. Please make every effort
to attend. -Rao]
Title:
The Myth and Reality of Web Services Composition
Speaker:
Dr. Biplav Srivastava
IBM Research
Time: 11AM--12AM; Wednesday April 23rd; BY 510
Abstract:
Divide-and-conquer or working with complex systems from their
basic building blocks is one of the basic tenets of modern engineering.
While its applicability to Information Technology has always been
felt – example Object Oriented Methodology, its success has been
limited. There is a resurgence of interest in componentization of
IT systems and services through focus on Service Oriented Architecture,
and Web Services as its most popular form. Consequently, Web Services
has received wide attention in both academia and IT industry over
the past 5-7 years. The attractiveness of this technology lies in
the fact that the specifications of the building
blocks (i.e., services) are openly available in a registry and
so are the building blocks themselves. So, the promise is that a user
can build (or modify) an application by composing (or re-composing)
components whose specification it discovers from the registry
and whose capabilities it can access whenever needed. Depending on
what is defined as a service, web services composition can enable
many IT issues -- Mashups, Asset Reuse, Business-to-IT alignment,
Business-to-Business and Enterprise Application Integration, ...
In this talk, we will look at where the hardness of automatically
composing web services comes from in practice and how traditional
Computer Science techniques (notably planning) have fared. While
the original myth was that composition would be hard, in reality,
most composition scenarios did not demand scalability of the
top-of-the-line planning algorithms. However, what has turned
out to be harder than composition is how to set up the composition
problem as a traditional Computer Science (notably planning) problem.
Two trends are emerging to address this: the composition problem is
often cast as a plan reuse and modification problem in the context of
richer domain models (e.g., Industry Business Processes), and new
composition/ planning paradigms like model-lite planning which are
resilient to impoverished domain models.
-----------
About Biplav:
Dr. Biplav Srivastava is a Research Staff Member at IBM Research since
February 2001. Though based at IBM's India Research Laboratory, Biplav
is on assignment to IBM's T.J.Watson Research Center in Hawthorne, NY, USA.
Biplav's research interests are in planning, scheduling, policies,
learning and information management, and their practical applications
in services -- infrastructure and software (web services), semantic web,
autonomic computing and societal domains. Prior to IBM Research, he
was Core Technology Architect at an erstwhile Silicon Valley
start-up, Bodha, eventually acquired by SAP (2000-2001; process integration),
Staff Software Engineer at VLSI/ Philips Semiconductors (1996-2000;
electornic design automation) and Assistant System Analyst at TCS,
India (1993-1994).
Speaker: Minh B. Do, (Xerox) PARC Labs, Palo Alto
Date/Time: Thursday 4/.24 3:15pm BY 190
Abstract: This talk summarizes the recent work at the Embedded
Reasoning System at PARC on applying automated planning techniques to
the control of modular production printing equipments. These
reconfigurable printers radically change the traditional design by
using simpler, interchangeable, but smarter components. Like many
other real-world applications, such as mobile robotics, this complex
domain requires real-time autonomous decision-making and robust
continual operation. To our knowledge, this work represents the first
successful industrial application of embedded domain-independent
temporal planning. Main challenges of applying automated planning
technology in this domain include compositional modeling, on-line
planning and exception handling, real-time planner control, and the
interaction with low-level controller. At the heart of our system is
an on-line algorithm that combines techniques from state-space
planning and partial-order scheduling. For example, our
planning-graph-based planning heuristic takes resource contention into
account when estimating makespan remaining. We suggest that this
general architecture may prove useful as more intelligent systems
operate in continual, online settings. Our system has been used to
drive several commercial prototypes and numerous hypothetical (but
realistic) printer configurations. When compared with
competition-winning state-of-the-art off-line planners in this domain,
our system is hundreds of times faster and often finds much better
quality plans. At the end of the talk, I will also discuss current
extensions of our current planning framework to other online planning
domains that share similar characteristics and also to objective
functions beyond the default maximization for machine productivity.
Bio: Minh Do is a Research Staff in the Embedded Reasoning Area at the
Palo Alto Research Center (formerly Xerox PARC). He graduated from the
Yochan planning group at Arizona State University in 2004 and has been
working on transferring his knowledge in offline domain-independent
metric temporal planning into fast online continual planning
applications. Besides temporal and online planning, Minh Do has
worked on other planning topics such as over-subscription planning,
planning as CSP/ILP/SAT, and integrating planning and diagnosis. He
has published a few dozens papers, filed several patents on automated
planning and co-authored the ICAPS best application paper award on
planning for high-speed modular printer control. This year, he is
co-chairing the deterministic track of the 6th International Planning
Competition.
rao
On Mon, Apr 21, 2008 at 1:50 PM, Aishwarya Sivaraman <asivaram@asu.edu> wrote:
> Dr. Rao,
>
> I am unable to open the longer versions' link. I get a 404 error.
>
> http://rakaposhi.eas.asu.edu/cse574/notes/asst.pdf
>
>
> Regards,
> Aishwarya
>
>
> On Mon, Apr 21, 2008 at 1:23 PM, Subbarao Kambhampati
> <SUBBARAO.KAMBHAMPATI@asu.edu> wrote:
>
> > For the decision theoretic assistant paper, if you find the short one
> > too succinct, you might
> > consider the longer version here
> >
> > http://rakaposhi.eas.asu.edu/cse574/notes/asst.pdf
> >
> > rao
> >
> >
> > On Sun, Apr 20, 2008 at 9:32 PM, Subbarao Kambhampati
> > <subbarao2z2@gmail.com> wrote:
> > > Folks
> > > Here are two readings for next class:
> > >
> > > Primary: http://www.cs.orst.edu/%7Eafern/papers/ijcai07-assistant.pdf
> > >
> > >
> > > Secondary:
> > >
> > > http://www.cs.rochester.edu/~kautz/papers/gps-tracking.pdf
> > >
> > >
> > > Rao
> > >
> > > --
> > > Posted By Subbarao Kambhampati to CSE574 Planning & Learning at
> 4/20/2008
> > > 09:32:00 PM
> > >
> > >
> > >
> > >
> > >
> > >
> >
>
>
http://rakaposhi.eas.asu.edu/cse574/notes/asst.pdf
rao
On Sun, Apr 20, 2008 at 9:32 PM, Subbarao Kambhampati
<subbarao2z2@gmail.com> wrote:
> Folks
> Here are two readings for next class:
>
> Primary: http://www.cs.orst.edu/%7Eafern/papers/ijcai07-assistant.pdf
>
>
> Secondary:
>
> http://www.cs.rochester.edu/~kautz/papers/gps-tracking.pdf
>
>
> Rao
>
> --
> Posted By Subbarao Kambhampati to CSE574 Planning & Learning at 4/20/2008
> 09:32:00 PM
>
>
>
>
>
>
Primary: http://www.cs.orst.edu/%7Eafern/papers/ijcai07-assistant.pdf
Secondary:
http://www.cs.rochester.edu/~kautz/papers/gps-tracking.pdf
Rao
In particular, we saw that classical planning can be seen as A*
search, and belief-space planning can be seen as
AO* search. Typical AO* search algorithms work on acyclic graphs (note
that AO* can be seen as a
problem decomposition framework, and cycles imply that you are
reducing a problem indirectly to itself).
The LAO* paper below shows that FOMDP policy construction can be seen
as AO* search on cyclic
graphs.
LAO*
http://www.cse.msstate.edu/~hansen/papers/laostar.pdf
Another idea for viewing value function computation is in terms of
fixed-depth expansion under a node
(as in game trees--in fact, in 471, I motivated game trees in terms of
RTDPs). The LRTDP algorithm
improves a bit on RTDP
LRTDP
http://ftp.cs.ucla.edu/pub/stat_ser/R319.pdf
rao
thanks
rao
---------- Forwarded message ----------
From: Subbarao Kambhampati <rao@asu.edu>
Date: Thu, Mar 27, 2008 at 6:12 PM
Subject: Reading for next class: Decision Theoretic Planning
To: Rao Kambhampati <rao@asu.edu>
Here is the paper for the next week.
http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume11/boutilier99a.pdf
You should read at least upto page 35 for next week.
Rao
Planning for Interactions among Autonomous Agents
Dana S. Nau
Department of Computer Science
University of Maryland
College Park, MD
Date: Monday, March 31, 2008
Time: 11:30 AM - 12:45 PM
Place: Brickyard Building, Room 210
This talk will focus on ways to plan an autonomous agent's
interactions with other autonomous agents. Sometimes it is feasible to
model the other agents' possible actions as nondeterministic outcomes
of our agent's own actions. For this case, we can plan how to achieve
a desired goal using symbolic model checking, HTN planning, or a
combination of the two. Sometimes it may not be feasible to generate a
plan or policy that goes all the way to a goal, either because the
search space is too large or the goal is ambiguously defined. For such
cases, it can work well to interleave planning and execution if we
have a good predictive model of how the other agents are likely to
behave. The talk will present theoretical foundations and algorithms
for the above situations, and experimental results on the
Hunter-and-Prey domain, the Iterated Prisoner's Dilemma with Noise,
and other multi-agent planning domains.
Biography
Dana Nau is a Professor of both Computer Science and Systems Research
at the University of Maryland, and is co-director of the university's
Laboratory for Computational Cultural Dynamics.He has more than 300
refereed technical publications on automated planning, search
algorithms, game theory, and other topics.He has received an NSF PYI
award, several best-paper awards, and several prizes for the
performance of software systems.He is a Fellow of the Association for
the Advancement of Artificial Intelligence (AAAI).
==============
--
Posted By Subbarao Kambhampati to CSE574 Planning & Learning at
3/31/2008 06:42:00 AM
Dana S. Nau
Department of Computer Science
University of Maryland
College Park, MD
Date: Monday, March 31, 2008
Time: 11:30 AM - 12:45 PM
Place: Brickyard Building, Room 210
This talk will focus on ways to plan an autonomous agent's
interactions with other autonomous agents. Sometimes it is feasible to
model the other agents' possible actions as nondeterministic outcomes
of our agent's own actions. For this case, we can plan how to achieve
a desired goal using symbolic model checking, HTN planning, or a
combination of the two. Sometimes it may not be feasible to generate a
plan or policy that goes all the way to a goal, either because the
search space is too large or the goal is ambiguously defined. For such
cases, it can work well to interleave planning and execution if we
have a good predictive model of how the other agents are likely to
behave. The talk will present theoretical foundations and algorithms
for the above situations, and experimental results on the
Hunter-and-Prey domain, the Iterated Prisoner's Dilemma with Noise,
and other multi-agent planning domains.
Biography
Dana Nau is a Professor of both Computer Science and Systems Research
at the University of Maryland, and is co-director of the university's
Laboratory for Computational Cultural Dynamics.He has more than 300
refereed technical publications on automated planning, search
algorithms, game theory, and other topics.He has received an NSF PYI
award, several best-paper awards, and several prizes for the
performance of software systems.He is a Fellow of the Association for
the Advancement of Artificial Intelligence (AAAI).
==============
Now that I have an idea of what semester projects are being attempted
by the various students, I am beginning to wonder
what the grade should be based on in the event that the semester
project doesn't end up all that well.
Now, in real world, if your project doesn't work, your paper will just
be rejected and that is that. However, since this is a course
and you need a letter grade, we need some basis for that. [There is
the option of just giving everybody the same high grade--but I am
afraid that might adversely affect my popularity...]
As of now, we had one homework, and one mini-project. I definitely
plan to release a second homework to cover everything since the
first homework.
In addition to that, I am thinking that there should be some sort of
"breadth of understanding" assessment. We can do this assessment
either around now (specifically, there is a good chance I will miss
the class of April 7th-- and that is always a goodtime to have an
exam), or
at the end of the semester during the finals week.
My assumption is that if someone does a great semester project, that
trumps over everything else. However, if the semester project doesn't
work out all that well--these other assessment mechanisms can help in
giving a grade.
I solicit your opinion(s) on this (either on the blog or by anonymous
email to http://rakaposhi.eas.asu.edu/cgi-bin/mail?rao )
thanks
Rao
One relevant issue here is that if you *know* the state of the world
in some aspect, and you know that
you haven't done anything to change it, then there should be no reason
for you to check it again.
Assuming a single-agent world, if you start with complete state and do
deterministic actions, then you never have
to look (as we discussed). Even if we start with incomplete initial
state, we may know "everything" about *some aspects*
of the world. As long as the ensuing actions--of ourselves--don't
modify the completeness of that knowledge, we don't
need to look *for that aspect* of the world.
It all starts to look like managing "closed world" assumptions. In the clasical
planning, we know init state, and so we can start with closed-world
assumption. No actions modify that closed world assumption.
In general belief-space planning, we don't have full closed world
assumption, but may have *partial* closed world assumption. As
we do actions, we may lose or acquire closed world assumption. In a
desktop (or unix) world, for example, we may start knowing
names of the files in the current directly as well as the sizes of all
the files in the directory. After we run a latex command, we still
know
the names of all files in the directory (even though latex makes new
files-- we know what they will be -- .aux, .bbl etc). We however no
longer
know the sizes of all files (since the sizes of .aux and .bbl files
generated will depend on the files you latexed in a complex way and
you can't
model it a priori). So, we lose closed world knowledge of the file
sizes. If we need that, we will have to do an "ls -s" action (a
sensing action).
If we just happen to do "rm *" action in that directory, we again get
full knowledge on both files and sizes of the directory.
In the paper below
http://www.cs.washington.edu/homes/etzioni/papers/xii-aaai94.pdf
Golden et. al. formalize this notion of starting with and tracking
local closedworld assumptions (LCWs) .
Their main contribution is to note not only the normal effects of the
actions, but also their meta-effects on closed world assumptions
(e.g. see the latexing and rm'ing actions above). Neat paper to read.
=========================================
2. When discussing progression planning in the presence of sensing
actions, I pointed out that there are two non-deterministic
branches: one which picks a causative action to execute and the other
which picks a sensing action to execute.
I also mentioned that if you always pick the causative action branch,
you get "conformant" or "no-sensing" plans (if you succeed).
A related question is what happens if you always pick only sensing
action branch? You get a pure sensing plan.
We can see a use for "pure causative plan" (conformant plan)--an agent
which has no sensors has to deal just with those.
Of what use can pure sensing plans be?
Well--if all you can do is sense some database, then your plans will
be just pure sensing actions. In particular, plans whose main purpose
is to
gather information can be thought of in terms of pure (or almost
entirely) sensing plans.
When you do planning on the web--for example--most often, your actions
involve sensing (look at this database, take a value from there and
plug it into a sensing query for this other database etc.) that leave
the world as it is, and only modify *your knowledge* of it.
Of course, you can also sometimes have causative actions (e..g.
updates--credit card
transactions etc) that modify the state of some database, and not just
the state of your knowledge.
So, not surprisingly, planning for information gathering involves
mostly things of pure sensing actions. The good part about sensing
actions is
that there are never any negative interactions among them (your brain
doesn't explode because you learned knowledge in the wrong order ;-).
So, for sensing planning the big issue is not so much about subgoal
interactions, but rather about reducing sensing.
Not surprisingly, the LCW stuff discussed above--wind up being relevant.
See http://rakaposhi.eas.asu.edu/ijcai-ig.pdf
which talks about how LCW information can be used to reduce the number
of information sources
that the agent has to sense to get all answers for a query.
(The paper http://rakaposhi.eas.asu.edu/ig-tr.pdf
provides a somewhat dated tutorial introduction to planning for
information gathering).
Rao
http://www.cs.cmu.edu/afs/cs/project/jair/pub/volume11/boutilier99a.pdf
You should read at least upto page 35 for next week.
Rao
http://rakaposhi.eas.asu.edu/pgSurvey.pdf
(the AI Magazine tutorial paper on PG heuristics that you already
printed and read a part of
back when we were doing classical planning heuristics)
Rao
On Tue, Mar 25, 2008 at 5:44 PM, Subbarao Kambhampati <rao@asu.edu> wrote:
>
> http://rakaposhi.eas.asu.edu/dan-jair-pond.pdf
>
> (As an added bonus, section 2 provides an overview of the progression
> and regression with and without sensing actions in belief space)
>
> Rao
>
(As an added bonus, section 2 provides an overview of the progression
and regression with and without sensing actions in belief space)
Rao
rao
Complexity under partial observability/non-determinism
http://users.rsise.anu.edu.au/~jussi/RintanenICAPS04.pdf
Complexity of temporal planning
http://users.rsise.anu.edu.au/~jussi/Rintanen07icaps.pdf
Rao
http://rakaposhi.eas.asu.edu/cse574/aips00-incomplete.pdf
Rao
Regarding the relative weight of the assignments--I am not too sure
yet. In general
I think about [30- 40%] for homeworks etc, [10-20%] for participation
and [40% -50%] for semester project.
By the way, I should tell you that one idea I am toying with is to do
an oral midterm
(Y'all get to come for half-an-hour each and the teaching czars and I
will "interview" you
on what happened until now).
Rao
On Tue, Mar 4, 2008 at 7:59 PM, Oleg Bakun <Oleg.Bakun@asu.edu> wrote:
> Hello,
> When grades and their averages for HW1 and Project 1
> will be published?
> What is relative weight of each assignment?
>
> Oleg Bakun
>
>
>
>
> ____________________________________________________________________________________
> Looking for last minute shopping deals?
> Find them fast with Yahoo! Search.
http://tools.search.yahoo.com/newsearch/category.php?category=shopping
>
FYI
Rao
--> discuss any questions you might have on TCSPs (especially handling of STPs)
--> Discuss next topic: Scheduling
Reading:
Required: (required and in order)
Dana Nau text book 15.1--15.3
If possible, also read
http://rakaposhi.eas.asu.edu/cse574/smith-cheng-slack-aaai93.pdf (6
page paper on a scheduler)
============
Also, after the break, we will have a class to discuss the paper
http://rakaposhi.eas.asu.edu/cse574/KER00.pdf
(which talks about integrating planning and scheduling)
http://rakaposhi.eas.asu.edu/cse574/s08-feedback.pdf
They should give you a more complete idea of what the class thinks and whether
it jives with your own view.
If you think the comments have a different meaning that I spun them for,
feel free to tell me that ( http://rakaposhi.eas.asu.edu/cgi-bin/mail?rao
)
regards
Rao
Today is your last chance to send the midterm feedback. I will
summarize what I got tomorrow (and discuss in the
class if needed).
I got feedback from 5 students. The class has 12 registered
students--so this is not even a majority.
If it helps you galvanize to action, let me teasingly say that the
majority feedback received till now says
everything is upsy-daisy and requests if any thing for more homeworks and exams.
Rao
---------- Forwarded message ----------
From: Subbarao Kambhampati <rao@asu.edu>
Date: Tue, Feb 26, 2008 at 9:20 PM
Subject: Feedback opportunity to tell me how the class is going
To: Rao Kambhampati <rao@asu.edu>
Folks
Now that 6 weeks are over, I thought it would be good to poll y'all
on what is working and what is not working.
Please feel free to send your comments.
You can send comments either via anonymous web mail
http://rakaposhi.eas.asu.edu/cgi-bin/mail?rao
or by bringing a printed sheet to the class on Wednesday. Note that
the webmail records your ip address--so
you may want to send it from some ip address that is generic.
Here are some things I am interested in finding out. Other comments
welcome too:
1. Are the lectures too fast/too slow/too high level/too low level etc?
2. Are you actually able to connect your readings to lectures?
3. Are you reading before or after the topics are discussed?
4. Should there be more assessment (homeworks/projects etc)?
5. How is the progress towards semester project coming (qualitatively
speaking--
you will have an opportunity for detailed answers anyway)
6. Are the classes interacive enough or should they encourage more discussion?
If the latter, what can we do other than stopping for questions?
7. Overall, how does it compare to other graduate level courses you took/taking
(positively or negatively)
regards
Rao
For the first part, I\in addition to the SAPA paper and IJCAI 2007
paper readings that you are already in charge of for last class, you
can also look at the Zeno paper
http://rakaposhi.eas.asu.edu/cse574/zeno.pdf
For the second part (temporal networks) you can look at
http://rakaposhi.eas.asu.edu/cse574/tcn-meiri-dechter-aij.pdf
rao
Starting this week, you are required to make observations on the blog
about things
learned in the class (and readings) every week. These can be questions
you had while thinking
the material over, or skeptical comments or connections between the
week's topics and other areas.
Since we are no longer having note-taking, and have also not been
doing summaries of
papers before the class, I think there hasn't been much discussion of
things done.
In my 4-level classes, I start the discussion with "blog questions"--I
thought this will happen
automatically from your side for this class. And yet, there never seem
to be any questions.
So, I thought some creative pressure should be brought to bear.
Let us see blog posts..
regards
Rao
A 2-5 page summary and status of your semester project is due in
class on next Thursday 3/6.
I realize that different students are at different stages of progress
in this. Nevertheless, the
3/6 deadline is meant to give you as well as me a clear indication of
where things stand.
In case where you still not zeroed in on a specific topic, I want
your report to give enough details
on what you have done to try to zero in.
thanks
rao
http://rakaposhi.eas.asu.edu/real-temporal-ijcai.pdf
is the main reading. The first part of it re-states the required
concurrency property we talked about yesterday
The second part discusses the decision epoch planning at a high level.
For a more complete description of one decision epoch planner, see
Sections 1 and 2 of
http://rakaposhi.eas.asu.edu/do03a.pdf
Rao
Now that 6 weeks are over, I thought it would be good to poll y'all
on what is working and what is not working.
Please feel free to send your comments.
You can send comments either via anonymous web mail
http://rakaposhi.eas.asu.edu/cgi-bin/mail?rao
or by bringing a printed sheet to the class on Wednesday. Note that
the webmail records your ip address--so
you may want to send it from some ip address that is generic.
Here are some things I am interested in finding out. Other comments welcome too:
1. Are the lectures too fast/too slow/too high level/too low level etc?
2. Are you actually able to connect your readings to lectures?
3. Are you reading before or after the topics are discussed?
4. Should there be more assessment (homeworks/projects etc)?
5. How is the progress towards semester project coming (qualitatively speaking--
you will have an opportunity for detailed answers anyway)
6. Are the classes interacive enough or should they encourage more discussion?
If the latter, what can we do other than stopping for questions?
7. Overall, how does it compare to other graduate level courses you took/taking
(positively or negatively)
regards
Rao
For this problem, you will use the planning graph in the last level
III.b above.
IV.a. Convert the planning graph into a CSP encoding (the problem is
small enough that you can write the entire encoding down). Show a solution
for this CSP encoding, and show how it corresponds to a plan.
IV.b. Do IV.a. but with SAT encoding of the planing graph.
IV.c.
Do an "explanatory axiom" (or backward proof based) encoding
of this problem (for the same length as the planning graph you used
in the previous parts). Mark which constraints are similar, different,
stronger etc. compared to IV.b.
-------------------
On 2/19/08, Jonathan.D.Gibbs@asu.edu <Jonathan.D.Gibbs@asu.edu> wrote:
> Okay, so here is my question again.
> In parts a and b of Question 4, we are asked to encode the planning graph from "II.b" in CSP and SAT; part c, meanwhile, asks us create a third encoding and compare its constraints to "III.b." Are these intended to reference the planning graph with mutex-propagation in Question III.b (which is derived for a domain having only 2 actions and 4 variables), or the "standard mutex graph... used by planning graph" in Question II.b (which is derived for a domain with 5 actions and 5 variables)?
>
> -Jonathan Gibbs
> jondg@cox.net
> OR
> Jonathan.D.Gibbs@asu.edu
>
Required
Russell & Norvig: 12.5 & 12.6
Pell et. al. Robust planning and replanning for spacecraft..
http://rakaposhi.eas.asu.edu/cse574/pell97robust.pdf
suggested
Replanning: A new perspective
by Cushing & Kambhampati
http://rakaposhi.eas.asu.edu/replan-will.pdf
Monitoring optimality during execution
by Fritz & McIlraith
http://www.cs.toronto.edu/~fritz/publications/papers/fri-mci-icaps07.pdf
On 2/15/08, Nan Li <Nan.Li.3@asu.edu> wrote:
> Dear Dr. Rao,
>
> I cannot open the file downloaded from the first link. Could you please
> check that? Thank you.
>
> Best,
> Nan
>
>
>
> On Thu, Feb 14, 2008 at 7:32 PM, Subbarao Kambhampati <rao@asu.edu> wrote:
>
> > Here are optional readings for references made in the class
> >
> > --> A short (4 page) paper on FF (explains the enforced hill climbing
> > and other little changes it makes on top of relaxed plan heuristic)
> >
> > http://rakaposhi.eas.asu.edu/cse574/ff-aimag.pdf
> >
> >
> > -->A (10 page) paper on Fast-Downward (that detects multi-valued
> > fluents masquerading as boolean ones, and uses relaxed planning graph
> > ideas after that detection)
> >
> >
> ftp://ftp.informatik.uni-freiburg.de/documents/papers/ki/helmert-icaps04.pdf
> >
> > --> A short (3 page) paper on Macro-FF, that selectively learns
> > macros on top of FF
> >
> http://www.cs.ualberta.ca/%7Eadib/Research/Planning/ipc4Alberta.pdf
> >
> > --> A paper on learning search control rules from failures using EBL
> >
> > http://rakaposhi.eas.asu.edu/ebl-po-aaai94.pdf
> >
> > (a longer but perhaps cleaner description is
> >
http://rakaposhi.eas.asu.edu/ebljpap.pdf
> >
> > also, for relations between EBL, dependency directed backtracking in
> > CSP (no-good learning) as well as planning, see
> >
> > http://rakaposhi.eas.asu.edu/jour-ddb.pdf )
> >
> >
> > Rao
> >
>
>
http://rakaposhi.eas.asu.edu/psp-aaai04.pdf
Rao
--> A short (4 page) paper on FF (explains the enforced hill climbing
and other little changes it makes on top of relaxed plan heuristic)
http://rakaposhi.eas.asu.edu/cse574/ff-aimag.pdf
-->A (10 page) paper on Fast-Downward (that detects multi-valued
fluents masquerading as boolean ones, and uses relaxed planning graph
ideas after that detection)
ftp://ftp.informatik.uni-freiburg.de/documents/papers/ki/helmert-icaps04.pdf
--> A short (3 page) paper on Macro-FF, that selectively learns
macros on top of FF
http://www.cs.ualberta.ca/%7Eadib/Research/Planning/ipc4Alberta.pdf
--> A paper on learning search control rules from failures using EBL
http://rakaposhi.eas.asu.edu/ebl-po-aaai94.pdf
(a longer but perhaps cleaner description is
http://rakaposhi.eas.asu.edu/ebljpap.pdf
also, for relations between EBL, dependency directed backtracking in
CSP (no-good learning) as well as planning, see
http://rakaposhi.eas.asu.edu/jour-ddb.pdf )
Rao
In addition, you will also be asked to upload the pddl files of your
domains onto the class wiki.
rao
In a change of plan, I decided that it would be good to do some
discussion on the status of learning techniques--specifically those
aimed at
learning additional domain knowledge to improve planning performance.
This is something that was in the back of our minds as we kept talking
about HTNs, TLPlan rules
etc.
Although the topic of learning techniques for planning, will, I hope,
make a comeback again during the semester, this is also a good time to
do spend one lecture on it.
Accordingly, the readings for tomorrow's class are:
Sungwook Yoon's paper on learning to improve relaxed plan heuristics:
http://www.public.asu.edu/~syoon10/icaps06.pdf
You can also look at the tutorial on Learning Techniques for Planning:
http://rakaposhi.eas.asu.edu/learn-plan.html
(there is even a video of an earlier version of this tutorial at
http://videolectures.net/mlss06au_kambhampati_ltp/ )
Rao
We continue discussion of HTN planning (chap 11) and then do control
rules--especially
temporal control rules (chap 10)
rao
On 2/11/08, Aishwarya Sivaraman <asivaram@asu.edu> wrote:
> Dr. Rao,
>
> I was just wondering if you have put up the readings for tomorrow's class. I
> could not see any updates on the blog.
>
> Best Regards,
> Aishwarya Sivaraman
>
It covers everything done until last Tuesday
It will be due back in 2 weeks.
Please take homeworks seriously; there is a possibility that we will decide
not to have separate exams.
Also, please note that you are required to do the
work yourself and not use any old solutions that you may have been
able to download
from the web. It is okay to discuss with other students, instructor
etc, but the final work must
be done by you.
Rao
http://rakaposhi.eas.asu.edu/AMali99.pdf]
The slides for the talk accompanying the paper are at
http://rakaposhi.eas.asu.edu/mali-aaai99/index.htm
Rao
On Feb 7, 2008 6:17 AM, Tuan A. Nguyen <Tuan.Anh.Nguyen@asu.edu> wrote:
> Dear Rao,
> Your paper "On the utility of plan-space (causal) encoding" can not be
> downloaded from your homepage.
> Best regards,
> A.Tuan
>
There will be a slight change of plan for next class. Instead of
doing partial satisfaction planning, we will do
"knowledge-based" planning--with some emphasis on hierarchical task
network planning. The readings for the
class are Chapter 11 and then Chapter 10 (in that order) from Ghallab/Nau book.
Rao
Here is the working URL
http://citeseer.ist.psu.edu/64263.html
(the reason the oldone is broken is that citeseer at nec is no longer working)
rao
http://rakaposhi.eas.asu.edu/cse574/readings.html#encodings
The second and third papers--CSP encodings and SAT encodings--are most
relevant for next class.
The first one explains the details of how to use EBL in graphplan
(that I described at a highlevel yesterday).
rao
http://raos-ruminations.blogspot.com/2006/07/on-suboptimality-of-optimal-planning.html
for a recent (as in 1.5 years back controversy) about optimality track
of the IPC
Rao
wcushing@asu.edu
j.benton@asu.edu
sungwook.yoon@asu.edu
menkes@asu.edu
rao@asu.edu
rao
As I said, we will discuss any outstanding questions on heuristics
(including probably a quick discussion on use of reachability
heuristics for
Plan-space planning), and then discuss bounded-length planning as a
combinatorial problem. What we did with Graphplan today will become
a simple subcase--with other cases being pushing planning as SAT/CSP/IP etc.
The additional reading for tomorrow's class is:
Chapters 6 and 7 in Nau et. al.'s text book
or if you want a shorter version, Chapter 11 (sections 4 and 5) in
Russell and Norvig (if you don't have the text, here is that chapter
from AIMA site:
http://aima.cs.berkeley.edu/newchap11.pdf )
-----------------
Also, if any of you felt a little lost today and wanted a more
structured lecture on the heuristics, check out
http://rakaposhi.eas.asu.edu/ICAPS06-tutorial/ICAPS06-1.mp3
which is the first part of the tutorial based closely on the paper you
read and slides you saw.
Rao
Thanks for writing down your initial ideas for class projects. We
(myself and the Teaching czars) went through them and
put in our bids.
Please look at https://wiki.asu.edu/planning/index.php/Project_ideas
and see who your primary advisor bid is from.
You will have to contact the advisor and make an appointment (about an
hour or less in most cases) so you can talk to them
and flesh out what is potentially feasible.
You will be required to maintain a wiki page with the summary of the meeting.
Rao
http://rakaposhi.eas.asu.edu/pgSurvey.pdf
(definitely upto page 61,
and preferably upto page 66. Before you die of shock,
note that the paper starts at page 47 ;-)
No summaries are needed--but you should read the paper before coming.
Since part of this material has been covered once in
471 (if you took it from me), I intend to go at a brisk pace (brisk
pace for rao is not unlike realtime for redwoods).
Rao
The term white-knight comes from the "White knights in shining armor
rescuing damsels in distress".. The white-knight step intervenes
between the threatening step and the step needing the condition and
re-establishes the condition.
*NOTE* that while white-knight clause is needed to prove correctness
of arbitrary partial order plans, it is *NOT* needed to ensure
completeness of plan-space planning (which, like progression and
regression, only needs to generate all action sequences that can solve
the problem). While the plan-space planner I discussed in the class
will never be able to terminate with the white-knight example plan
(because there is no way to come up with an un-threatened causal link
for that plan), it is able to terminate with two different
specializations (refinements) of that plan--such that every action
sequence that is a top-sort of the white-knight plan is also a
top-sort of one of these refinements.
*IF* for some strange reason, you do want your plan-space (partial
order) planner to terminate with the white-knight plan, then you need
to generalize the semantics of causal links such that you have three
ways of resolving a threat. Specifically, if a causal link s1--p--s2
is threatened by a step s3 (which has effect ~p), then, we have three
resolution possibilities:
1. Promotion: put s3<s1
2. Demotion: put s2<s3
3. White-knight "de-clobbering"
Find a step s4 (either existing in the plan, or from the library)
such that s4 has effect p.
Introduce ordering s3<s4<s2
(so s4 comes between s3 and s2)
As mentioned above, the white-knight declobbering is unnecessary for
completeness in the space of action sequences.
It is also worth noting that when you allow white-knight declobbering,
you are essentially changing the semantics of the causal-link
constraint. Without declobbering, we in essence say that once
established, the causal link condition *CANNOT BE VIOLATED EVEN
TEMPORARILY*. Allowing declobbering means you can allow it to be
temporarily violated.
[Maintenance goals] Sometimes, plan-space planners use causal links to
model *Maintenance* goals. (The normal goals we saw until now are
goals of attainment--i.e., the conditions must be true at the end.
Sometimes, you also have goals of maintenance--all through the
execution of the plan, a certain condition must hold true. For
example, if you want to walk through a dark basement garage to the
fuse box to reset the fuse, then you need light to help you navigate
the garage. This light must be on pretty much all through the
execution of the plan. This kind of semantics will not be respected by
white-knight declobbering.
[White-knight and Chapmans Modal Truth Criterion;] White-knights
entered planning vocabulary through an (unfortunately) influential
paper from 1987 by David Chapman.
I stored a copy at http://rakaposhi.eas.asu.edu/cse574/notes/chapman-mtc.pdf
for your edification
This paper was unfortunate in that it (1) confused plan-space planning
with the need for being complete in the space of partial plans (which
as I pointed out is *NOT needed*) and (2) consequently assumed that
per-node complexity in plan-space planning will be influenced by the
cost of checking correctness of a partially ordered plan. Another
wrong assumption that Chapman made was that plan-space planners *MUST*
deal with partially instantiated partial order plans (this is the
*lifted planning* I started talking about).
Since a PO Plan can have an exponential topoligical sorts, and many
more ground topological sorts, it is not surprising that checking the
correctness of all those topological sorts can potentially be
exponential.
Chapman's +ve result was that *if* actions have no conditional
effects, and all variables are considered to have *infinite* domains
(which is a bogus assumption), then correctness can be checked in poly
time (note that this requires the white-knight clause). His negative
result was that once you have conditional effects, the correctness
check will be NP-complete. He somehow made this sound as if it was
the end of the world (and the planning community, for its part, went
along with it). On scholar.google.com, you see that Chapman's paper
has had 845 citations--most before 1991..
http://scholar.google.com/scholar?q=chapman+modal+truth+criterion&hl=en&lr=
The third paper in that page--written some 7 years after Chapman's
paper--brings up some of these critiques.
Rao
ps: What is wierd about Chapman's paper is that Chapman should have
known better. 10 years before his work, the planner "Nonlin" was
developed by Austin Tate and it saw farther (the plan-space planner we
discussed today is closer to 1977 paper by Tate than anything in
Chapman's 1987 paper). Apparently Chapman did talk to Tate but somehow
Tate's work is not cited prominently. Checkout
http://rakaposhi.eas.asu.edu/nonlin-review.txt
for an opinionated review of this whole episode..
http://rakaposhi.eas.asu.edu/kambhampati.pdf
Rao
Tuesday class: Ideally by Thursday---but definitely by Saturday
Thursday class: Ideally by Saturday--but definitely by Sunday
This will give us czars a chance to make changes to the notes if needed.
If you see yourself falling behind this schedule, it is your
responsibility to let us know.
regards
rao
An interesting issue about planner completeness is whether or not the
planner is capable of finding *every plan* for the problem.
Let us get some terminology straight.
A sequence of actions a1...an is considered a solution to a problem
[I,G] if we can
execute the sequence starting in state I and G will hold in the final
state after executing an.
A solution plan P: a1...an is said to be minimal if it is not
possible to remove any subset of actions from P
without violating its solution-ness.
An example of a non-minimal plan for achieving On(A,B) when A, B and C
are all on table is:
put A on C, take A of C, put A on B.
Cleraly, we can cut the first two actions out and it will still be a
solution (so it is a non-minimal solution).
Consider the following questions:
0. Given a planning problem that is solvable, how many solutions
(minimal as well as non-minimal) are there?
1. Is progression planner complete for all plans (including minimal
and non-minimal plans)?
2. Is regression planner complete for all plans (including non-minimal)?
3. Will regression planner ever find any non-minimal plans?
4. Will progression planner ever find any non-minimal plans?
5. If a planner is capable of producing non-minimal plans, then will
sussman anomaly be non-serializable for such a planner?
6. Given a possibly non-minimal plan, how costly would it be to
"minimize" it (i.e., remove redundant actions from it)?
(Complexity-wise... is it polynomial?
exponential? etc)
If you are receiving this mail, you are still registered for CSE574
(i.e., forgot to drop the class before add/drop end ;-).
I created a mailing list as well as a blog for the class. Most of you
know the drill:
The intended usage of the mailing list is for me to send you
announcements as well as things to think about.
All mails sent to the mailing list are sent to your email address,
posted to the class mail archive as well as to the class blog.
All of you have posting privileges to the class blog. You can post to
the blog as well as write comments on other posts.
You can keep track of changes to the blog through RSS feed etc.
cheers
Rao