Description: Transformer network that learns three mappings (instructions → action tuples; UI objects → object descriptions; action tuples ↔ descriptions). A pair of UIs is taken as input to capture semantic information.
Status:
Date: