通过提前考虑风险和成本来增强学习的AI工具

Safety Gym was announced by OpenAI, a non-profit organization that studies artificial intelligence. Existing reinforcement learning pointed out that AI has the potential to cause unexpected errors due to dangerous actions, and was introduced as a tool that can perform agent reinforcement learning while respecting safety restrictions.

Safety Gym is a reinforcement learning agent or a module for AI that maintains motivation toward a goal by reward and punishment. Open AI introduced constrained reinforcement learning in Safety Gym where AI automatically thinks about cost and conducts simulation.

Constrained reinforcement learning agents set cost targets at the beginning of learning and perform learning using rewards and punishments. In other words, AI through constrained reinforcement learning is required to predict risks in advance.

Safety Gym uses three agents: Point, Car, and Doggo to explore the congested environment and reach the goal. In addition, three tasks are set: a goal to a designated area, a button that continuously passes through a checkpoint, and a push to push an object to a designated position. In addition, there are two levels of difficulty, and a warning light flashes around the agent whenever the agent performs an unsafe task.

The point is that a robot with a rotating actuator and an actuator for forward and backward movement runs on a 2D plane. The car is driven by a robot with two independently driven front wheels and one rotating rear wheel. In order for the car robot to turn or move, it has to operate two front wheels at the same time. Dogo is a simulation of a symmetrical robot with 4 legs. The leg must be manipulated so that the azimuth and relief angle can be manipulated on the fuselage, and the robot does not fall even if the angle adjustment joint is lowered.

Open AI says that since safety gym is still a developing country, a lot of work is still needed to combine safety technology in addition to other problems. Three tasks are realization of constrained reinforcement learning along with performance improvement, safe transfer learning, distribution change problem investigation, and human taste. Explain that there is. Through a system like Safety Gym, AI developers are expected to work on a shared system to facilitate collaboration on the safety of the entire AI field. Related information can be found here .

lswcap

通过每月的AHC PC和HowPC杂志时代，他在网络IT媒体上观看了“技术时代”，如ZDNet，电子报互联网经理，Consumer Journal Ivers的编辑，TechHolic出版商和Venture Square的编辑。我很好奇这个仍然充满活力的市场。

View all posts

通过提前考虑风险和成本来增强学习的AI工具

이것이 좋아요:

lswcap

Add comment

Cancel reply

细胞机器人证实具有自我复制能力

就像生活一样，有限制…社交媒体限制为 100 个帖子？

为什么女人比男人活得长

Topics

Recent posts

细胞机器人证实具有自我复制能力

就像生活一样，有限制…社交媒体限制为 100 个帖子？

为什么女人比男人活得长

“土著草药对癌症治疗有效”

印度政府“Starlink，从服务前获得许可证……”

黑色星期五在线销售额首次下降的原因是什么？

丹麦 sensibility 女前轮轮毂摩托

Spotify 取消车窗模式

澳大利亚拟强制披露社交媒体匿名用户信息

Email Newsletter

Techrecipe

Follow us

Most popular

纵向农业和未来粮食

ARM体系结构是如何诞生的

AR办公室“远程协作增强现实……”

“智能药丸”来了

对自动车辆数据所有权的挑战

每小时拆解200个单位…… Apple第二代回收机器人

市中心航运服务的时代

UFS 3.0“智能手机数据传输速度快两倍”

特斯拉谁去了海边？电动船漂浮。

LiDAR和移动新的可能性

Android内部…’车辆共享’定制电动滑板车？

瑞士无人机谷告诉我们

Most discussed

通过提前考虑风险和成本来增强学习的AI工具

이 글 공유하기:

이것이 좋아요:

lswcap

Add comment

You may also like

Topics

Recent posts

Email Newsletter

Techrecipe

Follow us

Most popular

Most discussed