博客
关于我
强烈建议你试试无所不能的chatGPT,快点击我
deeplearning.ai - 机器学习策略 (1)
阅读量:4090 次
发布时间:2019-05-25

本文共 1961 字,大约阅读时间需要 6 分钟。

吴恩达 Andrew Ng

机器学习策略 (1)

Orthogonalization 正交化

  • 各个参数的功能尽量独立,就像正交,相互垂直
  • Orthogonalization or orthogonality is a system design property that assures that modifying an instruction or a component of an algorithm will not create or propagate side effects to other components of the

    system.

    1. Fit training set well in cost function

      If it doesn’t fit well, the use of a bigger neural network or switching to a better optimization
      algorithm might help.

    2. Fit development set well on cost function

      If it doesn’t fit well, regularization or using bigger training set might help.

    3. Fit test set well on cost function

      If it doesn’t fit well, the use of a bigger development set might help

    4. Performs well in real world

      If it doesn’t perform well, the development test set is not set correctly or the cost function is not evaluating the right thing.

Single number evaluation metric 单一数字评估指标

  • precision 查准率,预测为真的例子中多少是实际为真的
  • recall 查全率,实际为真的例子中多少预测为真
  • 例子
    这里写图片描述
    Recall = True Positive / (True Positive + False Negative)
  • there’s often a tradeoff between precision and recall
  • F1 Score: harmonic mean(调和平均) of precision P and recall R
  • 判断哪一个模型更优

Satisficing and optimizing metrics

  • optimizing metrics, as accurately as possible 尽可能好
  • satisficing metrics, meet expectation set 满足一定指标即可

Some advices

  • choose the development and test sets from the same distribution and it must be taken randomly from all the data
  • optimize the evaluation metric
  • Bayes optimal error 理论上的最优误差
  • Machine learning progresses slowly when it surpasses human-level performance
  • Human-level error as a proxy for Bayes error
  • If the difference between human-level error and the training error is bigger than the difference between the training error and the development error. The focus should be on bias reduction technique
  • If the difference between training error and the development error is bigger than the difference between the human-level error and the training error. The focus should be on variance reduction technique

Summary

这里写图片描述

你可能感兴趣的文章
利用runtime给类别添加属性
查看>>
本地推送
查看>>
UIImage存为本地文件与UIImage转换为NSData
查看>>
[转]打印质数的各种算法
查看>>
[转]javascript with延伸的作用域是只读的吗?
查看>>
php的autoload与global
查看>>
IE不支持option的display:none属性
查看>>
[分享]mysql内置用于字符串型ip地址和整数型ip地址转换函数
查看>>
Https加密及攻防
查看>>
Java生成随机不重复推广码邀请码
查看>>
【JAVA数据结构】双向链表
查看>>
【JAVA数据结构】先进先出队列
查看>>
String类的intern方法随笔
查看>>
【泛型】一个简易的对象间转换的工具类(DO转VO)
查看>>
1.随机函数,计算机运行的基石
查看>>
MouseEvent的e.stageX是Number型,可见as3作者的考虑
查看>>
移植Vim配色方案到Eclipse
查看>>
谈谈加密和混淆吧[转]
查看>>
TCP的几个状态对于我们分析所起的作用SYN, FIN, ACK, PSH,
查看>>
网络游戏客户端的日志输出
查看>>