tensorflow c++踩的坑

Posted on 2018-08-05 | Edited on 2018-10-16

问题1：头文件找不到

编译tensorflow源码时添加环境变量
几种方式：

打开 /etc/profile 文件，在末尾添加 export PATH=(自己的路径)$PATH
把编译好的文件复制到 /usr/local/install/ 中

编译时tensorflow的一些头文件找不到
看错误提示，到相应的文件夹下看看有没有相应的文件

存在，说名环境变量没有设置正确
若不存在，看看是否存在同样文件名的 .proto 文件，使用 protoc 生成 .cpp 和 .h 文件
若也不存在 .proto文件，使用 find |grep 文件名，找到相应文件，可以复制到错误提示的文件夹中，或者把当前路径添加到环境变量中(原文件中使用的名字需要作相应的改动改动)

自己写的文件中方法找不到

ctrl + 点击，跳转到源文件中查看源码，若却是不存在，使用 grep 在源文件中搜索定义的地方，然后在自己的代码中替换,并作相应修改

xzp@xzp-HP-Pavilion-Notebook:/usr/local/include/tf$ grep ends_with /usr/local/include/tf -r
/usr/local/include/tf/tensorflow/docs_src/tutorials/images/image_recognition.md:  if (tensorflow::StringPiece(file_name).ends_with(".png")) {
/usr/local/include/tf/tensorflow/contrib/makefile/downloads/protobuf/src/google/protobuf/stubs/stringpiece.h:  bool ends_with(StringPiece x) const {
/usr/local/include/tf/tensorflow/contrib/makefile/downloads/protobuf/src/google/protobuf/stubs/stringpiece.cc:  if (ends_with(x)) {
/usr/local/include/tf/tensorflow/contrib/makefile/downloads/protobuf/src/google/protobuf/stubs/stringpiece_unittest.cc:  EXPECT_TRUE(a.ends_with(a));
/usr/local/include/tf/tensorflow/contrib/makefile/downloads/protobuf/src/google/protobuf/stubs/stringpiece_unittest.cc:  EXPECT_TRUE(a.ends_with("bar"));
/usr/local/include/tf/tensorflow/contrib/makefile/downloads/protobuf/src/google/protobuf/stubs/stringpiece_unittest.cc:  EXPECT_TRUE(a.ends_with(e));
/usr/local/include/tf/tensorflow/contrib/makefile/downloads/protobuf/src/google/protobuf/stubs/stringpiece_unittest.cc:  EXPECT_TRUE(b.ends_with(s1));
/usr/local/include/tf/tensorflow/contrib/makefile/downloads/protobuf/src/google/protobuf/stubs/stringpiece_unittest.cc:  EXPECT_TRUE(b.ends_with(b));
/usr/local/include/tf/tensorflow/contrib/makefile/downloads/protobuf/src/google/protobuf/stubs/stringpiece_unittest.cc:  EXPECT_TRUE(b.ends_with(e));
/usr/local/include/tf/tensorflow/contrib/makefile/downloads/protobuf/src/google/protobuf/stubs/stringpiece_unittest.cc:  EXPECT_TRUE(e.ends_with(""));
/usr/local/include/tf/tensorflow/contrib/makefile/downloads/protobuf/src/google/protobuf/stubs/stringpiece_unittest.cc:  EXPECT_TRUE(!a.ends_with(b));
/usr/local/include/tf/tensorflow/contrib/makefile/downloads/protobuf/src/google/protobuf/stubs/stringpiece_unittest.cc:  EXPECT_TRUE(!b.ends_with(a));
/usr/local/include/tf/tensorflow/contrib/makefile/downloads/protobuf/src/google/protobuf/stubs/stringpiece_unittest.cc:  EXPECT_TRUE(!e.ends_with(a));
/usr/local/include/tf/tensorflow/contrib/makefile/downloads/protobuf/src/google/protobuf/stubs/stringpiece_unittest.cc:  // ends_with
/usr/local/include/tf/tensorflow/contrib/makefile/downloads/protobuf/src/google/protobuf/stubs/stringpiece_unittest.cc:  EXPECT_TRUE(abc.ends_with(abc));
/usr/local/include/tf/tensorflow/contrib/makefile/downloads/protobuf/src/google/protobuf/stubs/stringpiece_unittest.cc:  EXPECT_TRUE(!abc.ends_with("abcdefguvwxyz"));
/usr/local/include/tf/tensorflow/contrib/makefile/downloads/protobuf/src/google/protobuf/stubs/stringpiece_unittest.cc:  EXPECT_TRUE(abc.ends_with("nopqrstuvwxyz"));
/usr/local/include/tf/tensorflow/contrib/makefile/downloads/protobuf/src/google/protobuf/util/internal/utility.cc:  return text.ends_with(suffix);
/usr/local/include/tf/tensorflow/contrib/makefile/downloads/absl/absl/strings/match.h:// Returns whether a given std::string `text` ends with `ends_with`, ignoring case
/usr/local/include/tf/tensorflow/contrib/makefile/downloads/googletest/googlemock/test/gmock-matchers_test.cc:  Matcher<const string&> ends_with_ok = EndsWith("ok");
/usr/local/include/tf/tensorflow/contrib/makefile/downloads/googletest/googlemock/test/gmock-matchers_test.cc:  ASSERT_THAT("book", ends_with_ok);
/usr/local/include/tf/tensorflow/contrib/makefile/downloads/googletest/googlemock/test/gmock-matchers_test.cc:  EXPECT_NONFATAL_FAILURE(EXPECT_THAT(bad, ends_with_ok),
/usr/local/include/tf/tensorflow/contrib/makefile/downloads/re2/re2/stringpiece.h:  bool ends_with(const StringPiece& x) const {
xzp@xzp-HP-Pavilion-Notebook:/usr/local/include/tf$

protoc 命令使用踩过的坑

版本不正确
把目前使用的protoc 卸载，然后安装相应的版本，如果使用 apt remove protoc 卸载失败，可以直接在/usr/local/include/ 中把头文件文件夹删了，在 /usr/local/bin/ 中把 protoc 删除
找不到依赖的 .proto 文件，使用 protoc -I=相应路径这种方式(如：sudo protoc -I=/usr/local/include/tf/ -I=. –cpp_out=. cluster.proto )

IIC驱动调试踩坑

Posted on 2018-06-19

最近需要使用一个ad的加速度传感器，虽然传感器提供SPI和IIC两种通信方式，由于传感器与主控芯片连接有一定距离，所以只能使用IIC。但是主控芯片使用的是ST cortex-M4内核的芯片，ST的IIC使用硬件总线的话总是有些问题，所以使用模拟IIC。

1.驱动编写

官方只给了一个ad自家芯片驱动的SPI例程，所以只能自己写驱动。

按照数据手册和时序图，写完驱动。

2.调试驱动

2.1 IIC发送无回复应答

驱动完成以后需要调试，刚开始IIC根本收不到回复，

w_ack = 1
w_ack = 1
w_ack = 1
ui8temp  = 0x 0
w_ack = 1
w_ack = 1
w_ack = 1

又重新看了下时序和代码，发现代码没什么问题，后来重启了一下，可以成功收到回复

w_ack = 0
w_ack = 0
w_ack = 0
ui8temp  = 0x 0
w_ack = 0
w_ack = 0
w_ack = 0

有时候串口也会受到J-link的影响，可能是电路有点问题。

2.2 数据读取不出来

虽然应答可以收到，但还是取不出数据，仔细看了数据手册，发现有一处细节没有注意。

上图红色圈到的地方，在发送完寄存器地址后，收到应答应该再次发送IIC start信号，但是给遗漏了。加上重新发送开始信号后，可以读到数据。

0xed
0xff
0xff
0xff
0xff
0xff

2.3 只能读到一次数据

虽然数据可以读到，但是如上面看到的，只有第一次可以读到，后续还是没有数据。这时我把最后一个ACK改成了1，但是还是依然如此，忽然想起来好像是我将时钟分频改了，又原来的168改成了8所以导致这种结果。

2.4 隔一次读到一次

改后是可以读到数据了，但是隔一次读一个，这又是哪出问题了，示波器看波形。

呃，什么情况，本来空闲状态都应该是高电平的，这么一会高一会低。

有点崩溃了，后来来回找了好久，才发现，上次把最后一个ACK给搞错了。改回低电平，成功读取数据。

3.总结

驱动的编写调试是嵌入式编程的基础，但这也是最复杂最让人厌烦的事情，需要耐心和细心。

韩智鸿老师说波形有些杂波，韩老师建议加104p旁路电容。IIC需要加4k7的电阻。

感谢韩智鸿老师和linuxhan的帮助。

Learning-Compression Algorithms for Neural Net Pruning

Posted on 2018-06-11 | Edited on 2018-12-25

python笔记

Posted on 2018-05-30 | Edited on 2018-10-16

[TOC]

基础知识

运算符

三元运算符

其他语言：
    a = x>y:x?y
python:
    a = x if x>y else y

set()

装饰器和神奇的@

装饰器 Decorator
在程序运行期间动态增加功能的方式称为‘装饰器’。其实就是高阶函数，把原来的函数作为装饰器的参数运行，然后返回一个闭包代替原来函数。装饰器可以在运行函数前进行预处理，如参数类型检查等。

使用装饰器的时候，定义函数/对象方法前使用 @ 。

简单装饰器及装饰器运行机制

>>> def log(func):
...     def wrapper(*args,**kw):
...         print('call %s():' %func.__name__)
...         return func(*args,**kw)
...     return wrapper
...
>>> @log #装饰一下
... def now():
...     print('2018-7-24')
...     return 'done'
...
>>> now() #运行时相当于运行 log(now()) -> wrapper(now())
call now():
2018-7-24
'done'

传参装饰器

>>> def log(text):
...     def decorator(func):
...         def wrapper(*args, **kw):
...             print '%s %s():' % (text, func.__name__)
...             return func(*args, **kw)
...         return wrapper
...     return decorator
... 
>>> @log('execute')
... def now():
...     print '2015-10-26'
...     return "done"
... 
>>> now() #运行时相当于运行 log(now()) -> wrapper(now())
excute now()
2018-7-24
'done'

tips

list to str

''.join(str(x) for x in list)

builtins 内容

'ArithmeticError'
'AssertionError'
'AttributeError'
'BaseException'
'BlockingIOError'
'BrokenPipeError'
'BufferError'
'BytesWarning'
'ChildProcessError'
'ConnectionAbortedError'
'ConnectionError'
'ConnectionRefusedError'
'ConnectionResetError'
'DeprecationWarning'
'EOFError'
'Ellipsis'
'EnvironmentError'
'Exception'
'False'
'FileExistsError'
'FileNotFoundError'
'FloatingPointError'
'FutureWarning'
'GeneratorExit'
'IOError'
'ImportError'
'ImportWarning'
'IndentationError'
'IndexError'
'InterruptedError'
'IsADirectoryError'
'KeyError'
'KeyboardInterrupt'
'LookupError'
'MemoryError'
'NameError'
'None'
'NotADirectoryError'
'NotImplemented'
'NotImplementedError'
'OSError'
'OverflowError'
'PendingDeprecationWarning'
'PermissionError'
'ProcessLookupError'
'RecursionError'
'ReferenceError'
'ResourceWarning'
'RuntimeError'
'RuntimeWarning'
'StopAsyncIteration'
'StopIteration'
'SyntaxError'
'SyntaxWarning'
'SystemError'
'SystemExit'
'TabError'
'TimeoutError'
'True'
'TypeError'
'UnboundLocalError'
'UnicodeDecodeError'
'UnicodeEncodeError'
'UnicodeError'
'UnicodeTranslateError'
'UnicodeWarning'
'UserWarning'
'ValueError'
'Warning'
'WindowsError'
'ZeroDivisionError'
'_'
'__build_class__'
'__debug__'
'__doc__'
'__import__'
'__loader__'
'__name__'
'__package__'
'__spec__'
'abs'
'all'
'any'
'ascii'
'bin'
'bool'
'bytearray'
'bytes'

callable()

    函数用于检查一个对象是否是可调用的。如果返回True，object仍然可能调用失败；但如果返回False，调用对象ojbect绝对不会成功。
    
    对于函数, 方法, lambda 函式, 类, 以及实现了 __call__ 方法的类实例, 它都返回 True。

callable(object)
para:   object -- 对象
返回值:可调用返回 True，否则返回 False。

'chr'
'classmethod'
'compile'
'complex'
'copyright'
'credits'
'delattr'
'dict'
'dir'
'divmod'
'enumerate'

eval()

把输入字符串当作表达式

'exec'
'exit'
'filter'
'float'
'format'
'frozenset'
'getattr'
'globals'
'hasattr'
'hash'
'help'
'hex'
'id'
'input'
'int'

int()

int()用于取整，且只能向下取整，并不能四舍五入
round() 有四舍五入功能
若想实现向上取整的功能，可以使用round(values + 0.5)

>>> int(1.3)
1
>>> int(1.8)
1
>>> round(1.3)
1
>>> round(1.8)
2
>>> round(1.3 + 0.5)
2
>>>

'isinstance'
'issubclass'
'iter'
'len'
'license'
'list'
'locals'
'map'
'max'
'memoryview'
'min'
'next'
'object'
'oct'
'open'
'ord'
'pow'
'print'

property()

将类的方法转为只读属性
重新实现一个属性的setter和getter方法

class Person(object):

    def __init__(self, first_name, last_name):
        self.first_name = first_name
        self.last_name = last_name
 
    @property
    def full_name(self):
        return "%s %s" % (self.first_name, self.last_name)

>>> person = Person("Mike", "Driscoll")
>>> person.full_name
'Mike Driscoll'
>>> person.first_name
'Mike'
>>> person.full_name = "Jackalope"
Traceback (most recent call last):
  File "<string>", line 1, in <fragment>
AttributeError: can't set attribute
>>> person.first_name = "Dan"
>>> person.full_name
'Dan Driscoll'

#原来一个类都是这样
>>> class Fees(object):
...     def __init__(self):
...         self._fee = None
...     def get_fee(self):
...         return self._fee
...     def set_fee(self,value):
...         self._fee=value
...
>>> f = Fees()
>>> f.set_fee(1)
>>> f.get_fee()
1

#用property改一下
>>> class Fees(object):
...     def __init__(self):
...         self._fee = None
...     def get_fee(self):
...         return self._fee
...     def set_fee(self,value):
...         self._fee=value
...     fee = property(get_fee,set_fee)
...
>>> f = Fees()
>>> f.fee = 1
>>> f.fee
1

>>> class Fees(object):
...     def __init__(self):
...         self._fee = None
...     @property
...     def fee(self):
...         return self._fee
...     @fee.setter
...     def fee(self,value):
...         self._fee=value
...
>>> f = Fees()
>>> f.fee = 2
>>> f.fee
2

'quit'
'range'
'repr'
'reversed'

round()

round()四舍五入，参考int()

'set'
'setattr'
'slice'
'sorted'
'staticmethod'
'str'

sum()

>>> a
[1, 2, 3]
>>> sum(a)
6
>>>

'super'
'tuple'
'type'
'vars'
'zip'

To solve this problem, we need to understand "What is the use of median". In statistics, the median is used for dividing a set into two equal length subsets, that one subset is always greater than the other. If we understand the use of median for dividing, we are very close to the answer.

First let's cut A into two parts at a random position i:

      left_A             |        right_A
A[0], A[1], ..., A[i-1]  |  A[i], A[i+1], ..., A[m-1]
Since A has m elements, so there are m+1 kinds of cutting( i = 0 ~ m ). And we know: len(left_A) = i, len(right_A) = m - i . Note: when i = 0 , left_A is empty, and when i = m , right_A is empty.

With the same way, cut B into two parts at a random position j:

      left_B             |        right_B
B[0], B[1], ..., B[j-1]  |  B[j], B[j+1], ..., B[n-1]
Put left_A and left_B into one set, and put right_A and right_B into another set. Let's name them left_part and right_part :

      left_part          |        right_part
A[0], A[1], ..., A[i-1]  |  A[i], A[i+1], ..., A[m-1]
B[0], B[1], ..., B[j-1]  |  B[j], B[j+1], ..., B[n-1]
If we can ensure:

1) len(left_part) == len(right_part)
2) max(left_part) <= min(right_part)
then we divide all elements in {A, B} into two parts with equal length, and one part is always greater than the other. Then median = (max(left_part) + min(right_part))/2.

To ensure these two conditions, we just need to ensure:

(1) i + j == m - i + n - j (or: m - i + n - j + 1)
    if n >= m, we just need to set: i = 0 ~ m, j = (m + n + 1)/2 - i
(2) B[j-1] <= A[i] and A[i-1] <= B[j]
ps.1 For simplicity, I presume A[i-1],B[j-1],A[i],B[j] are always valid even if i=0/i=m/j=0/j=n . I will talk about how to deal with these edge values at last.

ps.2 Why n >= m? Because I have to make sure j is non-nagative since 0 <= i <= m and j = (m + n + 1)/2 - i. If n < m , then j may be nagative, that will lead to wrong result.

So, all we need to do is:

Searching i in [0, m], to find an object `i` that:
    B[j-1] <= A[i] and A[i-1] <= B[j], ( where j = (m + n + 1)/2 - i )
And we can do a binary search following steps described below:

<1> Set imin = 0, imax = m, then start searching in [imin, imax]

<2> Set i = (imin + imax)/2, j = (m + n + 1)/2 - i

<3> Now we have len(left_part)==len(right_part). And there are only 3 situations
     that we may encounter:
    <a> B[j-1] <= A[i] and A[i-1] <= B[j]
        Means we have found the object `i`, so stop searching.
    <b> B[j-1] > A[i]
        Means A[i] is too small. We must `ajust` i to get `B[j-1] <= A[i]`.
        Can we `increase` i?
            Yes. Because when i is increased, j will be decreased.
            So B[j-1] is decreased and A[i] is increased, and `B[j-1] <= A[i]` may
            be satisfied.
        Can we `decrease` i?
            `No!` Because when i is decreased, j will be increased.
            So B[j-1] is increased and A[i] is decreased, and B[j-1] <= A[i] will
            be never satisfied.
        So we must `increase` i. That is, we must ajust the searching range to
        [i+1, imax]. So, set imin = i+1, and goto <2>.
    <c> A[i-1] > B[j]
        Means A[i-1] is too big. And we must `decrease` i to get `A[i-1]<=B[j]`.
        That is, we must ajust the searching range to [imin, i-1].
        So, set imax = i-1, and goto <2>.
When the object i is found, the median is:

max(A[i-1], B[j-1]) (when m + n is odd)
or (max(A[i-1], B[j-1]) + min(A[i], B[j]))/2 (when m + n is even)
Now let's consider the edges values i=0,i=m,j=0,j=n where A[i-1],B[j-1],A[i],B[j] may not exist. Actually this situation is easier than you think.

What we need to do is ensuring that max(left_part) <= min(right_part). So, if i and j are not edges values(means A[i-1],B[j-1],A[i],B[j] all exist), then we must check both B[j-1] <= A[i] and A[i-1] <= B[j]. But if some of A[i-1],B[j-1],A[i],B[j] don't exist, then we don't need to check one(or both) of these two conditions. For example, if i=0, then A[i-1] doesn't exist, then we don't need to check A[i-1] <= B[j]. So, what we need to do is:

Searching i in [0, m], to find an object `i` that:
    (j == 0 or i == m or B[j-1] <= A[i]) and
    (i == 0 or j == n or A[i-1] <= B[j])
    where j = (m + n + 1)/2 - i
And in a searching loop, we will encounter only three situations:

<a> (j == 0 or i == m or B[j-1] <= A[i]) and
    (i == 0 or j = n or A[i-1] <= B[j])
    Means i is perfect, we can stop searching.

<b> j > 0 and i < m and B[j - 1] > A[i]
    Means i is too small, we must increase it.

<c> i > 0 and j < n and A[i - 1] > B[j]
    Means i is too big, we must decrease it.
Thank @Quentin.chen , him pointed out that: i < m ==> j > 0 and i > 0 ==> j < n . Because:

m <= n, i < m ==> j = (m+n+1)/2 - i > (m+n+1)/2 - m >= (2*m+1)/2 - m >= 0    
m <= n, i > 0 ==> j = (m+n+1)/2 - i < (m+n+1)/2 <= (2*n+1)/2 <= n
So in situation <b> and <c>, we don't need to check whether j > 0 and whether j < n.

Below is the accepted code:

 def median(A, B):
    m, n = len(A), len(B)
    if m > n:
        A, B, m, n = B, A, n, m
    if n == 0:
        raise ValueError

    imin, imax, half_len = 0, m, (m + n + 1) / 2
    while imin <= imax:
        i = (imin + imax) / 2
        j = half_len - i
        if i < m and B[j-1] > A[i]:
            # i is too small, must increase it
            imin = i + 1
        elif i > 0 and A[i-1] > B[j]:
            # i is too big, must decrease it
            imax = i - 1
        else:
            # i is perfect

            if i == 0: max_of_left = B[j-1]
            elif j == 0: max_of_left = A[i-1]
            else: max_of_left = max(A[i-1], B[j-1])

            if (m + n) % 2 == 1:
                return max_of_left

            if i == m: min_of_right = B[j]
            elif j == n: min_of_right = A[i]
            else: min_of_right = min(A[i], B[j])

            return (max_of_left + min_of_right) / 2.0

常用数据处理库

numpy

合并

>>> x = np.array([1,2,3])
>>> y = np.array([4,5,6])
>>> np.concatenate([x,y],axis = 0)
array([1, 2, 3, 4, 5, 6])
>>> np.vstack((x,y))
array([[1, 2, 3],
       [4, 5, 6]])
>>> np.hstack((x,y))
array([1, 2, 3, 4, 5, 6])

pandas

关键缩写和包导入
在这个速查手册中，我们使用如下缩写：

df：任意的Pandas DataFrame对象
s：任意的Pandas Series对象
同时我们需要做如下的引入：

import pandas as pd
import numpy as np

导入数据

pd.read_csv(filename)：从CSV文件导入数据
pd.read_table(filename)：从限定分隔符的文本文件导入数据
pd.read_excel(filename)：从Excel文件导入数据
pd.read_sql(query, connection_object)：从SQL表/库导入数据
pd.read_json(json_string)：从JSON格式的字符串导入数据
pd.read_html(url)：解析URL、字符串或者HTML文件，抽取其中的tables表格
pd.read_clipboard()：从你的粘贴板获取内容，并传给read_table()
pd.DataFrame(dict)：从字典对象导入数据，Key是列名，Value是数据

导出数据

df.to_csv(filename)：导出数据到CSV文件
df.to_excel(filename)：导出数据到Excel文件
df.to_sql(table_name, connection_object)：导出数据到SQL表
df.to_json(filename)：以Json格式导出数据到文本文件

创建测试对象

pd.DataFrame(np.random.rand(20,5))：创建20行5列的随机数组成的DataFrame对象
pd.Series(my_list)：从可迭代对象my_list创建一个Series对象
df.index = pd.date_range(‘1900/1/30’, periods=df.shape[0])：增加一个日期索引

查看、检查数据

df.head(n)：查看DataFrame对象的前n行
df.tail(n)：查看DataFrame对象的最后n行
df.shape()：查看行数和列数
df.info()：查看索引、数据类型和内存信息
df.describe()：查看数值型列的汇总统计
s.value_counts(dropna=False)：查看Series对象的唯一值和计数
df.apply(pd.Series.value_counts)：查看DataFrame对象中每一列的唯一值和计数

数据选取

df[col]：根据列名，并以Series的形式返回列
df[[col1, col2]]：以DataFrame形式返回多列
s.iloc[0]：按位置选取数据
s.loc[‘index_one’]：按索引选取数据
df.iloc[0,:]：返回第一行
df.iloc[0,0]：返回第一列的第一个元素

df.sample(n=None, frac=None, replace=False, weights=None, random_state=None, axis=None)

n是要抽取的行数。（例如n=20000时，抽取其中的2W行）

frac是抽取的比列。（有一些时候，我们并对具体抽取的行数不关系，我们想抽取其中的百分比，这个时候就可以选择使用frac，例如frac=0.8，就是抽取其中80%）

replace抽样后的数据是否代替原DataFrame()

weights这个是每个样本的权重，具体可以看官方文档说明。

random_state这个在之前的文章已经介绍过了。

axis是选择抽取数据的行还是列。axis=0的时是抽取行，axis=1时是抽取列（也就是说axis=1时，在列中随机抽取n列，在axis=0时，在行中随机抽取n行）

数据清理

df.columns = [‘a’,’b’,’c’]：重命名列名
pd.isnull()：检查DataFrame对象中的空值，并返回一个Boolean数组
pd.notnull()：检查DataFrame对象中的非空值，并返回一个Boolean数组
df.dropna()：删除所有包含空值的行
df.dropna(axis=1)：删除所有包含空值的列
df.dropna(axis=1,thresh=n)：删除所有小于n个非空值的行
df.fillna(x)：用x替换DataFrame对象中所有的空值
s.astype(float)：将Series中的数据类型更改为float类型
s.replace(1,’one’)：用‘one’代替所有等于1的值
s.replace([1,3],[‘one’,’three’])：用’one’代替1，用’three’代替3
df.rename(columns=lambda x: x + 1)：批量更改列名
df.rename(columns={‘old_name’: ‘new_ name’})：选择性更改列名
df.set_index(‘column_one’)：更改索引列
df.rename(index=lambda x: x + 1)：批量重命名索引

df.drop(df,axis=…):删除行或列

>>> df
     A      B         C         D
0  foo    one  0.016336  0.087302
1  bar    one -0.394784 -2.609699
2  foo    two -0.241163  0.429637
3  bar  three -1.450263  1.574793
4  foo    two -0.436486  0.047045
5  bar    two  0.378663 -0.596585
6  foo    one  0.576077  0.036312
7  foo  three -1.507273  0.212231
>>> df.drop('A',axis = 1)
       B         C         D
0    one  0.016336  0.087302
1    one -0.394784 -2.609699
2    two -0.241163  0.429637
3  three -1.450263  1.574793
4    two -0.436486  0.047045
5    two  0.378663 -0.596585
6    one  0.576077  0.036312
7  three -1.507273  0.212231
>>> df.drop(['A','B'],axis = 1)
          C         D
0  0.016336  0.087302
1 -0.394784 -2.609699
2 -0.241163  0.429637
3 -1.450263  1.574793
4 -0.436486  0.047045
5  0.378663 -0.596585
6  0.576077  0.036312
7 -1.507273  0.212231
>>> df.drop([1,2,4],axis = 0)
     A      B         C         D
0  foo    one  0.016336  0.087302
3  bar  three -1.450263  1.574793
5  bar    two  0.378663 -0.596585
6  foo    one  0.576077  0.036312
7  foo  three -1.507273  0.212231

数据处理：Filter、Sort和GroupBy

df[df[col] > 0.5]：选择col列的值大于0.5的行
df.sort_values(col1)：按照列col1排序数据，默认升序排列
df.sort_values(col2, ascending=False)：按照列col1降序排列数据
df.sort_values([col1,col2],ascending=[True,False])：先按列col1升序排列，后按col2降序排列数据
df.groupby(col)：返回一个按列col进行分组的Groupby对象
df.groupby([col1,col2])：返回一个按多列进行分组的Groupby对象
df.groupby(col1)[col2]：返回按列col1进行分组后，列col2的均值
df.pivot_table(index=col1,values=[col2,col3],aggfunc=max)：创建一个按列col1进行分组，并计算col2和col3的最大值的数据透视表
df.groupby(col1).agg(np.mean)：返回按列col1分组的所有列的均值
data.apply(np.mean)：对DataFrame中的每一列应用函数np.mean

data.apply(np.max,axis=1)：对DataFrame中的每一行应用函数np.max

#同一属性先移动到同一行，在做差
df = pd.DataFrame({'A' : ['foo', 'bar', 'foo', 'bar','foo', 'bar', 'foo', 'foo'],
'B' : ['one', 'one', 'two', 'three','two', 'two', 'one', 'three'],
'C' : np.random.randn(8),
'D' : np.random.randn(8)})

>>> df
     A      B         C         D         E
0  foo    one -0.660824  0.758762  0.097938
1  bar    one  1.673826 -0.369888  1.303938
2  foo    two  1.151669  0.750455  1.902124
3  bar  three -0.902216 -0.344720 -1.246936
4  foo    two -0.232781 -1.256137 -1.488918
5  bar    two  0.387244 -0.671663 -0.284419
6  foo    one -1.199822  0.078424 -1.121398
7  foo  three -0.404454  0.271658 -0.132796
>>> df['shift'] = df.groupby('A')['C'].apply(lambda i:i.shift(1))
>>> df
     A      B         C         D         E     shift
0  foo    one -0.660824  0.758762  0.097938       NaN
1  bar    one  1.673826 -0.369888  1.303938       NaN
2  foo    two  1.151669  0.750455  1.902124 -0.660824
3  bar  three -0.902216 -0.344720 -1.246936  1.673826
4  foo    two -0.232781 -1.256137 -1.488918  1.151669
5  bar    two  0.387244 -0.671663 -0.284419 -0.902216
6  foo    one -1.199822  0.078424 -1.121398 -0.232781
7  foo  three -0.404454  0.271658 -0.132796 -1.199822
>>> df['diff'] = df['C'] - df['shift']
>>> df
     A      B         C         D         E     shift      diff
0  foo    one -0.660824  0.758762  0.097938       NaN       NaN
1  bar    one  1.673826 -0.369888  1.303938       NaN       NaN
2  foo    two  1.151669  0.750455  1.902124 -0.660824  1.812493
3  bar  three -0.902216 -0.344720 -1.246936  1.673826 -2.576042
4  foo    two -0.232781 -1.256137 -1.488918  1.151669 -1.384450
5  bar    two  0.387244 -0.671663 -0.284419 -0.902216  1.289460

#求A'A'中同一个元素对应的几个 'C'的 去量纲化

>>> df['shift'] = df.groupby('A')['C'].apply(lambda i:(i-i.min())/i.std())
#错了，对的df['shift'] = df.groupby('A')['C'].apply(lambda i:(i-i.mean())/i.std())
>>> df
     A      B         C         D         E     shift      diff
0  foo    one -0.660824  0.758762  0.097938  0.616460       NaN
1  bar    one  1.673826 -0.369888  1.303938  2.000000       NaN
2  foo    two  1.151669  0.750455  1.902124  2.689432  1.812493
3  bar  three -0.902216 -0.344720 -1.246936  0.000000 -2.576042
4  foo    two -0.232781 -1.256137 -1.488918  1.106018 -1.384450
5  bar    two  0.387244 -0.671663 -0.284419  1.001117  1.289460
6  foo    one -1.199822  0.078424 -1.121398  0.000000 -0.967042
7  foo  three -0.404454  0.271658 -0.132796  0.909673  0.795368

#概率计算
>>> rating_A = f.groupby('A').size().div(len(f))
>>> rating_A
A
bar    0.375
foo    0.625
dtype: float64

#条件概率
>>> f.groupby(['A', 'B']).size().div(len(f)).div(rating_A, axis=0, level='A')
A    B
bar  one      0.333333
     three    0.333333
     two      0.333333
foo  one      0.400000
     three    0.200000
     two      0.400000
dtype: float64

数据合并

df1.append(df2)：将df2中的行添加到df1的尾部
df.concat([df1, df2],axis=1)：将df2中的列添加到df1的尾部
df1.join(df2,on=col1,how=’inner’)：对df1的列和df2的列执行SQL形式的join

数据统计

df.describe()：查看数据值列的汇总统计
df.mean()：返回所有列的均值
df.corr()：返回列与列之间的相关系数
df.count()：返回每一列中的非空值的个数
df.max()：返回每一列的最大值
df.min()：返回每一列的最小值
df.median()：返回每一列的中位数

df.std()：返回每一列的标准差

pd.factorize() #标签映射为数字
>>> df = pd.DataFrame({"id":[1,2,3,4,5,6,3,2], "raw_grade":['a', 'b', 'b','a', 'a','e','c','a']})
>>> x,y = pd.factorize(df.raw_grade)
>>> x
array([0, 1, 1, 0, 0, 2, 3, 0], dtype=int64)
>>> y
Index(['a', 'b', 'e', 'c'], dtype='object')
>>> pd.factorize(df.raw_grade)
(array([0, 1, 1, 0, 0, 2, 3, 0], dtype=int64), Index(['a', 'b', 'e', 'c'], dtype='object'))

多条件筛选

在使用dataframe处理数据的时候碰到了按照条件选取行的问题，单个条件时可以使用：

df[df[‘one’] > 5]

如果多个条件的话需要这么写：

import numpy as np

df[np.logical_and(df[‘one’]> 5,df[‘two’]>5)]

也可以这么写

df[(df[‘one’]> 5) & (df[‘two’]>5)]

数值优化来一波

数值变量

df_int = df_t.select_dtypes(include=['int'])
df_t[df_int.columns] = df_int.apply(pd.to_numeric, downcast='unsigned')

df_float = df_t.select_dtypes(include=['int'])
df_t[df_int.columns] = df_float.apply(pd.to_numeric, downcast='float')

object变量

可以使用 one_hot
可以使用 categoricals
其实都一样啦

#大佬的方法
def one_hot_encoder(df, nan_as_category=True):
    original_columns = list(df.columns)
    categorical_columns = [col for col in df.columns if df[col].dtype == 'object']
    df = pd.get_dummies(df, columns=categorical_columns, dummy_na=nan_as_category)
    new_columns = [c for c in df.columns if c not in original_columns]
    return df, new_columns

python工具类插件

itertools

迭代器的特点是：惰性求值（Lazy evaluation），即只有当迭代至某个值时，它才会被计算，这个特点使得迭代器特别适合于遍历大文件或无限集合等，因为我们不用一次性将它们存储在内存中。

Python 内置的 itertools 模块包含了一系列用来产生不同类型迭代器的函数或类，这些函数的返回都是一个迭代器，我们可以通过 for 循环来遍历取值，也可以使用 next() 来取值。

itertools 模块提供的迭代器函数有以下几种类型：

无限迭代器：生成一个无限序列，比如自然数序列 1, 2, 3, 4, …；
有限迭代器：接收一个或多个序列（sequence）作为参数，进行组合、分组和过滤等；
组合生成器：序列的排列、组合，求序列的笛卡儿积等；

无限迭代器

count(firstval=0, step=1)

创建一个从 firstval (默认值为 0) 开始，以 step (默认值为 1) 为步长的的无限整数迭代器

>>> nums = itertools.count(10, 2)    # 指定开始值和步长
>>> for i in nums:
...     if i > 20:
...         break
...     print i
...
10
12
14
16
18
20

cycle(iterable)

对 iterable 中的元素反复执行循环，返回迭代器

>>> cycle_strings = itertools.cycle('ABC')
>>> i = 1
>>> for string in cycle_strings:
...     if i == 10:
...         break
...     print i, string
...     i += 1
...
1 A
2 B
3 C
4 A
5 B
6 C
7 A
8 B
9 C

repeat(object [,times]

反复生成 object，如果给定 times，则重复次数为 times，否则为无限

>>> for item in itertools.repeat('hello world', 3):
...     print item
...
hello world
hello world
hello world
>>>
>>> for item in itertools.repeat([1, 2, 3, 4], 3):
...     print item
...
[1, 2, 3, 4]
[1, 2, 3, 4]
[1, 2, 3, 4]

有限迭代器

itertools 模块提供了多个函数（类），接收一个或多个迭代对象作为参数，对它们进行组合、分组和过滤等：

chain(iterable1, iterable2, iterable3, …)

chain 接收多个可迭代对象作为参数，将它们『连接』起来，作为一个新的迭代器返回。

>>> for item in chain([1, 2, 3], ['a', 'b', 'c']):
...     print item
...
1
2
3
a
b
c

chain 还有一个常见的用法：

chain.from_iterable(iterable)
接收一个可迭代对象作为参数，返回一个迭代器：

>>> from itertools import chain
>>>
>>> string = chain.from_iterable('ABCD')
>>> string.next()
'A'

compress(data, selectors)

compress 可用于对数据进行筛选，当 selectors 的某个元素为 true 时，则保留 data 对应位置的元素，否则去除：

1 2	list(compress('ABCDEF', [1, 1, 0, 1, 0, 1])) ['A', 'B', 'D', 'F']

dropwhile(predicate, iterable)

其中，predicate 是函数，iterable 是可迭代对象。对于 iterable 中的元素，如果 predicate(item) 为 true，则丢弃该元素，否则返回该项及所有后续项

1 2	list(dropwhile(lambda x: x < 5, [1, 3, 6, 2, 1])) [6, 2, 1]

groupby(iterable[, keyfunc])

其中，iterable 是一个可迭代对象，keyfunc 是分组函数，用于对 iterable 的连续项进行分组，如果不指定，则默认对 iterable 中的连续相同项进行分组，返回一个 (key, sub-iterator) 的迭代器

>>> data = ['a', 'bb', 'ccc', 'dd', 'eee', 'f']
>>> for key,value_iter in groupby(data,len):
...     print(key,':',list(value_iter))
...
1 : ['a']
2 : ['bb']
3 : ['ccc']
2 : ['dd']
3 : ['eee']
1 : ['f']
>>> data = ['a', 'bb', 'cc', 'ddd', 'eee', 'f']
>>> for key,value_iter in groupby(data,len):
...     print(key,':',list(value_iter))
...
1 : ['a']
2 : ['bb', 'cc']
3 : ['ddd', 'eee']
1 : ['f']

ifilter(function or None, sequence)

将 iterable 中 function(item) 为 True 的元素组成一个迭代器返回，如果 function 是 None，则返回 iterable 中所有计算为 True 的项

>>> list(ifilter(lambda x: x < 6, range(10)))
[0, 1, 2, 3, 4, 5]
>>>
>>> list(ifilter(None, [0, 1, 2, 0, 3, 4]))
[1, 2, 3, 4]

ifilterfalse()

和ifilter()类似，看名字也可以看出来了，相反嘛

islice(iterable, [start,] stop [, step])

其中，iterable 是可迭代对象，start 是开始索引，stop 是结束索引，step 是步长，start 和 step 可选

>>> list(islice([10, 6, 2, 8, 1, 3, 9], 5))
[10, 6, 2, 8, 1]
>>>
>>> list(islice(count(), 6))
[0, 1, 2, 3, 4, 5]
>>>
>>> list(islice(count(), 3, 10))
[3, 4, 5, 6, 7, 8, 9]
>>> list(islice(count(), 3, 10 ,2))
[3, 5, 7, 9]

imap()

imap(func, iter1, iter2, iter3, …)
imap 返回一个迭代器，元素为 func(i1, i2, i3, …)，i1，i2 等分别来源于 iter, iter2

>>> imap(str, [1, 2, 3, 4])
<itertools.imap object at 0x10556d050>
>>>
>>> list(imap(str, [1, 2, 3, 4]))
['1', '2', '3', '4']
>>>
>>> list(imap(pow, [2, 3, 10], [4, 2, 3]))
[16, 9, 1000]

starmap()

tee()

tee(iterable [,n])
tee 用于从 iterable 创建 n 个独立的迭代器，以元组的形式返回，n 的默认值是 2

>>> tee('abcd')   # n 默认为 2，创建两个独立的迭代器
(<itertools.tee object at 0x1049957e8>, <itertools.tee object at 0x104995878>)
>>>
>>> iter1, iter2 = tee('abcde')
>>> list(iter1)
['a', 'b', 'c', 'd', 'e']
>>> list(iter2)
['a', 'b', 'c', 'd', 'e']
>>>
>>> tee('abc', 3)  # 创建三个独立的迭代器
(<itertools.tee object at 0x104995998>, <itertools.tee object at 0x1049959e0>, <itertools.tee object at 0x104995a28>)

takewhile()

takewhile(predicate, iterable)
其中，predicate 是函数，iterable 是可迭代对象。对于 iterable 中的元素，如果 predicate(item) 为 true，则保留该元素，只要 predicate(item) 为 false，则立即停止迭代

>>> list(takewhile(lambda x: x < 5, [1, 3, 6, 2, 1]))
[1, 3]
>>> list(takewhile(lambda x: x > 3, [2, 1, 6, 5, 4]))
[]

izip()

izip(iter1, iter2, …, iterN)
如果某个可迭代对象不再生成值，则迭代停止

>>> for item in izip('ABCD', 'xy'):
...     print item
...
('A', 'x')
('B', 'y')
>>> for item in izip([1, 2, 3], ['a', 'b', 'c', 'd', 'e']):
...     print item
...
(1, 'a')
(2, 'b')
(3, 'c')

izip_longest()

izip_longest(iter1, iter2, …, iterN, [fillvalue=None])
如果有指定 fillvalue，则会用其填充缺失的值，否则为 None

>>> for item in izip_longest('ABCD', 'xy'):
...     print item
...
('A', 'x')
('B', 'y')
('C', None)
('D', None)
>>>
>>> for item in izip_longest('ABCD', 'xy', fillvalue='-'):
...     print item
...
('A', 'x')
('B', 'y')
('C', '-')
('D', '-')

组合生成器

itertools 模块还提供了多个组合生成器函数，用于求序列的排列、组合等：

product
permutations
combinations

combinations_with_replacement

product(iter1, iter2, … iterN, [repeat=1])

其中，repeat 是一个关键字参数，用于指定重复生成序列的次数

>>> for item in product('ABCD', 'xy'):
...     print item
...
('A', 'x')
('A', 'y')
('B', 'x')
('B', 'y')
('C', 'x')
('C', 'y')
('D', 'x')
('D', 'y')
>>>
>>> list(product('ab', range(3)))
[('a', 0), ('a', 1), ('a', 2), ('b', 0), ('b', 1), ('b', 2)]
>>>
>>> list(product((0,1), (0,1), (0,1)))
[(0, 0, 0), (0, 0, 1), (0, 1, 0), (0, 1, 1), (1, 0, 0), (1, 0, 1), (1, 1, 0), (1, 1, 1)]
>>>
>>> list(product('ABC', repeat=2))
[('A', 'A'), ('A', 'B'), ('A', 'C'), ('B', 'A'), ('B', 'B'), ('B', 'C'), ('C', 'A'), ('C', 'B'), ('C', 'C')]

permutations(iterable[, r])

其中，r 指定生成排列的元素的长度，如果不指定，则默认为可迭代对象的元素长度

>>> permutations('ABC', 2)
<itertools.permutations object at 0x1074d9c50>
>>>
>>> list(permutations('ABC', 2))
[('A', 'B'), ('A', 'C'), ('B', 'A'), ('B', 'C'), ('C', 'A'), ('C', 'B')]
>>>
>>> list(permutations('ABC'))
[('A', 'B', 'C'), ('A', 'C', 'B'), ('B', 'A', 'C'), ('B', 'C', 'A'), ('C', 'A', 'B'), ('C', 'B', 'A')]

combinations(iterable, r)

其中，r 指定生成组合的元素的长度

1 2	>>> list(combinations('ABC', 2)) [('A', 'B'), ('A', 'C'), ('B', 'C')]

click

click为命令行工具的开发封装了大量的方法，下面简单学习一下。详细使用还要开官方文档。

程序

# hello.py
import click

@click.command()
@click.option('--count', default=1, help='Number of greetings.')
@click.option('--name', prompt='Your name',help='The person to greet.')
def hello(count, name):
    for x in range(count):
        click.echo('Hello %s!' % name)

if __name__ == '__main__':
    hello()

运行

1
2
3

PS G:\test> python .\hello.py
Your name: a
Hello a!

相当于click将原来的函数封装了，可以使用输入参数。

PS G:\test> python .\hello.py --help
Usage: hello.py [OPTIONS]

Options:
  --count INTEGER  Number of greetings.
  --name TEXT      The person to greet.
  --help           Show this message and exit.

click.group()

@click.group()
def cli():
    pass

@click.command()
def initdb():
    click.echo('Initialized the database')

@click.command()
def dropdb():
    click.echo('Dropped the database')

cli.add_command(initdb)
cli.add_command(dropdb)

运行
秒变子命令

PS G:\test> python .\hello.py
Usage: hello.py [OPTIONS] COMMAND [ARGS]...

Options:
  --help  Show this message and exit.

Commands:
  dropdb
  initdb
PS G:\test> python hello.py initdb
Initialized the database

Group.command()
直接关联，如上面程序的升级版就是

import click
@click.group()
def cli():
    pass

@cli.command()
def initdb():
    click.echo('Initialized the database')

@cli.command()
def dropdb():
    click.echo('Dropped the database')

cli()

运行结果

PS G:\test> python .\hello.py
Usage: hello.py [OPTIONS] COMMAND [ARGS]...

Options:
  --help  Show this message and exit.

Commands:
  dropdb
  initdb
PS G:\test> python .\hello.py initdb
Initialized the database

yaml(pickle)

加载yaml文件

>>> import yaml
>>> f = open('test.yml')
>>> f
<_io.TextIOWrapper name='test.yml' mode='r' encoding='UTF-8'>
>>> x = yaml.load(f)
>>> x
{'name': 'Tom Smith', 'age': 37, 'spouse': {'name': 'Jane Smith', 'age': 25}, 'children': [{'name': 'Jimmy Smith', 'age': 15}, {'name1': 'Jenny Smith', 'age1': 12}]}
>>>

name: Tom Smith  
age: 37  
spouse:  
    name: Jane Smith  
    age: 25  
children:  
 - name: Jimmy Smith  
   age: 15  
 - name1: Jenny Smith  
   age1: 12

easydict

可以以属性的方式访问字典的值,和yaml一起使用效果更加，度配置文件

>>> import yaml
>>> from easydict import EasyDict as edict
>>> y = open('test.yml')
>>> x = yaml.load(y)
>>> x
{'name': 'Tom Smith', 'age': 37, 'spouse': {'name': 'Jane Smith', 'age': 25}, 'children': [{'name': 'Jimmy Smith', 'age': 15}, {'name1': 'Jenny Smith', 'age1': 12}]}
>>> d = edict(x)
>>> d
{'name': 'Tom Smith', 'age': 37, 'spouse': {'name': 'Jane Smith', 'age': 25}, 'children': [{'name': 'Jimmy Smith', 'age': 15}, {'name1': 'Jenny Smith', 'age1': 12}]}
>>> d.name
'Tom Smith'
>>> x.name
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
AttributeError: 'dict' object has no attribute 'name'

常用数据库

SQLite

SQLite是一种嵌入式关系型数据库，它的数据库就是一个文件。Python 2.5x以上版本内置了SQLite3，使用时直接import sqlite3即可
教程

操作流程

sqlite3.connect(database,[timeout,…])

打开一个文件与SQLite连接，如果存在打开，不存在，创建。

sqlite3.cursor()

可以创建表

增删改查都可以用 connect.execute()完成…

MySQL

也是关系型数据库，和SQLite差不多，建立连接与光标对象，用execute()执行SQL语句，commit()提交事务，fetchall()获得查询结果

LMDB

属于key-value型(键值对，非关系型)数据库

操作流程

通过env = lmdb.open()打开环境
通过txn = env.begin()建立事务
通过txn.put(key, value)进行插入和修改
通过txn.delete(key)进行删除
通过txn.get(key)进行查询
通过txn.cursor()进行遍历
通过txn.commit()提交更改

LevelDB

同为key-value数据库

操作流程

LevelDB操作时类似与LMDB，使用Put/Get/Delete，但是更加简单（不需要事务txn和commit提交），同时支持范围迭代器RangeIter。

SQLite与MySQL都是关系型数据库，操作时创建连接对象connection与光标对象cursor，通过execute执行SQL语句，commit提交变更，fetch得到查询结果；LMDB与LevelDB都是K-V数据库，操作时建立与数据库的连接，用put/delete改变数据，用get获取数据，区别是LMDB中有事务需要commit，LevelDB不需要。

OBject-Detection

Posted on 2018-05-14 | Edited on 2018-10-16

[TOC]

Object Detection:

graph LR
RCNN --> SPP_NET
SPP_NET-->Fast_RCNN
Fast_RCNN-->Faster_RCNN
Faster_RCNN-->Mask_RCNN

一、R-CNN

graph LR
image-->Region_Proposal
Region_Proposal-->CNN
CNN-->SVM
CNN-->Regression
Regression-->CNN

1.候选框

使用Region Proposal算法选择区域，一般2k个候选区域

2.CNN提取特征

一般使用AlexNet、VGG-16网络作为提取特征的CNN网络。先对网络进行预训练，使用ILVCR 2012作为数据集进行训练，输出为1000维的标签。与训练完后，将最后的分类层删除，4096维特征直接输入SVM进行分类。

3. SVM分类器分类，打分，非极大值抑制

CNN提取的特征输入SVM，分类

4.回归器finetune

CNN提取的特征输入回归器Regression进行对网络FineTune.

缺点：

多个候选区域对应的图像需要预先提取，占用较大的磁盘空间；
针对传统CNN需要固定尺寸的输入图像，crop/warp（归一化）产生物体截断或拉伸，会导致输入CNN的信息丢失；
每一个ProposalRegion都需要进入CNN网络计算，上千个Region存在大量的范围重叠，重复的特征提取带来巨大的计算浪费。

二、SPP NET

Spatial Pyramid Pooling

graph LR
image-->CNN
image-->Region_Proposal
CNN-->Features
Region_Proposal-->Features
Features-->SPP
SPP-->FC
FC-->SVM
FC-->Regression

改进

使用SPP特殊的pooling使得卷积计算大大减少

1.候选框

和RCNN相同

2.对整个图片做SPP_Net卷积

采用空间金字塔池化（SpatialPyramid Pooling ）替换了全连接层之前的最后一个池化层（上图top）.

把整张待检测的图片，输入CNN中，进行一次性特征提取，得到feature maps，然后在feature maps中找到各个候选框的区域，再对各个候选框采用金字塔空间池化，提取出固定长度的特征向量。而R-CNN输入的是每个候选框，然后在进入CNN，因为SPP-Net只需要一次对整张图片进行特征提取，速度会大大提升。

3.SVM

和RCNN相同

4.Regression

和RCNN相同

缺点：

1）和RCNN一样，训练过程仍然是隔离的，提取候选框 | 计算CNN特征| SVM分类 | Bounding Box回归独立训练，大量的中间结果需要转存，无法整体训练参数；

2）SPP-Net在无法同时Tuning在SPP-Layer两边的卷积层和全连接层，很大程度上限制了深度CNN的效果；

3）在整个过程中，Proposal Region仍然很耗时。

三、Fast-RCNN

graph LR
image-->CNN
CNN-->RIO_Pooling
image-->Region_Proposal
RIO_Pooling-->Features
Region_Proposal-->RIO_Pooling
Features-->SoftmaxLoss
SoftmaxLoss-->Loss
Features-->SmoothL1Loss
SmoothL1Loss-->Loss
Loss-->Features

改进

1）借鉴SPP思路，提出简化版的ROI池化层（注意，没用金字塔），同时加入了候选框映射功能，使得网络能够反向传播，解决了SPP的整体网络训练问题；

2）多任务Loss层

A）SoftmaxLoss代替了SVM，证明了softmax比SVM更好的效果
B）SmoothL1Loss取代Bouding box回归。

将分类和边框回归进行合并（又一个开创性的思路），通过多任务Loss层进一步整合深度网络，统一了训练过程，从而提高了算法准确度。

3）全连接层通过SVD加速

4）结合上面的改进，模型训练时可对所有层进行更新，除了速度提升外（训练速度是SPP的3倍，测试速度10倍），得到了更好的检测效果

1.候选框

同上

2.特征提取

整个图片CNN网络提取特征，取出候选框中的特征进行RIO Pooling得到Features

3.LOSS

使用SoftmaxLoss取代了SVM，SmoothL1Loss取代了回归器，最后两个Loss合并。

缺点

虽然已经很厉害了，但是候选框的选取还是慢啊

=========================================

四、Faster-RCNN

graph LR
image-->CNN
CNN-->Features
Features-->RPN
Features-->Feature
RPN-->Feature
Feature-->RIO_Pooling
RIO_Pooling-->SoftmaxLoss
SoftmaxLoss-->Loss
RIO_Pooling-->SmoothL1Loss
SmoothL1Loss-->Loss

改进

这个就厉害了，直接把候选框选取用一个网络实现了，而且是在特征提取过以后，想想就快啊。

Conda使用总结

Posted on 2018-05-10

1 创建新的开发环境：

创建新开发环境的语法如下：

1	conda create --name <envname> python=<version> <optional dependencies>

例如如下命令会在Anaconda安装目录下的envs目录中创建一个名为Python36的新开发环境并且使用的python版本为3.6。

conda create -n Python35 python=3.6

若要在当前目录下创建，则使用

conda create -p <envname> python=<version> <optional dependencies>

2 激活新的开发环境

Linux, OS X:

source activate <envname>

Windows:

activate <envname>

激活了开发环境后可以使用conda或者pip在该开发环境中安装新的包。

3 退出新的开发环境

Linux, OS X:

source deactivate

Windows:

deactivate

4 查看目前拥有的开发环境

conda info --envs

或者

conda env list

5 复制现有开发环境

conda create --name <envname> --clone <existing_env>

6 删除整个开发环境

conda env remove --name <envname> --all

7 导出和导入开发环境

导出：

conda env export > environment.yml

导入：

conda env create -f environment.yml

导出和导入都会自动帮你处理好pip中的依赖。

8 总结

更多说明请查阅官方文档。

python的log使用

Posted on 2018-05-08 | Edited on 2018-05-10

import logging
# create logger
logger_name = os.path.basename(__file__)
logger = logging.getLogger(logger_name)
logger.setLevel(logging.DEBUG)
# create_file
log_path = os.path.basename(__file__)+".log"
fh = logging.FileHandler(log_path)
fh.setLevel(logging.DEBUG)
#打印到屏幕
sh = logging.StreamHandler()
sh.setLevel(logging.DEBUG)

# create formatter
fmt = "[%(asctime)-15s] [%(levelname)s] [%(filename)s - %(lineno)d %(process)d]   %(message)s"
date_fmt = "%a %d %b %Y %H:%M:%S"
formatter = logging.Formatter(fmt, date_fmt)

# add handler and formatter to logger
fh.setFormatter(formatter)
logger.addHandler(fh)
logger.addHandler(sh)

logger.debug(sys.version)
logger.debug(os.getcwd())

PyCharm小技巧

Posted on 2018-05-05 | Edited on 2018-05-10

PyCharm 运行python传入参数

方式1:

启动Temminal，在终端运行

方式2:

在Toolbar找到 run/debug configuretions,选择要运行的脚本，在configuretions的Parameters 输入要传入的参数。

PE8 indentation contains tabs

PE8是一种python编码格式，在PyCharm中会自动检查是否符合这种格式，不符合会有警号。

怎么关闭呢。
File中打开setting,选择Editor,选择Inspactions,找到Python点击,吧PE8 coding style violation的选择取消。

程序运行过程简单记录

Posted on 2018-05-02 | Edited on 2018-05-05

一、从文本生成可执行目标程序

二、执行过程

2.1 读入键盘输入

读取键盘输入放在主存储器,输入回车时，结束命令输入.shell加载可执行hello文件。从硬盘读取到存储器。

2.2 加载程序

iamge

2.3 执行程序

一旦程序加载完成，处理器找到程序中的main 指令，将程序从主存储器复制到寄存器文件,再从寄存器文件复制到屏幕,最终屏幕显示

1	hello world

Taylor公式-多项函数展开

Posted on 2018-04-30

泰勒说：一个函数如果在一点很光滑，也就是说多少阶都可导，那这个一点附近的值就可以用这个点的值和一系列导数值表示。

下面看一下展开式：

一元函数在点xk处的泰勒展开式为：

1	f(x)=f(xk)+(x−xk)f′(xk)+12!(x−xk)2f′′(xk)+On

二元函数在点(xk,yk)处的泰勒展开式为：

1	f(x,y)=f(xk,yk)+(x−xk)f′x(xk,yk)+(y−yk)f′y(xk,yk)+12!(x−xk)2f′′xx(xk,yk)+12!(x−xk)(y−yk)f′′xy(xk,yk)+12!(x−xk)(y−yk)f′′yx(xk,yk)+12!(y−yk)2f′′yy(xk,yk)+On

多元函数(n)在点xk处的泰勒展开式为:

1	f(x1,x2,…,xn)=f(x1k,x2k,…,xnk)+∑i=1n(xi−xik)f′xi(x1k,x2k,…,xnk)+12!∑i,j=1n(xi−xik)(xj−xjk)f′′ij(x1k,x2k,…,xnk)+On

把Taylor展开式写成矩阵的形式：

1	f(x)=f(xk)+[∇f(xk)]T(x−xk)+12![x−xk]TH(xk)[x−xk]+On

其中：