编程 Python

python开启摄像头以及深度学习实现目标检测方法

Posted in Python onAugust 03, 2018

最近想做实时目标检测，需要用到python开启摄像头，我手上只有两个uvc免驱的摄像头，性能一般。利用python开启摄像头费了一番功夫，主要原因是我的摄像头都不能用cv2的VideCapture打开，这让我联想到原来opencv也打不开Android手机上的摄像头（后来采用QML的Camera模块实现的）。看来opencv对于摄像头的兼容性仍然不是很完善。

我尝了几种办法：v4l2，v4l2_capture以及simpleCV，都打不开。最后采用pygame实现了摄像头的采集功能，这里直接给大家分享具体实现代码（python3.6，cv2，opencv3.3，ubuntu16.04）。中间注释的部分是我上述方法打开摄像头的尝试，说不定有适合自己的。

import pygame.camera
import time
import pygame
import cv2
import numpy as np
 
def surface_to_string(surface):
 """convert pygame surface into string"""
 return pygame.image.tostring(surface, 'RGB')
 
def pygame_to_cvimage(surface):
 """conver pygame surface into cvimage"""
 
 #cv_image = np.zeros(surface.get_size, np.uint8, 3)
 image_string = surface_to_string(surface)
 image_np = np.fromstring(image_string, np.uint8).reshape(480, 640, 3)
 frame = cv2.cvtColor(image_np, cv2.COLOR_BGR2RGB)
 return image_np, frame
 
 
pygame.camera.init()
pygame.camera.list_cameras()
cam = pygame.camera.Camera("/dev/video0", [640, 480])
 
cam.start()
time.sleep(0.1)
screen = pygame.display.set_mode([640, 480])
 
while True:
 image = cam.get_image()
 
 cv_image, frame = pygame_to_cvimage(image)
 
 screen.fill([0, 0, 0])
 screen.blit(image, (0, 0))
 pygame.display.update()
 cv2.imshow('frame', frame)
 key = cv2.waitKey(1)
 if key & 0xFF == ord('q'):
  break
 
 
 #pygame.image.save(image, "pygame1.jpg")
 
cam.stop()

上述代码需要注意一个地方，就是pygame图片和opencv图片的转化（pygame_to_cvimage）有些地方采用cv.CreateImageHeader和SetData来实现，注意这两个函数在opencv3+后就消失了。因此采用numpy进行实现。

至于目标检测，由于现在网上有很多实现的方法，MobileNet等等。这里我不讲解具体原理，因为我的研究方向不是这个，这里直接把代码贴出来，亲测成功了。

from imutils.video import FPS
import argparse
import imutils
 
 
import v4l2
import fcntl
 
import v4l2capture
import select
import image
 
import pygame.camera
import pygame
import cv2
import numpy as np
import time
 
def surface_to_string(surface):
 """convert pygame surface into string"""
 return pygame.image.tostring(surface, 'RGB')
 
def pygame_to_cvimage(surface):
 """conver pygame surface into cvimage"""
 
 #cv_image = np.zeros(surface.get_size, np.uint8, 3)
 image_string = surface_to_string(surface)
 image_np = np.fromstring(image_string, np.uint8).reshape(480, 640, 3)
 frame = cv2.cvtColor(image_np, cv2.COLOR_BGR2RGB)
 return frame
 
 
ap = argparse.ArgumentParser()
ap.add_argument("-p", "--prototxt", required=True, help="path to caffe deploy prototxt file")
ap.add_argument("-m", "--model", required=True, help="path to caffe pretrained model")
ap.add_argument("-c", "--confidence", type=float, default=0.2, help="minimum probability to filter weak detection")
args = vars(ap.parse_args())
 
CLASSES = ["background", "aeroplane", "bicycle", "bird", "boat", "bottle", "bus", "car", "cat", "chair", "cow",
   "diningtable", "dog", "horse", "motorbike", "person", "pottedplant", "sheep", "sofa", "train", "tvmonitor"]
COLORS = np.random.uniform(0, 255, size=(len(CLASSES), 3))
 
print("[INFO] loading model...")
net = cv2.dnn.readNetFromCaffe(args["prototxt"], args["model"])
 
 
print("[INFO] starting video stream ...")
 
###### opencv ########
#vs = VideoStream(src=1).start()
#
#camera = cv2.VideoCapture(0)
#if not camera.isOpened():
# print("camera is not open")
#time.sleep(2.0)
 
 
###### v4l2 ########
 
#vd = open('/dev/video0', 'r')
#cp = v4l2.v4l2_capability()
#fcntl.ioctl(vd, v4l2.VIDIOC_QUERYCAP, cp)
 
#cp.driver
 
 
##### v4l2_capture
#video = v4l2capture.Video_device("/dev/video0")
#size_x, size_y = video.set_format(640, 480, fourcc= 'MJPEG')
#video.create_buffers(30)
 
#video.queue_all_buffers()
 
#video.start()
 
##### pygame ####
pygame.camera.init()
pygame.camera.list_cameras()
cam = pygame.camera.Camera("/dev/video0", [640, 480])
 
cam.start()
time.sleep(1)
 
fps = FPS().start()
 
 
while True:
 #try:
 # frame = vs.read()
 #except:
 # print("camera is not opened")
 
 #frame = imutils.resize(frame, width=400)
 #(h, w) = frame.shape[:2]
 
 
 #grabbed, frame = camera.read()
 #if not grabbed:
 # break
 #select.select((video,), (), ())
 #frame = video.read_and_queue()
 
 #npfs = np.frombuffer(frame, dtype=np.uint8)
 #print(len(npfs))
 #frame = cv2.imdecode(npfs, cv2.IMREAD_COLOR)
 
 image = cam.get_image()
 frame = pygame_to_cvimage(image)
 
 frame = imutils.resize(frame, width=640)
 blob = cv2.dnn.blobFromImage(frame, 0.00783, (640, 480), 127.5)
 
 net.setInput(blob)
 detections = net.forward()
 
 for i in np.arange(0, detections.shape[2]):
 
  confidence = detections[0, 0, i, 2]
 
  if confidence > args["confidence"]:
 
   idx = int(detections[0, 0, i, 1])
   box = detections[0, 0, i, 3:7]*np.array([640, 480, 640, 480])
   (startX, startY, endX, endY) = box.astype("int")
 
   label = "{}:{:.2f}%".format(CLASSES[idx], confidence*100)
   cv2.rectangle(frame, (startX, startY), (endX, endY), COLORS[idx], 2)
   y = startY - 15 if startY - 15 > 15 else startY + 15
 
   cv2.putText(frame, label, (startX, y), cv2.FONT_HERSHEY_SIMPLEX, 0.5, COLORS[idx], 2)
 
 cv2.imshow("Frame", frame)
 key = cv2.waitKey(1)& 0xFF
 
 if key ==ord("q"):
  break
 
 
fps.stop()
print("[INFO] elapsed time :{:.2f}".format(fps.elapsed()))
print("[INFO] approx. FPS :{:.2f}".format(fps.fps()))
 
 
 
cv2.destroyAllWindows()
 
#vs.stop()

上面的实现需要用到两个文件，是caffe实现好的模型，我直接上传（文件名为MobileNetSSD_deploy.caffemodel和MobileNetSSD_deploy.prototxt，上google能够下载到）。

以上这篇python开启摄像头以及深度学习实现目标检测方法就是小编分享给大家的全部内容了，希望能给大家一个参考，也希望大家多多支持三水点靠木。

python开启摄像头以及深度学习实现目标检测方法

- Author -

红色未来

声明：登载此文出于传递更多信息之目的，并不意味着赞同其观点或证实其描述。

Python 相关文章推荐

python threading模块操作多线程介绍

Apr 08 Python

Python中用字符串调用函数或方法示例代码

Aug 04 Python

Tensorflow实现AlexNet卷积神经网络及运算时间评测

May 24 Python

python3.6使用pickle序列化class的方法

Oct 22 Python

为什么Python中没有"a++"这种写法

Nov 27 Python

简单介绍python封装的基本知识

Aug 10 Python

Python模块_PyLibTiff读取tif文件的实例

Jan 13 Python

python3 xpath和requests应用详解

Mar 06 Python

Pymysql实现往表中插入数据过程解析

Jun 02 Python

使用K.function()调试keras操作

Jun 17 Python

Python字符串格式化常用手段及注意事项

Jun 17 Python

深度学习tensorflow基础mnist

Apr 14 Python

Python函数参数操作详解

Aug 03 #Python

利用python打开摄像头及颜色检测方法

Aug 03 #Python

numpy添加新的维度:newaxis的方法

Aug 02 #Python

numpy.ndarray 交换多维数组(矩阵)的行/列方法

Aug 02 #Python

对numpy中的transpose和swapaxes函数详解

Aug 02 #Python

Numpy 改变数组维度的几种方法小结

Aug 02 #Python

python 字典中取值的两种方法小结

Aug 02 #Python

You might like

PHP动态编译出现Cannot find autoconf的解决方法

2014/11/05 PHP

javascript的数据类型、字面量、变量介绍

2012/05/23 Javascript

为EasyUI的Tab标签添加右键菜单的方法

2012/07/14 Javascript

js导航菜单(自写)简单大方

2013/03/28 Javascript

javascript抖动元素的小例子

2013/10/28 Javascript

js实现数字每三位加逗号的方法

2015/02/05 Javascript

jQuery 1.9.1源码分析系列（十五）动画处理之缓动动画核心Tween

2015/12/03 Javascript

AngularJS模块详解及示例代码

2016/08/17 Javascript

vue引入jq插件的实例讲解

2017/09/12 Javascript

Angular实现的简单查询天气预报功能示例

2017/12/27 Javascript

vue 配置多页面应用的示例代码

2018/10/22 Javascript

Vue中UI组件库之Vuex与虚拟服务器初识

2019/05/07 Javascript

taro开发微信小程序的实践

2019/05/21 Javascript

vue 获取url参数、get参数返回数组的操作

2020/11/12 Javascript

浅谈Vue开发人员的7个最好的VSCode扩展

2021/01/20 Vue.js

[08:44]和酒神一起战斗 DOTA2教你做大人

2014/03/27 DOTA

Python 列表(List)操作方法详解

2014/03/11 Python

Python3基础之函数用法

2014/08/13 Python

python使用PyGame模块播放声音的方法

2015/05/20 Python

Python工程师面试题与Python基础语法相关

2016/01/14 Python

python使用turtle库与random库绘制雪花

2018/06/22 Python

Python读取txt内容写入xls格式excel中的方法

2018/10/11 Python

在PyCharm下打包*.py程序成.exe的方法

2018/11/29 Python

Python3.5 Pandas模块之Series用法实例分析

2019/04/23 Python

python实现在线翻译功能

2020/03/03 Python

HTML+CSS3模拟心的跳动实例代码

2017/09/05 HTML / CSS

WWE美国职业摔角官方商店：WWE Shop

2018/11/15 全球购物

牦牛毛户外探险服装：Kora

2019/02/08 全球购物

会计实习生工作总结的自我评价

2013/10/07 职场文书

主治医师岗位职责

2013/12/10 职场文书

出纳工作岗位责任制

2014/02/02 职场文书

物流毕业生个人的自我评价

2014/02/13 职场文书

公司人事专员岗位职责

2014/08/11 职场文书

团代会闭幕词

2015/01/28 职场文书

幼儿园园长工作总结2015

2015/05/25 职场文书

Python访问Redis的详细操作

2021/06/26 Python