当前位置：首页 > article >正文

ros2-4.2 用python实现人脸识别

article 2025/3/1 2:27:16

需求:创建一个人脸检测服务提供图像，返回人脸数量位置信息
难点分析:
1.人脸怎么识别?face_recognition

2.图片数据和结果怎么传递?服务通信就很合适
3.没有合适的消息接口?自定义一个

4.2.1 自定义服务接口

在chat4/chapt_ws/src下新创建功能包

ros2 pkg create chapt4_interfaces --dependencies rosidl_default_generators sensor_msgs --license Apache-2.0

创建好功能包，在chapt4_interfaces 下创建目录srv,在srv下面创建文件：FaceDetector.srv

内容如下

sensor_msgs/Image image  # 原始图像
---
int16 number      # 人脸个数
float32 use_time  # 识别耗时
int32[] top       # 人脸在图像中位置
int32[] right
int32[] bottom
int32[] left

---分成了两部分，上面表示请求类型是 sensor_msgs/Image的图像。下面是response部分。

然后，再CMakeLists.txt 对该接口文件进行注册，声明为接口文件，添加依赖。代码如下

cmake_minimum_required(VERSION 3.8)
project(chapt4_interfaces)

if(CMAKE_COMPILER_IS_GNUCXX OR CMAKE_CXX_COMPILER_ID MATCHES "Clang")
  add_compile_options(-Wall -Wextra -Wpedantic)
endif()

# find dependencies
find_package(ament_cmake REQUIRED)
find_package(rosidl_default_generators REQUIRED)
find_package(sensor_msgs REQUIRED)

rosidl_generate_interfaces(${PROJECT_NAME}
  "srv/FaceDetector.srv"
  DEPENDENCIES sensor_msgs
)


if(BUILD_TESTING)
  find_package(ament_lint_auto REQUIRED)
  # the following line skips the linter which checks for copyrights
  # comment the line when a copyright and license is added to all source files
  set(ament_cmake_copyright_FOUND TRUE)
  # the following line skips cpplint (only works in a git repo)
  # comment the line when this package is in a git repo and when
  # a copyright and license is added to all source files
  set(ament_cmake_cpplint_FOUND TRUE)
  ament_lint_auto_find_test_dependencies()
endif()

ament_package()

还修改下package.xml ，添加<member_of_group>rosidl_interface_packages</member_of_group>，接下来进行构建。

bohu@bohu-TM1701:~/chapt4/chapt4_ws$ colcon build
Starting >>> chapt4_interfaces
Finished <<< chapt4_interfaces [6.51s]

Summary: 1 package finished [7.56s]
bohu@bohu-TM1701:~/chapt4/chapt4_ws$ source install/setup.bash
bohu@bohu-TM1701:~/chapt4/chapt4_ws$ ros2 interface show chapt4_interfaces/srv/FaceDetector
sensor_msgs/Image image # 原始图像
std_msgs/Header header #
builtin_interfaces/Time stamp
int32 sec
uint32 nanosec
string frame_id
# Header frame_id should be optical frame of camera
# origin of frame should be optical center of cameara
# +x should point to the right in the image
# +y should point down in the image
# +z should point into to plane of the image
# If the frame_id here and the frame_id of the CameraInfo
# message associated with the image conflict
# the behavior is undefined
uint32 height #
uint32 width #
string encoding #
# taken from the list of strings in include/sensor_msgs/image_encodings.hpp
uint8 is_bigendian #
uint32 step #
uint8[] data #
---
int16 number # 人脸个数
float32 use_time # 识别耗时
int32[] top # 人脸在图像中位置
int32[] right
int32[] bottom
int32[] left

4.2.2 人脸检测

pip3 install face_recognition -i https://pypi.tuna.tsinghua.edu.cn/simple

我安装的时候，对于依赖的dlib卡主了好几分钟。

Building wheel for dlib (setup.py) ... 一直在等

关于 face_recognition

github : GitHub - ageitgey/face_recognition: The world's simplest facial recognition api for Python and the command line

可以使用Python和命令行工具提取、识别、操作人脸。应该是1.2.2版本之后不在发新版本维护了。

创建demo_python_service 功能包。

ros2 pkg create demo_python_service --build-type ament_python --dependencies rclpy chapt4_interfaces --license Apache-2.0

添加一张图片，网上找了张我喜欢的大话西游。

在setup.py增加对于图片的复制配置，默认是不会复制到install下面

data_files=[

('share/ament_index/resource_index/packages',

['resource/' + package_name]),

('share/' + package_name, ['package.xml']),

('share/' + package_name+"/resource", ['resource/default.jpg']),

再次构建功能包，就发现图片被复制到chapt4_ws/install/demo_python_service/目录下

/share/demo_python_service/resource 有图片。

编写代码src/demo_python_service/demo_python_service 下创建文件：learn_face_detect.py.代码如下：

import face_recognition
import cv2
from ament_index_python.packages import get_package_share_directory #获取shares目录绝对路径

def main():
    #获取图片绝对路径
    default_image_path = get_package_share_directory('demo_python_service')+'/resource/default.jpg'
    print(f"path:{default_image_path}")
    #cv2加载图片
    image = cv2.imread(default_image_path)
    #检测人脸
    face_recognitions =face_recognition.face_locations(image,number_of_times_to_upsample=1,model='hog')
    #绘制人脸边框
    for top,right,bottom,left in face_recognitions:
        cv2.rectangle(image,(left,top),(right,bottom),(255,0,0),4)
    #显示结果
    cv2.imshow('face detection',image)
    cv2.waitKey(0)

依赖了face_recognition、cv2. 在setup.py对于learn_face_detect节点注册。构建并运行。

bohu@bohu-TM1701:~/chapt4/chapt4_ws$ colcon build
Starting >>> chapt4_interfaces
Finished <<< chapt4_interfaces [0.77s]                     
Starting >>> demo_python_service
Finished <<< demo_python_service [1.37s]          

Summary: 2 packages finished [2.52s]
bohu@bohu-TM1701:~/chapt4/chapt4_ws$ source install/setup.bash 
bohu@bohu-TM1701:~/chapt4/chapt4_ws$ ros2 run demo_python_service learn_face_detect
path:/home/bohu/chapt4/chapt4_ws/install/demo_python_service/share/demo_python_service/resource/default.jpg

效果如下：

4.2.3 人脸检测服务实现

在demo_python_service目录下创建face_detect_node.py 文件，代码如下：

import rclpy
from rclpy.node import Node
from chapt4_interfaces.srv import FaceDetector
import face_recognition
import cv2
from ament_index_python.packages import get_package_share_directory #获取shares目录绝对路径
from cv_bridge import CvBridge #用于图片格式转换
import time

class FaceDetectorionNode(Node):
    def __init__(self):
        super().__init__('face_detection_node')
        self.service_ = self.create_service(FaceDetector,'face_detect',self.detect_face_callback)
        self.bridge = CvBridge();
        self.number_of_times_to_upsample =1;
        self.model = "hog"
        self.default_image_path = get_package_share_directory('demo_python_service')+'/resource/default.jpg'
        self.get_logger().info("人脸检测服务已经启动")

    def detect_face_callback(self,request,response):
        if request.image.data:
            cv_image = self.bridge.imgmsg_to_cv2(request.image)
        else:
            cv_image = cv2.imread(self.default_image_path) #没有图像用默认兜底
            self.get_logger().info(f"传入图像为空，使用默认图像。")  
         #cv_image 已经是一个opencv格式的图像
        start_time = time.time()
        self.get_logger().info(f"加载完成图像，开始识别")  
         #检测人脸
        face_recognitions =face_recognition.face_locations(cv_image,
        number_of_times_to_upsample=self.number_of_times_to_upsample,model=self.model)
        response.use_time = time.time() -start_time
        response.number = len(face_recognitions)
        for top,right,bottom,left in face_recognitions:
            response.top.append(top)
            response.right.append(right)
            response.bottom.append(bottom)
            response.left.append(left)


        return response    #必须返回response

def main(args=None):
    rclpy.init(args=args)
    node = FaceDetectorionNode()
    rclpy.spin(node)
    rclpy.shutdown()

注意的是，使用了CvBridge 进行图片格式（从ros2的图像到opencv格式）的转换。

在代码FaceDetectorionNode 类中继承了Node，并在初始化__init__方法中调用create_service创建了一个服务，三个参数：FaceDetector 是消息接口，第二个face_detect是服务名，第三个detect_face_callback是回调函数。主要逻辑处理在回调函数。response需要返回，其余逻辑跟上一节类似，图像格式转换，进行检测，对response数据赋值。

接着在setup.py对face_detect_node节点进行注册，然后构建和运行。

bohu@bohu-TM1701:~/chapt4/chapt4_ws$ colcon build
Starting >>> chapt4_interfaces
Finished <<< chapt4_interfaces [0.64s]                     
Starting >>> demo_python_service
Finished <<< demo_python_service [1.17s]          

Summary: 2 packages finished [2.16s]
bohu@bohu-TM1701:~/chapt4/chapt4_ws$ ros2 run demo_python_service face_detect_node 
[INFO] [1736320005.105902282] [face_detection_node]: 人脸检测服务已经启动
[INFO] [1736320181.732996097] [face_detection_node]: 传入图像为空，使用默认图像。
[INFO] [1736320181.733454024] [face_detection_node]: 加载完成图像，开始识别

新开终端,查看服务列表并调用

bohu@bohu-TM1701:~/chapt4/chapt4_ws$ ros2 service list -t
/face_detect [chapt4_interfaces/srv/FaceDetector]
/face_detection_node/describe_parameters [rcl_interfaces/srv/DescribeParameters]
/face_detection_node/get_parameter_types [rcl_interfaces/srv/GetParameterTypes]
/face_detection_node/get_parameters [rcl_interfaces/srv/GetParameters]
/face_detection_node/list_parameters [rcl_interfaces/srv/ListParameters]
/face_detection_node/set_parameters [rcl_interfaces/srv/SetParameters]
/face_detection_node/set_parameters_atomically [rcl_interfaces/srv/SetParametersAtomically]
bohu@bohu-TM1701:~/chapt4/chapt4_ws$ source install/setup.bash 
bohu@bohu-TM1701:~/chapt4/chapt4_ws$ ros2 service call /face_detect chapt4_interfaces/srv/FaceDetector
requester: making request: chapt4_interfaces.srv.FaceDetector_Request(image=sensor_msgs.msg.Image(header=std_msgs.msg.Header(stamp=builtin_interfaces.msg.Time(sec=0, nanosec=0), frame_id=''), height=0, width=0, encoding='', is_bigendian=0, step=0, data=[]))

response:
chapt4_interfaces.srv.FaceDetector_Response(number=2, use_time=0.2230970859527588, top=[217, 112], right=[687, 438], bottom=[440, 379], left=[464, 171])

4.2.4 人脸检测客户端实现

再次换一张图片，我这次换了电影海报，人脸多一些。放到resouce下，叫做2.jpg

setup.py也要加一下新图片复制。

接着src/demo_python_service/demo_python_service 下创建文件:face_detect_client_node.py

代码如下：

import rclpy
from rclpy.node import Node
from chapt4_interfaces.srv import FaceDetector
import face_recognition
import cv2
from ament_index_python.packages import get_package_share_directory #获取shares目录绝对路径
from cv_bridge import CvBridge #用于图片格式转换
import time

class FaceDetectClientNode(Node):
    def __init__(self):
        super().__init__('face_detect_client_node')
        self.bridge = CvBridge();
        self.default_image_path = get_package_share_directory('demo_python_service')+'/resource/2.jpg'
        self.get_logger().info("人脸检测客户端已经启动")
        self.client = self.create_client(FaceDetector,'face_detect')
        self.image = cv2.imread(self.default_image_path) 

    def send_request(self):
        #1 判断服务端是否在线
        while self.client.wait_for_service(timeout_sec=1.0) is False:
            self.get_logger().info('等待服务端上线')
        #2构造request
        request =  FaceDetector.Request()
        request.image = self.bridge.cv2_to_imgmsg(self.image)
        #3 发送请求并等待处理完成
        future = self.client.call_async(request) 
        #现在future 并没有包含结果，需要等待服务端处理完成,不能直接sleep
        #rclpy.spin_until_future_complete(self,future)
        def reslut_callback(result_future):
            response =result_future.result() #获取响应
            self.get_logger().info(f'响应检测到{response.number}张脸，耗时{response.use_time}')
            self.show_respnse(response)
        future.add_done_callback(reslut_callback)    

    def show_respnse(self,response):
        for i in range(response.number):
            top = response.top[i]
            right = response.right[i]
            bottom = response.bottom[i]
            left = response.left[i]
            cv2.rectangle(self.image, (left, top),
                        (right, bottom), (255, 0, 0), 2)
            
        cv2.imshow('人脸识别result:',self.image)
        cv2.waitKey(0) #也是阻塞的


def main(args=None):
    rclpy.init(args=args)
    node = FaceDetectClientNode()
    node.send_request()
    rclpy.spin(node)
    rclpy.shutdown()

处理显示图像跟之前差不多，send_request比较巧妙的一点是可以使用异步请求，返回的future没有包含结果，需要等服务端处理完成。这里书上例子采用rclpy.spin_until_future_complete 来等待异步返回结果，视频里小鱼老师用回调函数又演示了一版。

还是要在setup.py注册face_detect_client_node 节点，接着构建。

先启动服务端 ros2 run demo_python_service face_detect_node，再运行客户端

bohu@bohu-TM1701:~/chapt4/chapt4_ws$ colcon build
Starting >>> chapt4_interfaces
Finished <<< chapt4_interfaces [0.71s]
Starting >>> demo_python_service
Finished <<< demo_python_service [1.40s]          

Summary: 2 packages finished [2.47s]
bohu@bohu-TM1701:~/chapt4/chapt4_ws$ ros2 run demo_python_service face_detect_client_node 
[INFO] [1736326208.922806215] [face_detect_client_node]: 人脸检测客户端已经启动
[INFO] [1736326209.369307885] [face_detect_client_node]: 响应检测到8张脸，耗时0.34261298179626465