首页 > 编程语言 >pptx:Python的PPT操作库

pptx:Python的PPT操作库

时间:2023-09-28 09:04:37浏览次数:64  
标签:pptx Python text slide prs PPT import Presentation

1. 安装模块

Windows用户打开命令行输入:pip install python-pptx

Mac用户打开终端/Terminal输入:pip3 install python-pptx

使用windows系统,如果出现无法安装情况,可以在cmd模式下输入网址选择国内清华镜像。

pip install -i https://pypi.tuna.tsinghua.edu.cn/simple python-pptx

导入模块:import pptx

2. 读取PPT文档内容

先了解下PPT基本结构在python分别是什么含义:

Slide:幻灯片,就是演示文稿中每一页的页面。

Shape:方框,在每页幻灯片内插入的方框,可以是形状,也可以是文本框。

Run:文字块,一般为较少字符。

Paragraph:段落,通常有序号ㆍ、1.等。

2.1 幻灯片 slide

1)获取slide

.slides-> 得到一个列表,包含了每个slide

from pptx import Presentation

prs = Presentation('示例文件.pptx')
for slide in prs.slides:
    print(slide)

输出结果:

<pptx.slide.Slide object at 0x0000000003737318>
<pptx.slide.Slide object at 0x0000000003737228>
<pptx.slide.Slide object at 0x0000000003737818>
<pptx.slide.Slide object at 0x0000000003737408>
<pptx.slide.Slide object at 0x00000000037377C8>
<pptx.slide.Slide object at 0x00000000037376D8>
<pptx.slide.Slide object at 0x0000000003737F98>
<pptx.slide.Slide object at 0x00000000037372C8>
<pptx.slide.Slide object at 0x00000000037373B8>

2.2 形状 shape

1) 获取形状 shape

from pptx import Presentation

prs = Presentation('示例文件.pptx')
for slide in prs.slides:
    for shape in slide.shapes:
        print(shape)

输出结果:

<pptx.shapes.autoshape.Shape object at 0x000000000379C390>
<pptx.shapes.picture.Picture object at 0x000000000379C4E0>
<pptx.shapes.picture.Picture object at 0x000000000379C0F0>
<pptx.shapes.placeholder.SlidePlaceholder object at 0x000000000379C080>
<pptx.shapes.placeholder.SlidePlaceholder object at 0x000000000379C400>
<pptx.shapes.placeholder.SlidePlaceholder object at 0x000000000379C390>
<pptx.shapes.graphfrm.GraphicFrame object at 0x000000000379C080>
<pptx.shapes.placeholder.SlidePlaceholder object at 0x000000000379C400>

2) 输出shape中的文字

shape.has_text_frame->是否有文字
shape.text_frame->获取文字框

# 获取所有Slide中的内容
from pptx import Presentation

prs = Presentation('示例文件.pptx')
for slide in prs.slides:
    for shape in slide.shapes:
        if shape.has_text_frame:
            text_frame = shape.text_frame
            print(text_frame.text)
# 获取某一页Slide中的内容
from pptx import Presentation

prs = Presentation('示例文件.pptx')
for i, slide in enumerate(prs.slides):
    if i == 3:
        for shape in slide.shapes:
            if shape.has_text_frame:
                text_frame = shape.text_frame
                print(text_frame.text)

2.3 段落 paragraph

1)输出shape中的某个paragraph

从shape中找paragraphs-> 获取shpae中的段落
for paragragh in text_frame.paragraphs:
print(paragragh.text)

from pptx import Presentation

prs = Presentation('示例文件.pptx')
for slide in prs.slides:
    for shape in slide.shapes:
        if shape.has_text_frame:
            text_frame = shape.text_frame
            for paragragh in text_frame.paragraphs:
                print(paragragh.text)

"""
注意:
该方法同样也直接获取Shpae中的文字内容;
但是这个更灵活,先获取每个Shape,然后在获取每个Shape中的paragraph;
我们可以针对paragraph,写一个判断条件,只获取第几个paragraph;
""

3. 向PPT文档写入内容

幻灯片母版及占位符

Slides_layouts:版式,一个幻灯片母版由多个版式组成,索引从0开始。

Placeholder:占位符:存在PPT母版里面的幻灯片的某一部件:Placeholder

3.1 添加slide和内容

1)选择PPT模板

a、使用ppt自带的模板

prs= Presentation()
prs.slide_layouts[index]

ppt自带了常用的1-48种模板通过index选择对应的模板

b、使用自定义ppt模板

prs= Presentation('template.pptx')

2)确认占位符id

prs.slides_layouts[0] # 获取第一套母版的第一个版式

from pptx import Presentation

prs = Presentation("示例文件2.pptx")
slide = prs.slides.add_slide(prs.slide_layouts[0])  # 用第一个母版生成一页ppt
for shape in slide.placeholders:         # 获取这一页所有的占位符
    phf = shape.placeholder_format
    print(f'{phf.idx}--{shape.name}--{phf.type}')  # id号--占位符形状名称-占位符的类型

输出结果:

0–Title 1–TITLE (1)
14–Text Placeholder 2–BODY (2)
15–Text Placeholder 3–BODY (2)
16–Text Placeholder 4–BODY (2)
17–Text Placeholder 5–BODY (2)
18–Text Placeholder 6–BODY (2)

title 标题类型的占位符
subtitle 副标题类型的占位符
body 正文类型的占位符

3)向占位符内填写内容

shape.text = 字符串
prs.save(文件路径.文件名) #同文件夹内可以省略文件路径

绝对文件路径如:C:/Users/Administrator/Desktop/test.pptx

from pptx import Presentation
from datetime import datetime

prs = Presentation("示例文件2.pptx")
slide = prs.slides.add_slide(prs.slide_layouts[0])  # 用第一个母版生成一页ppt
for shape in slide.placeholders:         # 获取这一页所有的占位符
    phf = shape.placeholder_format
    print(f'{phf.idx}--{shape.name}--{phf.type}')
    shape.text = f'{phf.idx}--{phf.type}'  #在占位符中填写“占位符id号--占位符类型”
prs.save('向占位符内填写内容.pptx')

写入前:

写入后:

4)修改占位符的内容

slide.placeholders[占位符ID]

[变量名].text = 字符串

from pptx import Presentation
from datetime import datetime
prs = Presentation("示例文件2.pptx")
slide = prs.slides.add_slide(prs.slide_layouts[0])  # 用第一个母版生成一页ppt
# 确定填写位置
winner_name = slide.placeholders[18]
certificate_type =slide.placeholders[0]
this_is_to_certify_that = slide.placeholders[14]
winning_reason =slide.placeholders[15]
award_presenter = slide.placeholders[16]
award_date = slide.placeholders[17]
# 修改占位符里的填写内容
winner_name.text = "韩梅梅"
certificate_type.text = "三好学生奖状"
this_is_to_certify_that.text = "兹证明"
winning_reason.text = "德学体美劳全面发展"
award_presenter.text = "xx校长"
award_date.text = str(datetime.now().date())

prs.save('修改占位符的内容.pptx')

3.2 添加段落

from pptx import Presentation

prs = Presentation()
bullet_slide_layout = prs.slide_layouts[1] # 使用第2个版式新建幻灯片
slide = prs.slides.add_slide(bullet_slide_layout)
shapes = slide.shapes
title_shape = shapes.title
body_shape =shapes.placeholders[1]

title_shape.text = "添加项目符号列表页"  # 给title占位符添加内容

tf = body_shape.text_frame
tf.text = "带圆点的项目符号行1"   # 给body占位符添加内容

p = tf.add_paragraph()
p.text = "带圆点的项目符号行2"   # 在原来的基础上,添加第一个段落

p = tf.add_paragraph()
p.text = "带圆点的项目符号行3"  # 在原来的基础上,添加第个段落
prs.save('添加段落paragraph.pptx')

给段落设定层级关系

paragraph.level = 层级数【0是最顶层】

p = tf.add_paragraph()
p.text = "带圆点的项目符号行2"
p.level = 1

p = tf.add_paragraph()
p.text = "带圆点的项目符号行3"
p.level = 2

3.3 添加文本框

slide.shapes.add_textbox(left,top.width,height)

  • 厘米写法
from pptx import Presentation
from pptx.util import Cm,Pt

prs = Presentation()
blank_slide_layout = prs.slide_layouts[6]
slide = prs.slides.add_slide(blank_slide_layout)

left = top = width = height =Cm(3)  # # left,top为相对位置,width,height为文本框大小。满足条件顺序是左>上>右>下
text_box = slide.shapes.add_textbox(left,top,width,height)
tf = text_box.text_frame
tf.text = "这是一段文本框里的文字"

p = tf.add_paragraph()
p.text = "这是第二段文字,加粗,字号40"
p.font.bold = True
p.font.size = Pt(40)

prs.save('添加文本框.pptx')
  • 英尺写法
from pptx import Presentation
from pptx.util import Inches
left= top= width= height= Inches(5)

textbox= slide.shapes.add_textbox(left, top, width, height)
textbox.text= '这是一段文本框里的文字'

new_para= textbox.text_frame.add_paragraph()
new_para.text= '这是第二段文字'

prs.save('添加文本框.pptx')

3.4 添加形状

slide_layout = prs.slide_layouts[SLD_LAYOUT_TITLE_AND_CONTENT]

from pptx import Presentation
from pptx.enum.shapes import MSO_SHAPE
from pptx.util import Inches

SLD_LAYOUT_TITLE_AND_CONTENT = 6
prs = Presentation()
slide_layout = prs.slide_layouts[SLD_LAYOUT_TITLE_AND_CONTENT]
slide = prs.slides.add_slide(slide_layout)

# 支持的形状可以看:
# https://docs.microsoft.com/zh-cn/office/vba/api/Office.MsoAutoShapeType
shapes = slide.shapes
left, top, width, height = Inches(1), Inches(3), Inches(1.8), Inches(1)
shape = shapes.add_shape(MSO_SHAPE.ROUNDED_RECTANGLE, left, top, width, height)
shape.text ='第一步'

for n in range(2,6):
    left=left+width-Inches(-0.8)
    shape = slide.shapes.add_shape(MSO_SHAPE.CHEVRON,left,top,width,height)
    shape.text = f'第{n}步'
prs.save('添加形状.pptx')

3.5 添加图片

slide.shapes.add_picture(图片路径,距离左边,距离顶端,宽度,高度)

from pptx import Presentation
from pptx.util import Cm

prs = Presentation()
blank_slide_layout = prs.slide_layouts[6]
slide = prs.slides.add_slide(blank_slide_layout)

left = top = Cm(3)
pic = slide.shapes.add_picture('butterfly.png',left,top)

left = height = Cm(5)
pic = slide.shapes.add_picture('butterfly.png',left,top, height=height)

prs.save('添加图片.pptx')

3.6 添加表格

shapes.add_table(rows,cols,left,top,width,height).table

from pptx import Presentation
from pptx.util import Cm

prs = Presentation()
blank_slide_layout = prs.slide_layouts[6] #使用第7个空白版式创建新的幻灯片
slide = prs.slides.add_slide(blank_slide_layout)

rows,cols = 4,2
left = top =Cm(5)
width = Cm(18)
height = Cm(3)

table = slide.shapes.add_table(rows,cols,left,top,width,height).table
# 可以修改列宽、行高
table.columns[0].width = Cm(6)
table.columns[1].width = Cm(4)
table.rows[0].height =Cm(2)

data = [
    ['姓名','成绩'],
    ['李雷',99],
    ['韩梅梅', 92],
    ['马东梅', 92],
]
for row in range(rows):
    for col in range(cols):
        table.cell(row,col).text =str(data[row][col])

prs.save('添加表格.pptx')

3.7 PPT文档内容样式批量调整

1)文本框位置调整

对齐文本方式:

顶端对齐 .TOP

底端对齐 .BOTTOM

中间对齐 .MIDDLE

from pptx import Presentation
from pptx.util import Cm
from pptx.enum.text import MSO_ANCHOR, MSO_AUTO_SIZE

prs = Presentation()
blank_slide_layout = prs.slide_layouts[6]
slide = prs.slides.add_slide(blank_slide_layout)

left = top = width = height =Cm(3)
text_box = slide.shapes.add_textbox(left,top,width,height)
tf = text_box.text_frame
tf.text = "这是一段文本框里的文字"

tf.margin_bottom = Cm(0.1) #下边距
tf.margin_left = 0 #左边距
tf.vertical_anchor = MSO_ANCHOR.BOTTOM # 对齐文本方式:底端对齐
tf.word_wrap = True # 框中的文字自动换行
prs.save('文本框样式的调整.pptx')

2)文本框背景颜色调整

from pptx.dml.color import RGBColor

fill = text_box.fill
text_box = slide.shapes.add_textbox(left, top, width, height)
fill.solid()  # 纯色填充
fill.fore_color.rgb = RGBColor(247, 150, 70)

from pptx import Presentation
from pptx.util import Cm, Pt
from pptx.enum.text import MSO_ANCHOR, MSO_AUTO_SIZE
from pptx.dml.color import RGBColor

prs = Presentation()
# 使用第七个版式
black_slide_layout = prs.slide_layouts[6]
slide = prs.slides.add_slide(black_slide_layout)

left = top = width = height = Cm(3)
text_box = slide.shapes.add_textbox(left, top, width, height)
tf = text_box.text_frame
tf.text = "这是一段文本框里面的文字"
# -------------------------------------- #
tf.margin_bottom = Cm(0.1) # 下边距
tf.margin_left = 0 # 下边距
tf.vertical_anchor = MSO_ANCHOR.BOTTOM
tf.word_wrap = True # 框中的文字自动换行
# -------------------------------------- #
fill = text_box.fill
fill.solid()
# 使用之前一定要导入RGBColor这个库
fill.fore_color.rgb = RGBColor(100, 149, 237)

prs.save("文本框背景色的调整.pptx")

3)文本框边框样式调整

line = text_box.line

from pptx import Presentation
from pptx.util import Cm, Pt
from pptx.enum.text import MSO_ANCHOR, MSO_AUTO_SIZE
from pptx.dml.color import RGBColor

prs = Presentation()
# 使用第七个版式
black_slide_layout = prs.slide_layouts[6]
slide = prs.slides.add_slide(black_slide_layout)

left = top = width = height = Cm(3)
text_box = slide.shapes.add_textbox(left, top, width, height)
tf = text_box.text_frame
tf.text = "这是一段文本框里面的文字"
# -------------------------------------- #
tf.margin_bottom = Cm(0.1) # 下边距
tf.margin_left = 0 # 下边距
tf.vertical_anchor = MSO_ANCHOR.BOTTOM
tf.word_wrap = True # 框中的文字自动换行
# -------------------------------------- #
fill = text_box.fill
fill.solid()
# 使用之前一定要导入RGBColor这个库
fill.fore_color.rgb = RGBColor(247, 150, 70)
# -------------------------------------- #
line = text_box.line
line.color.rgb = RGBColor(255, 0, 0)
line.width = Cm(0.3)

prs.save("文本框边框样式调整.pptx")

4)段落及字体样式调整

段落对齐调整

from pptx.enum.text import PP_ALIGN

p.alignment = 对齐方式

PP_ALIGN.LEFTPP_ALIGN.CENTERPP_ALIGN.RIGHT

from pptx import Presentation
from pptx.enum.text import PP_ALIGN
from pptx.util import Cm

prs = Presentation()
# 使用第七个版式
black_slide_layout = prs.slide_layouts[6]
slide = prs.slides.add_slide(black_slide_layout)

left = top = width = height = Cm(3)
text_box = slide.shapes.add_textbox(left, top, width, height)
tf = text_box.text_frame
# ---------------------------- #
p = tf.add_paragraph()
p.text = "这是第二段文字"
p.alignment = PP_ALIGN.LEFT  # 左对齐

prs.save("段落对其调整.pptx")
段落其他样式调整

与word中的使用非常相似:

.add_run() 添加新的文字块
.level 段落缩进层级
.line_spacing 段落行间距
.runs 段落内的文字块
.space_after 段后距
.space_before 段前距

字体样式调整

.font.name 字体名称(可以直接设定为中文字体)

.font.bold 是否加粗

.font.italic 是否斜体

.font.color 字体颜色

.font.size 字体大小

from pptx import Presentation
from pptx.util import Cm, Pt
from pptx.enum.text import MSO_ANCHOR, MSO_AUTO_SIZE
from pptx.dml.color import RGBColor
from pptx.enum.text import PP_ALIGN

prs = Presentation()
# 使用第七个版式
black_slide_layout = prs.slide_layouts[6]
slide = prs.slides.add_slide(black_slide_layout)

left = top = width = height = Cm(3)
text_box = slide.shapes.add_textbox(left, top, width, height)
tf = text_box.text_frame
# ---------------------------- #
p = tf.add_paragraph()
p.text = "这是第二段文字"
p.alignment = PP_ALIGN.LEFT
# ------------------------------------- #
p.font.bold = True
p.font.name = "宋体"
p.font.color.rgb = RGBColor(247, 150, 70)
p.font.size = Pt(30)

prs.save("字体样式调整.pptx")

标签:pptx,Python,text,slide,prs,PPT,import,Presentation
From: https://www.cnblogs.com/arena/p/17734804.html

相关文章

  • python解决ModuleNotFoundError No module named 'HTMLTestRunner'问题修改
    1、报错截图2、解决方法:(1)py2:从http://tungwaiyip.info/software/HTMLTestRunner.html下载HTMLTestRunner.py并将文件放到python2安装目录的Lib下,然后再次运行文件,可成功运行! (2) py3:a: 从http://tungwaiyip.info/software/HTMLTestRunner.html下载HTMLTestRunner.py......
  • python 安装 cv2 - 已解决
    <!--more-->python安装cv2-已解决建议收藏提供多种操作,一定有一种适合你的情况就这么个情况,就是装不上。本文没一句废话,仔细阅读。1.传统方式1(执行一遍,一条条试)可尝试python3-mpipinstall--upgradepippipinstallcv2pip3installopencv-python国内......
  • Python:operator模块
    methodcaller:调用参数指定的方法,实例方法和类方法都可以;04.Python冷知识:你可能不知道的三个操作符-itemgetter,attrgetter,methodcaller_哔哩哔哩_bilibilifromoperatorimportitemgetter,attrgetter,methodcallerprint("下面演示itemgetter用法:")get_index_zero=......
  • 【Python进阶-PyQt5】00搭建PyQt5环境
    1.创建独立开发虚拟环境1.1虚拟环境简介我们编写的程序,有时用到的Python库是不一样的,比如说开发桌面应用程序我们主要用到PyQt5相关的Python库、开发Web应用程序我们主要用到Django相关的Python库等等。假设我们在开发桌面应用程序的时候除了PyQt5相关的Python库外,还附加了其他......
  • Python实现自动生成四则运算题目和答案检测
    这个作业属于哪个课程软件工程这个作业要求在哪里结对项目这个作业的目标实现自动生成四则运算题目功能,以及给定题目和答案、判断答案对错的功能本项目上传至个人GitHub:yulinnn/PythonProject_FourOperations项目需求题目:实现一个自动生成小学四则运算题目的......
  • mojo编程语言:编译后的mojo二进制执行文件调用python库报错——设置MOJO_PYTHON_LIBRAR
    代码:frompythonimportPythonfnf()raises:#ThisisequivalenttoPython's`importnumpyasnp`letnp=Python.import_module("numpy")leta=np.array([1,2,3])print(a)fnmain()raises:f() mojo编译后执行,报错:Moj......
  • python智能切分视频画面
    #pipinstallscenedetectopencv-python-ihttps://pypi.tuna.tsinghua.edu.cn/simplefromscenedetect.video_managerimportVideoManagerfromscenedetect.scene_managerimportSceneManagerfromscenedetect.stats_managerimportStatsManagerfromscenedetect.d......
  • 水果识别系统Python+TensorFlow+卷积神经网络算法【图像识别】
    引言随着科技的发展,我们生活中的各种便利工具日益增加。例如,你有没有想过,当你在超市里看到一个陌生的水果,却不知道它是什么名字时,有一个工具可以帮你识别出来?今天,我要为大家介绍一种基于Python的水果识别系统。这个系统不仅识别准确,还具有友好的用户界面。下面,让我们一起探索这个......
  • python+unittest跳过测试和预期失败
    在运行测试时,有时需要直接跳过某些测试用例,或者当测试用例符合某个条件时跳过测试,又或者直接将测试用例设置为失败。、importunittestfromleap_yearimportLeapYearclassTestLeapYear(unittest.TestCase):@unittest.skip("直接跳过测试")deftest_2000(self......
  • Python中super()的运作机制
    Python中super()的运作机制:super()的定义:defsuper(cls,instance):mro=inst.__class__.mro()returnmro[mro.index(cls)+1]mro(Methodresolutionorder)是python在多继承类中查找目标函数的策略,其采用的是广度优先算法,可通过类的mro方法查看。如下方代码:cla......