2017.12.27 edit lesson 5

yijun_latitude · yijun_latitude · commit 921ccc039694 · 2016-12-27T15:23:56.000+08:00
diff --git a/python_basic/python_basic_lesson_05.ipynb b/python_basic/python_basic_lesson_05.ipynb
@@ -696,11 +696,233 @@
     "\n",
     "print(matches)"
    ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "---\n",
+    "\n",
+    "## 文件和目录操作之二\n",
+    "\n",
+    "读写文件是最常见的IO操作。Python内置了读写文件的函数，用法和C是兼容的。\n",
+    "\n",
+    "读写文件前，我们先必须了解一下，在磁盘上读写文件的功能都是由操作系统提供的，现代操作系统不允许普通的程序直接操作磁盘，所以，读写文件就是请求操作系统打开一个文件对象，然后，通过操作系统提供的接口从这个文件对象中读取数据，或者把数据写入这个文件对象。\n",
+    "\n",
+    "##### 读文件\n",
+    "\n",
+    "函数 `open()` 返回 文件对象，通常的用法需要两个参数：`open(filename, mode)`。分别是文件名和打开模式\n",
+    "\n",
+    "在做下面的例子前，我们要创建一个 `test.txt` 文件，并且保证其中的内容是如下样式，包含三行内容：\n",
+    "\n",
+    "> hello\n",
+    "\n",
+    "> hi\n",
+    "\n",
+    "> byebye\n",
+    "\n",
+    "文件保存在可以访问的目录，我这里就保存在和 notebook 同样的目录\n",
+    "\n",
+    "> 使用 jupyter 可以直接新建 Text File，来完成建立和编辑文本文件"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {
+    "collapsed": true
+   },
+   "outputs": [],
+   "source": [
+    "import os\n",
+    "\n",
+    "# 获得当前路径\n",
+    "cd = os.getcwd()\n",
+    "\n",
+    "print(cd)\n",
+    "\n",
+    "# 拼接完整文件名\n",
+    "filename = os.path.join('/Users/Feng', 'test.txt')\n",
+    "\n",
+    "print(filename)\n",
+    "\n",
+    "try:\n",
+    "    # 打开文件\n",
+    "    f = open(filename, 'r')\n",
+    "    print(f.read())\n",
+    "finally:\n",
+    "    if f:\n",
+    "        f.close()"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {
+    "collapsed": true
+   },
+   "outputs": [],
+   "source": [
+    "# 简化调用方式\n",
+    "# 省却了 try...finally，会有 with 来自动控制\n",
+    "\n",
+    "with open(filename, 'r') as f:\n",
+    "    print(f.read())"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {
+    "collapsed": true
+   },
+   "outputs": [],
+   "source": [
+    "with open(filename, 'r') as f:\n",
+    "    lines = f.readlines()\n",
+    "\n",
+    "print(type(lines))\n",
+    "print(lines)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {
+    "collapsed": true
+   },
+   "outputs": [],
+   "source": [
+    "for i in lines:\n",
+    "    print(i)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {
+    "collapsed": true
+   },
+   "outputs": [],
+   "source": [
+    "# 更简单的按行读取文件内容方法\n",
+    "with open(filename, 'r') as f:\n",
+    "    for eachline in f:\n",
+    "        print(eachline)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "---\n",
+    "\n",
+    "##### 写文件\n",
+    "\n",
+    "写文件和读文件是一样的，唯一区别是调用 `open()` 函数时，传入标识符 `'w'` 或者 `'wb'` 表示写文本文件或写二进制文件。\n",
+    "\n",
+    "r 以读方式打开\n",
+    "w 以写方式打开\n",
+    "a 以追加模式打开（必要时候创建新文件）"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {
+    "collapsed": true
+   },
+   "outputs": [],
+   "source": [
+    "# 写文件\n",
+    "import os\n",
+    "\n",
+    "# 获得当前路径\n",
+    "cd = os.getcwd()\n",
+    "\n",
+    "# 拼接完整文件名\n",
+    "filename= os.path.join(cd, 'test2.txt')\n",
+    "\n",
+    "# 换行符\n",
+    "br = os.linesep\n",
+    "\n",
+    "# 写文件\n",
+    "with open(filename, 'w') as f:\n",
+    "    f.write('Hello, World!' + br)\n",
+    "    f.write('Hello, Shanghai!' + br)\n",
+    "    f.write('Hello, CHINA!' + br)\n",
+    "    \n",
+    "with open(filename, 'r') as f:\n",
+    "    print(f.read())"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {
+    "collapsed": true
+   },
+   "outputs": [],
+   "source": [
+    "##### 操作系统和文件系统差异处理\n",
+    "\n",
+    "`linesep` 文件中分隔行的字符串\n",
+    "`path.sep` 分割文件路径名的字符串\n",
+    "`curdir` 当前工作目录的字符串\n",
+    "`pardir` 当前工作目录的父目录字符串"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "---\n",
+    "\n",
+    "##### 使用 glob 包查找文件\n",
+    "\n",
+    "glob 是 python 自己带的一个文件操作相关模块，很简洁，用它可以查找符合自己目的的文件，就类似于Windows下的文件搜索，而且也支持通配符: `*,?,[]` 这三个通配符，\\* 代表0个或多个字符，? 代表一个字符，[] 匹配指定范围内的字符，如[0-9]匹配数字。\n",
+    "\n",
+    "glob 的主要方法也叫 glob，该方法返回所有匹配的文件路径列表，该方法需要一个参数用来指定匹配的路径字符串"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {
+    "collapsed": true
+   },
+   "outputs": [],
+   "source": [
+    "# 使用 glob 来遍历指定路径下的指定类型文件\n",
+    "import glob\n",
+    "\n",
+    "# notebook 写法\n",
+    "glob.glob('/Users/yijun/dev_python/*/*.py')\n",
+    "\n",
+    "# IDLE 写法\n",
+    "l = glob.glob('/Users/yijun/dev_python/*/*.py')\n",
+    "for i in l:\n",
+    "    print(i)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {
+    "collapsed": true
+   },
+   "outputs": [],
+   "source": [
+    "l = glob.glob('/Users/yijun/dev_python/*/e2*.py')\n",
+    "for i in l:\n",
+    "    print(i)"
+   ]
   }
  ],
  "metadata": {
+  "anaconda-cloud": {},
   "kernelspec": {
-   "display_name": "Python 3",
+   "display_name": "Python [default]",
    "language": "python",
    "name": "python3"
   },
@@ -714,7 +936,7 @@
    "name": "python",
    "nbconvert_exporter": "python",
    "pygments_lexer": "ipython3",
-   "version": "3.4.5"
+   "version": "3.5.2"
   }
  },
  "nbformat": 4,