{ "cells": [ { "cell_type": "markdown", "metadata": { "_cell_guid": "02adde30-2633-41f1-a672-af8ff83a1b02", "_uuid": "22754d0cc0847be93bf947c7998d7fb65a2817d7" }, "source": [ "**대회 목적:**\n", "\n", "경쟁 데이터 세트에는 공개 도메인의 으스스한 작가가 쓴 소설의 텍스트가 포함되어 있습니다.\n", " 1. 에드거 앨런 포(EAP)\n", " 2. HP 러브크래프트(HPL)\n", " 3. 메리 울스턴크래프트 셸리(MWS)\n", " \n", "목표는 테스트 세트에서 문장의 저자를 정확하게 식별하는 것입니다.\n", "\n", "**노트북의 목적:**\n", "\n", "이 노트북에서 무시무시한 작성자를 식별하는 데 도움이 되는 다양한 기능을 만들어 보겠습니다.\n", "\n", "첫 번째 단계로 기능 엔지니어링 부분을 자세히 살펴보기 전에 몇 가지 기본 데이터 시각화 및 정리를 수행합니다." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# 요약\n", "\n", "1. 메타변수 생성 -> xgb -> np.mean(metrics.log_loss(실제값, 예측값))\n", "2. TfidfVectorizer -> MultinomialNB -> confusion_matrix-> np.mean(metrics.log_loss(실제값, 예측값))\n", "3. TfidfVectorizer -> TruncatedSVD -> np.mean(metrics.log_loss(실제값, 예측값))\n", "4. CountVectorizer(stop_words='english', ngram_range=(1,3)) -> MultinomialNB -> confusion_matrix -> np.mean(metrics.log_loss(실제값, 예측값))\n", "5. CountVectorizer(ngram_range=(1,7), analyzer='char')\n", "6. TfidfVectorizer(ngram_range=(1,5), analyzer='char') -> TruncatedSVD(n_components=n_comp, algorithm='arpack') -> pd.concat([train_df, train_svd], axis=1) -> 앞서한 예측값들 -> xgb -> xgb.plot_importance -> np.mean(metrics.log_loss(실제값, 예측값))\n" ] }, { "cell_type": "code", "execution_count": 1, "metadata": { "_cell_guid": "b31f62fb-bde8-410e-972c-3c092f22d497", "_uuid": "0fcdf81ce439d2215892af58f839edfc0ca80a91" }, "outputs": [], "source": [ "import numpy as np # linear algebra\n", "import pandas as pd # data processing, CSV file I/O (e.g. pd.read_csv)\n", "import matplotlib.pyplot as plt\n", "import seaborn as sns\n", "\n", "import nltk\n", "from nltk.corpus import stopwords\n", "import string # 문자/숫자의 리스트 출력\n", "\n", "import xgboost as xgb\n", "\n", "from sklearn.feature_extraction.text import TfidfVectorizer, CountVectorizer\n", "from sklearn.decomposition import TruncatedSVD\n", "from sklearn import ensemble, metrics, model_selection, naive_bayes\n", "color = sns.color_palette()\n", "\n", "%matplotlib inline\n", "\n", "eng_stopwords = set(stopwords.words(\"english\"))\n", "pd.options.mode.chained_assignment = None" ] }, { "cell_type": "code", "execution_count": 2, "metadata": { "_cell_guid": "add1b71c-e802-408f-8f62-7ea71ed155cf", "_uuid": "ae86f9515b3be4956e223db4c2472c2c0c40d9fb" }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Number of rows in train dataset : 19579\n", "Number of rows in test dataset : 8392\n" ] } ], "source": [ "## Read the train and test dataset and check the top few lines ##\n", "train_df = pd.read_csv(\"./input/train.csv\")\n", "test_df = pd.read_csv(\"./input/test.csv\")\n", "print(\"Number of rows in train dataset : \",train_df.shape[0])\n", "print(\"Number of rows in test dataset : \",test_df.shape[0])" ] }, { "cell_type": "code", "execution_count": 3, "metadata": { "_cell_guid": "35e22f81-77f9-438e-ba50-803aea15ad14", "_uuid": "83a9dfc6af30753f82f07ef6162d3b9ba155b06d" }, "outputs": [ { "data": { "text/html": [ "
\n", " | id | \n", "text | \n", "author | \n", "
---|---|---|---|
0 | \n", "id26305 | \n", "This process, however, afforded me no means of... | \n", "EAP | \n", "
1 | \n", "id17569 | \n", "It never once occurred to me that the fumbling... | \n", "HPL | \n", "
2 | \n", "id11008 | \n", "In his left hand was a gold snuff box, from wh... | \n", "EAP | \n", "
3 | \n", "id27763 | \n", "How lovely is spring As we looked from Windsor... | \n", "MWS | \n", "
4 | \n", "id12958 | \n", "Finding nothing else, not even gold, the Super... | \n", "HPL | \n", "
\n", " | id | \n", "text | \n", "author | \n", "num_words | \n", "num_unique_words | \n", "num_chars | \n", "num_stopwords | \n", "num_punctuations | \n", "num_words_upper | \n", "num_words_title | \n", "mean_word_len | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "id26305 | \n", "This process, however, afforded me no means of... | \n", "EAP | \n", "41 | \n", "35 | \n", "231 | \n", "19 | \n", "7 | \n", "2 | \n", "3 | \n", "4.658537 | \n", "
1 | \n", "id17569 | \n", "It never once occurred to me that the fumbling... | \n", "HPL | \n", "14 | \n", "14 | \n", "71 | \n", "8 | \n", "1 | \n", "0 | \n", "1 | \n", "4.142857 | \n", "
2 | \n", "id11008 | \n", "In his left hand was a gold snuff box, from wh... | \n", "EAP | \n", "36 | \n", "32 | \n", "200 | \n", "16 | \n", "5 | \n", "0 | \n", "1 | \n", "4.583333 | \n", "
3 | \n", "id27763 | \n", "How lovely is spring As we looked from Windsor... | \n", "MWS | \n", "34 | \n", "32 | \n", "206 | \n", "13 | \n", "4 | \n", "0 | \n", "4 | \n", "5.088235 | \n", "
4 | \n", "id12958 | \n", "Finding nothing else, not even gold, the Super... | \n", "HPL | \n", "27 | \n", "25 | \n", "174 | \n", "11 | \n", "4 | \n", "0 | \n", "2 | \n", "5.481481 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
19574 | \n", "id17718 | \n", "I could have fancied, while I looked at it, th... | \n", "EAP | \n", "20 | \n", "19 | \n", "108 | \n", "11 | \n", "3 | \n", "2 | \n", "2 | \n", "4.450000 | \n", "
19575 | \n", "id08973 | \n", "The lids clenched themselves together as if in... | \n", "EAP | \n", "10 | \n", "10 | \n", "55 | \n", "6 | \n", "1 | \n", "0 | \n", "1 | \n", "4.600000 | \n", "
19576 | \n", "id05267 | \n", "Mais il faut agir that is to say, a Frenchman ... | \n", "EAP | \n", "13 | \n", "13 | \n", "68 | \n", "4 | \n", "2 | \n", "0 | \n", "2 | \n", "4.307692 | \n", "
19577 | \n", "id17513 | \n", "For an item of news like this, it strikes us i... | \n", "EAP | \n", "15 | \n", "14 | \n", "74 | \n", "7 | \n", "3 | \n", "0 | \n", "1 | \n", "4.000000 | \n", "
19578 | \n", "id00393 | \n", "He laid a gnarled claw on my shoulder, and it ... | \n", "HPL | \n", "22 | \n", "21 | \n", "109 | \n", "14 | \n", "2 | \n", "0 | \n", "1 | \n", "4.000000 | \n", "
19579 rows × 11 columns
\n", "\n", " | num_words | \n", "num_unique_words | \n", "num_chars | \n", "num_stopwords | \n", "num_punctuations | \n", "num_words_upper | \n", "num_words_title | \n", "mean_word_len | \n", "
---|---|---|---|---|---|---|---|---|
0 | \n", "41 | \n", "35 | \n", "231 | \n", "19 | \n", "7 | \n", "2 | \n", "3 | \n", "4.658537 | \n", "
1 | \n", "14 | \n", "14 | \n", "71 | \n", "8 | \n", "1 | \n", "0 | \n", "1 | \n", "4.142857 | \n", "
2 | \n", "36 | \n", "32 | \n", "200 | \n", "16 | \n", "5 | \n", "0 | \n", "1 | \n", "4.583333 | \n", "
3 | \n", "34 | \n", "32 | \n", "206 | \n", "13 | \n", "4 | \n", "0 | \n", "4 | \n", "5.088235 | \n", "
4 | \n", "27 | \n", "25 | \n", "174 | \n", "11 | \n", "4 | \n", "0 | \n", "2 | \n", "5.481481 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
19574 | \n", "20 | \n", "19 | \n", "108 | \n", "11 | \n", "3 | \n", "2 | \n", "2 | \n", "4.450000 | \n", "
19575 | \n", "10 | \n", "10 | \n", "55 | \n", "6 | \n", "1 | \n", "0 | \n", "1 | \n", "4.600000 | \n", "
19576 | \n", "13 | \n", "13 | \n", "68 | \n", "4 | \n", "2 | \n", "0 | \n", "2 | \n", "4.307692 | \n", "
19577 | \n", "15 | \n", "14 | \n", "74 | \n", "7 | \n", "3 | \n", "0 | \n", "1 | \n", "4.000000 | \n", "
19578 | \n", "22 | \n", "21 | \n", "109 | \n", "14 | \n", "2 | \n", "0 | \n", "1 | \n", "4.000000 | \n", "
19579 rows × 8 columns
\n", "\n", " | num_words | \n", "num_unique_words | \n", "num_chars | \n", "num_stopwords | \n", "num_punctuations | \n", "num_words_upper | \n", "num_words_title | \n", "mean_word_len | \n", "
---|---|---|---|---|---|---|---|---|
0 | \n", "19 | \n", "19 | \n", "110 | \n", "9 | \n", "3 | \n", "1 | \n", "3 | \n", "4.842105 | \n", "
1 | \n", "62 | \n", "49 | \n", "330 | \n", "33 | \n", "7 | \n", "1 | \n", "3 | \n", "4.338710 | \n", "
2 | \n", "33 | \n", "30 | \n", "189 | \n", "15 | \n", "3 | \n", "0 | \n", "1 | \n", "4.757576 | \n", "
3 | \n", "41 | \n", "34 | \n", "223 | \n", "19 | \n", "5 | \n", "2 | \n", "3 | \n", "4.463415 | \n", "
4 | \n", "11 | \n", "11 | \n", "53 | \n", "6 | \n", "1 | \n", "1 | \n", "1 | \n", "3.909091 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
8387 | \n", "9 | \n", "9 | \n", "42 | \n", "7 | \n", "1 | \n", "0 | \n", "1 | \n", "3.777778 | \n", "
8388 | \n", "7 | \n", "7 | \n", "34 | \n", "4 | \n", "1 | \n", "1 | \n", "1 | \n", "4.000000 | \n", "
8389 | \n", "25 | \n", "24 | \n", "150 | \n", "11 | \n", "2 | \n", "0 | \n", "1 | \n", "5.040000 | \n", "
8390 | \n", "38 | \n", "34 | \n", "197 | \n", "21 | \n", "3 | \n", "2 | \n", "3 | \n", "4.210526 | \n", "
8391 | \n", "38 | \n", "33 | \n", "247 | \n", "18 | \n", "5 | \n", "0 | \n", "1 | \n", "5.526316 | \n", "
8392 rows × 8 columns
\n", "\n", " | num_words | \n", "num_unique_words | \n", "num_chars | \n", "num_stopwords | \n", "num_punctuations | \n", "num_words_upper | \n", "num_words_title | \n", "mean_word_len | \n", "
---|---|---|---|---|---|---|---|---|
1 | \n", "14 | \n", "14 | \n", "71 | \n", "8 | \n", "1 | \n", "0 | \n", "1 | \n", "4.142857 | \n", "
2 | \n", "36 | \n", "32 | \n", "200 | \n", "16 | \n", "5 | \n", "0 | \n", "1 | \n", "4.583333 | \n", "
3 | \n", "34 | \n", "32 | \n", "206 | \n", "13 | \n", "4 | \n", "0 | \n", "4 | \n", "5.088235 | \n", "
5 | \n", "83 | \n", "66 | \n", "468 | \n", "43 | \n", "6 | \n", "5 | \n", "5 | \n", "4.650602 | \n", "
6 | \n", "21 | \n", "21 | \n", "128 | \n", "9 | \n", "5 | \n", "0 | \n", "1 | \n", "5.142857 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
19573 | \n", "27 | \n", "24 | \n", "143 | \n", "11 | \n", "4 | \n", "0 | \n", "4 | \n", "4.333333 | \n", "
19574 | \n", "20 | \n", "19 | \n", "108 | \n", "11 | \n", "3 | \n", "2 | \n", "2 | \n", "4.450000 | \n", "
19575 | \n", "10 | \n", "10 | \n", "55 | \n", "6 | \n", "1 | \n", "0 | \n", "1 | \n", "4.600000 | \n", "
19577 | \n", "15 | \n", "14 | \n", "74 | \n", "7 | \n", "3 | \n", "0 | \n", "1 | \n", "4.000000 | \n", "
19578 | \n", "22 | \n", "21 | \n", "109 | \n", "14 | \n", "2 | \n", "0 | \n", "1 | \n", "4.000000 | \n", "
15663 rows × 8 columns
\n", "\n", " | num_words | \n", "num_unique_words | \n", "num_chars | \n", "num_stopwords | \n", "num_punctuations | \n", "num_words_upper | \n", "num_words_title | \n", "mean_word_len | \n", "
---|---|---|---|---|---|---|---|---|
0 | \n", "41 | \n", "35 | \n", "231 | \n", "19 | \n", "7 | \n", "2 | \n", "3 | \n", "4.658537 | \n", "
4 | \n", "27 | \n", "25 | \n", "174 | \n", "11 | \n", "4 | \n", "0 | \n", "2 | \n", "5.481481 | \n", "
13 | \n", "15 | \n", "15 | \n", "86 | \n", "5 | \n", "2 | \n", "0 | \n", "3 | \n", "4.800000 | \n", "
20 | \n", "21 | \n", "20 | \n", "111 | \n", "11 | \n", "2 | \n", "0 | \n", "1 | \n", "4.333333 | \n", "
21 | \n", "44 | \n", "39 | \n", "252 | \n", "24 | \n", "4 | \n", "1 | \n", "2 | \n", "4.750000 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
19531 | \n", "7 | \n", "7 | \n", "53 | \n", "3 | \n", "1 | \n", "0 | \n", "1 | \n", "6.714286 | \n", "
19543 | \n", "16 | \n", "16 | \n", "93 | \n", "6 | \n", "2 | \n", "1 | \n", "2 | \n", "4.875000 | \n", "
19550 | \n", "12 | \n", "12 | \n", "59 | \n", "6 | \n", "2 | \n", "0 | \n", "1 | \n", "4.000000 | \n", "
19554 | \n", "21 | \n", "19 | \n", "126 | \n", "10 | \n", "2 | \n", "0 | \n", "1 | \n", "5.047619 | \n", "
19576 | \n", "13 | \n", "13 | \n", "68 | \n", "4 | \n", "2 | \n", "0 | \n", "2 | \n", "4.307692 | \n", "
3916 rows × 8 columns
\n", "\n", " | id | \n", "text | \n", "num_words | \n", "num_unique_words | \n", "num_chars | \n", "num_stopwords | \n", "num_punctuations | \n", "num_words_upper | \n", "num_words_title | \n", "mean_word_len | \n", "... | \n", "svd_word_13 | \n", "svd_word_14 | \n", "svd_word_15 | \n", "svd_word_16 | \n", "svd_word_17 | \n", "svd_word_18 | \n", "svd_word_19 | \n", "nb_cvec_eap | \n", "nb_cvec_hpl | \n", "nb_cvec_mws | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "id02310 | \n", "Still, as I urged our leaving Ireland with suc... | \n", "19 | \n", "19 | \n", "110 | \n", "9 | \n", "3 | \n", "1 | \n", "3 | \n", "4.842105 | \n", "... | \n", "-0.011837 | \n", "0.036064 | \n", "-0.016591 | \n", "-0.025580 | \n", "-0.018785 | \n", "0.031289 | \n", "-0.047220 | \n", "0.021018 | \n", "0.000595 | \n", "0.978387 | \n", "
1 | \n", "id24541 | \n", "If a fire wanted fanning, it could readily be ... | \n", "62 | \n", "49 | \n", "330 | \n", "33 | \n", "7 | \n", "1 | \n", "3 | \n", "4.338710 | \n", "... | \n", "-0.004397 | \n", "-0.000020 | \n", "-0.008583 | \n", "0.006335 | \n", "-0.004216 | \n", "0.001810 | \n", "0.001767 | \n", "0.999985 | \n", "0.000009 | \n", "0.000006 | \n", "
2 | \n", "id00134 | \n", "And when they had broken down the frail door t... | \n", "33 | \n", "30 | \n", "189 | \n", "15 | \n", "3 | \n", "0 | \n", "1 | \n", "4.757576 | \n", "... | \n", "0.006063 | \n", "-0.003324 | \n", "-0.009452 | \n", "0.013239 | \n", "0.004852 | \n", "-0.007478 | \n", "0.002786 | \n", "0.217325 | \n", "0.782527 | \n", "0.000148 | \n", "
3 | \n", "id27757 | \n", "While I was thinking how I should possibly man... | \n", "41 | \n", "34 | \n", "223 | \n", "19 | \n", "5 | \n", "2 | \n", "3 | \n", "4.463415 | \n", "... | \n", "0.004783 | \n", "-0.006865 | \n", "-0.007960 | \n", "0.006763 | \n", "0.002540 | \n", "-0.004558 | \n", "-0.000728 | \n", "0.753591 | \n", "0.246408 | \n", "0.000001 | \n", "
4 | \n", "id04081 | \n", "I am not sure to what limit his knowledge may ... | \n", "11 | \n", "11 | \n", "53 | \n", "6 | \n", "1 | \n", "1 | \n", "1 | \n", "3.909091 | \n", "... | \n", "-0.001825 | \n", "0.000123 | \n", "-0.006504 | \n", "0.002533 | \n", "-0.004004 | \n", "-0.002902 | \n", "0.000315 | \n", "0.970950 | \n", "0.021824 | \n", "0.007226 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
8387 | \n", "id11749 | \n", "All this is now the fitter for my purpose. | \n", "9 | \n", "9 | \n", "42 | \n", "7 | \n", "1 | \n", "0 | \n", "1 | \n", "3.777778 | \n", "... | \n", "0.000610 | \n", "-0.001496 | \n", "0.001063 | \n", "-0.001688 | \n", "-0.005155 | \n", "-0.003901 | \n", "0.001965 | \n", "0.565949 | \n", "0.106068 | \n", "0.327983 | \n", "
8388 | \n", "id10526 | \n", "I fixed myself on a wide solitude. | \n", "7 | \n", "7 | \n", "34 | \n", "4 | \n", "1 | \n", "1 | \n", "1 | \n", "4.000000 | \n", "... | \n", "-0.005626 | \n", "-0.006015 | \n", "-0.008673 | \n", "0.005675 | \n", "0.004995 | \n", "-0.004539 | \n", "-0.000283 | \n", "0.031598 | \n", "0.033443 | \n", "0.934959 | \n", "
8389 | \n", "id13477 | \n", "It is easily understood that what might improv... | \n", "25 | \n", "24 | \n", "150 | \n", "11 | \n", "2 | \n", "0 | \n", "1 | \n", "5.040000 | \n", "... | \n", "-0.006167 | \n", "0.000023 | \n", "-0.012848 | \n", "0.006857 | \n", "-0.005566 | \n", "0.006634 | \n", "0.011775 | \n", "0.999709 | \n", "0.000120 | \n", "0.000171 | \n", "
8390 | \n", "id13761 | \n", "Be this as it may, I now began to feel the ins... | \n", "38 | \n", "34 | \n", "197 | \n", "21 | \n", "3 | \n", "2 | \n", "3 | \n", "4.210526 | \n", "... | \n", "-0.010635 | \n", "-0.001895 | \n", "-0.004086 | \n", "-0.001623 | \n", "-0.002973 | \n", "-0.007277 | \n", "0.004486 | \n", "0.000689 | \n", "0.000005 | \n", "0.999307 | \n", "
8391 | \n", "id04282 | \n", "Long winded, statistical, and drearily genealo... | \n", "38 | \n", "33 | \n", "247 | \n", "18 | \n", "5 | \n", "0 | \n", "1 | \n", "5.526316 | \n", "... | \n", "-0.003923 | \n", "-0.006325 | \n", "-0.006186 | \n", "-0.007940 | \n", "0.009861 | \n", "0.017919 | \n", "-0.013286 | \n", "0.024705 | \n", "0.975287 | \n", "0.000008 | \n", "
8392 rows × 33 columns
\n", "\n", " | num_words | \n", "num_unique_words | \n", "num_chars | \n", "num_stopwords | \n", "num_punctuations | \n", "num_words_upper | \n", "num_words_title | \n", "mean_word_len | \n", "svd_word_0 | \n", "svd_word_1 | \n", "... | \n", "svd_char_10 | \n", "svd_char_11 | \n", "svd_char_12 | \n", "svd_char_13 | \n", "svd_char_14 | \n", "svd_char_15 | \n", "svd_char_16 | \n", "svd_char_17 | \n", "svd_char_18 | \n", "svd_char_19 | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "19 | \n", "19 | \n", "110 | \n", "9 | \n", "3 | \n", "1 | \n", "3 | \n", "4.842105 | \n", "0.024516 | \n", "-0.010185 | \n", "... | \n", "-0.077185 | \n", "-0.006559 | \n", "0.005616 | \n", "0.020747 | \n", "-0.065907 | \n", "0.053055 | \n", "0.040820 | \n", "-0.060882 | \n", "0.010134 | \n", "-0.003119 | \n", "
1 | \n", "62 | \n", "49 | \n", "330 | \n", "33 | \n", "7 | \n", "1 | \n", "3 | \n", "4.338710 | \n", "0.022294 | \n", "-0.011968 | \n", "... | \n", "0.016415 | \n", "0.011792 | \n", "0.013476 | \n", "0.083765 | \n", "-0.027404 | \n", "0.055122 | \n", "-0.067695 | \n", "-0.003033 | \n", "0.029863 | \n", "0.006638 | \n", "
2 | \n", "33 | \n", "30 | \n", "189 | \n", "15 | \n", "3 | \n", "0 | \n", "1 | \n", "4.757576 | \n", "0.016906 | \n", "-0.008934 | \n", "... | \n", "0.011926 | \n", "0.022561 | \n", "0.042856 | \n", "-0.001981 | \n", "0.058037 | \n", "-0.027065 | \n", "-0.004207 | \n", "0.026371 | \n", "-0.023764 | \n", "0.021925 | \n", "
3 | \n", "41 | \n", "34 | \n", "223 | \n", "19 | \n", "5 | \n", "2 | \n", "3 | \n", "4.463415 | \n", "0.013408 | \n", "-0.007515 | \n", "... | \n", "-0.042390 | \n", "-0.056576 | \n", "0.053759 | \n", "-0.026924 | \n", "0.070788 | \n", "0.000571 | \n", "-0.033236 | \n", "-0.071819 | \n", "0.017313 | \n", "-0.027750 | \n", "
4 | \n", "11 | \n", "11 | \n", "53 | \n", "6 | \n", "1 | \n", "1 | \n", "1 | \n", "3.909091 | \n", "0.012565 | \n", "-0.003185 | \n", "... | \n", "0.006790 | \n", "0.042263 | \n", "-0.018383 | \n", "-0.039678 | \n", "0.017349 | \n", "-0.002632 | \n", "-0.014737 | \n", "0.061780 | \n", "-0.055344 | \n", "0.054694 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
8387 | \n", "9 | \n", "9 | \n", "42 | \n", "7 | \n", "1 | \n", "0 | \n", "1 | \n", "3.777778 | \n", "0.006477 | \n", "-0.004209 | \n", "... | \n", "0.040577 | \n", "0.057288 | \n", "0.039300 | \n", "-0.064571 | \n", "0.013609 | \n", "0.037641 | \n", "-0.057116 | \n", "0.055621 | \n", "-0.021832 | \n", "0.040692 | \n", "
8388 | \n", "7 | \n", "7 | \n", "34 | \n", "4 | \n", "1 | \n", "1 | \n", "1 | \n", "4.000000 | \n", "0.011401 | \n", "-0.006803 | \n", "... | \n", "0.014870 | \n", "-0.046629 | \n", "0.011422 | \n", "-0.052563 | \n", "0.055345 | \n", "0.014132 | \n", "-0.028990 | \n", "-0.036881 | \n", "-0.017690 | \n", "0.022852 | \n", "
8389 | \n", "25 | \n", "24 | \n", "150 | \n", "11 | \n", "2 | \n", "0 | \n", "1 | \n", "5.040000 | \n", "0.024211 | \n", "-0.013763 | \n", "... | \n", "0.022989 | \n", "0.014606 | \n", "0.000173 | \n", "-0.044822 | \n", "-0.065277 | \n", "0.018879 | \n", "-0.007556 | \n", "0.019218 | \n", "0.024902 | \n", "-0.025109 | \n", "
8390 | \n", "38 | \n", "34 | \n", "197 | \n", "21 | \n", "3 | \n", "2 | \n", "3 | \n", "4.210526 | \n", "0.025443 | \n", "-0.013794 | \n", "... | \n", "-0.043491 | \n", "0.046127 | \n", "-0.041112 | \n", "-0.021125 | \n", "-0.030812 | \n", "0.020161 | \n", "0.039208 | \n", "-0.055308 | \n", "-0.040508 | \n", "-0.018515 | \n", "
8391 | \n", "38 | \n", "33 | \n", "247 | \n", "18 | \n", "5 | \n", "0 | \n", "1 | \n", "5.526316 | \n", "0.021897 | \n", "-0.010181 | \n", "... | \n", "-0.004355 | \n", "-0.021253 | \n", "0.018119 | \n", "0.031552 | \n", "-0.038812 | \n", "-0.014792 | \n", "0.052156 | \n", "0.022833 | \n", "-0.002455 | \n", "-0.028240 | \n", "
8392 rows × 57 columns
\n", "