u***t 发帖数: 3986 | 1 I have a huge recordset (50 million rows x N columns).
I am going to break the recordset into 100 smaller sub-recordsets of 500,000
rows each by setting the page size:
rs.PageSize = 500000
Now I have 100 pages in this recordset.
Questions:
How do I export these 100 pages into 100 Excel files? In other words, how do I
treat these 100 pages as 100 sub-recordsets?
I am using the method below to load the data into Excel:
objExcelWorkSheet.Range.CopyFromRecordset rs
Anybody?
* I don't want to process records individually; I must handle them as whole
recordsets.
Thanks! |
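The paging idea above (split one big row stream into fixed-size pages and export each page to its own file) can be sketched outside ADO. A minimal Python illustration, assuming CSV output instead of Excel and inventing the function names, since the original uses ADO/VBA:

```python
import csv
from itertools import islice

def iter_pages(rows, page_size):
    """Yield successive fixed-size pages (lists of rows) from any row iterator."""
    it = iter(rows)
    while True:
        page = list(islice(it, page_size))
        if not page:
            return
        yield page

def export_pages(rows, page_size, name_pattern="page_{:03d}.csv"):
    """Write each page to its own CSV file; return the filenames written."""
    files = []
    for i, page in enumerate(iter_pages(rows, page_size), start=1):
        fname = name_pattern.format(i)
        with open(fname, "w", newline="") as f:
            csv.writer(f).writerows(page)
        files.append(fname)
    return files
```

With `page_size=500_000` over a 50-million-row source this produces the 100 files the post asks for; the last page is simply shorter if the total is not an exact multiple.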
w*r 发帖数: 2421 | 2 Do not use that; paging a recordset with that many records will eat
a lot of memory in the VB engine.
You should consider this instead:
Use a loop with a counter to write the data directly to a CSV file.
When the counter reaches 500K, close the CSV file, reopen another one
with a new file handle, reset the counter to zero, and keep writing.
With single-record operations in the loop it will be slower; however, you
can optimize it further by using a smaller page size to optimize the data
fetch side (handled by ADO.NET automatically), then, in your own
program, using a string buffer to temporarily store text and writing to disk in
"chunks" by flushing the buffer to the file every 100 or 1,000
records. That will improve I/O efficiency.
In this case, fixed-size record text works better because you have better
control over how big the buffer is and can set up appropriate string/stream
writers in your VB code.
|
g***l 发帖数: 18555 | 3 Excel is not meant for storing data. Records in those quantities are exactly why people use databases; if Excel could do the job, wouldn't all the database vendors have gone out of business? |
i****a 发帖数: 36252 | 4 The question is: why do you want to have 100 Excel files with 500K records each?
|
g***l 发帖数: 18555 | 5 Because Excel has a 65,535-record limit.
|
i****a 发帖数: 36252 | 6 Excel 2010's limit is higher (1,048,576 rows per sheet).
But my question is what LZ wants to do with the records in Excel. There may be
a better way than 100 Excel files.
And since LZ asked on the DB board, I assume the data comes from a database. If
LZ has a real need for 100 Excel files, exporting directly from the
database is more efficient.
|
g***l 发帖数: 18555 | 7 Just export to CSV or Access and be done with it. The problem is, with data that big, how are you going to query it? What are they even looking for? They probably don't know themselves. Using Excel to back up millions of rows is a bit ridiculous. |
u***t 发帖数: 3986 | 8 Excel is used to store data from records returned by queries provided by our offshore
resource; after the recordset is written to a file, the buffer is released.
Sequential looping is far slower to execute. That's why I want to
process the comparison using recordsets.
The reason I don't export directly from the DB to a file is that only data
matching certain criteria is needed from the source DB...
Bottom line: looping isn't efficient... |
u***t 发帖数: 3986 | 9 500,000 is the optimal maximum number of rows for one Excel file to hold. Get 500K from the
source, then get 500K from the target, then use a query to compare them and write the
mismatches to a third Excel file; that takes 10 minutes.
With a loop over the source, it takes a whole day.
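The chunked comparison described here (pull a matching page from source and target, diff them, write out the mismatches) can be sketched in Python. The key column and tuple layout are assumptions for illustration; the real data shape is not given in the thread:

```python
def compare_chunks(source_rows, target_rows, key_index=0):
    """Compare two row chunks keyed by the column at `key_index`.
    Return rows missing from one side or differing between sides,
    each tagged with the kind of mismatch."""
    src = {row[key_index]: row for row in source_rows}
    tgt = {row[key_index]: row for row in target_rows}
    mismatches = []
    for key, row in src.items():
        if key not in tgt:
            mismatches.append(("missing_in_target", row))
        elif tgt[key] != row:
            mismatches.append(("differs", row, tgt[key]))
    for key, row in tgt.items():
        if key not in src:
            mismatches.append(("missing_in_source", row))
    return mismatches
```

Dictionary lookups make each 500K-vs-500K page comparison a single pass over each side, which is the speedup over record-by-record looping that the post is after.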
|
g***l 发帖数: 18555 | 10 This kind of thing would make the professionals die laughing. |
u***t 发帖数: 3986 | 11 I have yet to see a solution from you, pro!
【在 g***l 的大作中提到】: This kind of thing would make the professionals die laughing.
|
w*r 发帖数: 2421 | 12 Totally confused... source and target are in different DB instances??? Then you
should consider loading them into the same DB instance and using the DB engine to do the
comparison.
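A minimal sketch of that idea, using SQLite as a stand-in DB engine (the thread never names the actual databases, and the two-column schema is invented for illustration): load both sides into one instance and let a set operation find the differences.

```python
import sqlite3

def db_compare(source_rows, target_rows):
    """Load both row sets into one DB instance and use EXCEPT to find
    rows present on one side but not the other."""
    con = sqlite3.connect(":memory:")
    con.execute("CREATE TABLE source (id INTEGER, val TEXT)")
    con.execute("CREATE TABLE target (id INTEGER, val TEXT)")
    con.executemany("INSERT INTO source VALUES (?, ?)", source_rows)
    con.executemany("INSERT INTO target VALUES (?, ?)", target_rows)
    only_in_source = con.execute(
        "SELECT * FROM source EXCEPT SELECT * FROM target ORDER BY id"
    ).fetchall()
    only_in_target = con.execute(
        "SELECT * FROM target EXCEPT SELECT * FROM source ORDER BY id"
    ).fetchall()
    con.close()
    return only_in_source, only_in_target
```

Once both tables live in the same engine, the entire 50-million-row comparison is two queries, with no Excel files and no client-side paging at all.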
|
u***t 发帖数: 3986 | 13 Use the same queries from the offshore developers to clone a second DB schema for
testing purposes; would that work?!
|
g***l 发帖数: 18555 | 14 Just take a BACKUP, ZIP it, download it, and RESTORE it; done. Instead you want Excel and importing multiple files, with errors popping up here and there. Laughable.
|
u***t 发帖数: 3986 | 15 The Dev queries are based on business rules.
The Testing queries are also based on business rules.
My point is that using the Dev queries doesn't serve the purpose of data validation.
|
B*****g 发帖数: 34098 | 16 Can you write in Chinese?
|
g***l 发帖数: 18555 | 17 What isn't based on business rules? We're talking about data here. How can you use Excel for bulk data transfer?
Excel is not a database, and multiple files are just asking for trouble.
|